Search resource list
K_CenterPoint_PAM
- The k-medoids algorithm, also known as PAM (Partitioning Around Medoids). A widely used cluster analysis technique in data mining.
kmeans
- k-means clustering method. Given a data set of n objects, a partitioning clustering technique divides the data into k partitions, each representing one cluster, where k is less than or equal to n.
k-means
- K-means algorithm. Name the data matrix data and set the number of clusters k; the code can then cluster multi-dimensional data.
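A minimal Python sketch of the k-means procedure these entries describe, assuming a data matrix named data (a list of equal-length numeric rows) and a cluster count k. It is not the code from any listed package. The function name kmeans, the iterations cap, and the choice of the first k rows as initial centroids are illustrative assumptions made to keep the result deterministic.

```python
from math import dist


def kmeans(data, k, iterations=100):
    """Partition the rows of `data` into k clusters (plain Lloyd iteration)."""
    # Assumption: seed the centroids with the first k rows so runs are repeatable.
    centroids = [tuple(row) for row in data[:k]]
    labels = [0] * len(data)
    for _ in range(iterations):
        # Assignment step: each row joins the cluster of its nearest centroid.
        labels = [min(range(k), key=lambda c: dist(row, centroids[c]))
                  for row in data]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [row for row, lab in zip(data, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                # An empty cluster keeps its previous centroid.
                new_centroids.append(centroids[c])
        if new_centroids == centroids:
            break  # centroids stopped moving, so the partition is stable
        centroids = new_centroids
    return labels, centroids


data = [(0, 0), (10, 10), (0, 1), (10, 11), (1, 0), (11, 10)]
labels, centroids = kmeans(data, 2)
```

With this sample data the two groups of points are far apart, so the rows alternate between the two clusters. Real implementations usually pick the initial centroids randomly or with a seeding heuristic instead of taking the first k rows.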
isodata-cluster
- Source code for the ISODATA (Iterative Self-Organizing Data Analysis) clustering algorithm. It runs directly and gives good results.
Clustering-master
- An agglomerative clustering algorithm implemented in Java. It runs directly, and the data output can be imported and exported.
lightlda-master
- LightLDA is a distributed system for large-scale topic modeling. It implements a distributed sampler that enables very large data sizes and models. LightLDA improves sampling throughput and convergence speed via a fast O(1) Metropolis-Hastings algori
DBScan03
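- A Java implementation of the DBSCAN algorithm. DBSCAN is a density-based clustering algorithm built on the concept of a core point: a point is a core point if at least MinP points lie within distance e of it. A core point and the neighbors within its e range form a cluster. If several points within one cluster are core points, the clusters centered on those core points are merged. The program outputs the clusters it finds and their data points.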
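The core-point and merging rules in the description above can be sketched in Python as follows. This is an illustration only, not the Java code from the listed package. The names dbscan, e, and min_p are assumptions, and so is the choice not to count a point as its own neighbor (some formulations do count it).

```python
from math import dist


def dbscan(points, e, min_p):
    """Label each point with a cluster id; None marks points in no cluster."""
    n = len(points)
    # neighbors[i] holds the indices of all other points within distance e of i.
    neighbors = [[j for j in range(n)
                  if j != i and dist(points[i], points[j]) <= e]
                 for i in range(n)]
    # A core point has at least min_p neighbors inside its e range.
    is_core = [len(nb) >= min_p for nb in neighbors]
    labels = [None] * n
    cluster_id = 0
    for i in range(n):
        if not is_core[i] or labels[i] is not None:
            continue
        # Grow a cluster from this core point. Reaching another core point
        # pulls its neighborhood in too, which merges overlapping clusters.
        labels[i] = cluster_id
        stack = [i]
        while stack:
            current = stack.pop()
            for j in neighbors[current]:
                if labels[j] is None:
                    labels[j] = cluster_id
                    if is_core[j]:
                        stack.append(j)
        cluster_id += 1
    return labels


points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (50, 50)]
print(dbscan(points, e=1.5, min_p=3))
# prints [0, 0, 0, 0, 1, 1, 1, 1, None]
```

The brute-force neighbor search is quadratic in the number of points, which is fine for a small demonstration; larger data sets usually call for a spatial index.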
maxmin
- An implementation of a clustering problem in data mining. It performs cluster analysis on simple data using max-min clustering.
semi-supervised-cluster-algorithm
- Semi-supervised clustering uses a small amount of labeled data to improve the performance of clustering algorithms. The paper reviews several advances in semi-supervised clustering algorithms.
ecoli
- Clustering groups data objects into multiple clusters such that any two objects within the same cluster have high similarity, while two objects from different clusters have high dissimilarity. Dissimilarity can be computed from the attribute values that describe the objects; the distance between objects is the most commonly used measure.
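As a small illustration of computing dissimilarity from attribute values, the Python sketch below shows two common distance measures. The entry does not say which distance the package uses, so Euclidean and Manhattan distance are assumptions, as are the function names.

```python
from math import sqrt


def euclidean(a, b):
    """Straight-line distance between two attribute vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of absolute attribute differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


print(euclidean((0, 0), (3, 4)))  # prints 5.0
print(manhattan((0, 0), (3, 4)))  # prints 7
```

A smaller value means the two objects are more alike, so objects in the same cluster should be close under the chosen measure.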
Cluster
- The K-means clustering algorithm commonly used in machine learning and data mining. The package contains two files: kmeans.py is the Python implementation and bank-data.csv is the test data.
parzen
- This code applies nonparametric estimation to estimate the distribution function of a set of data whose distribution is unknown.
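A minimal Python sketch of nonparametric density estimation, assuming the entry name refers to the Parzen window method. The one-dimensional setting, the Gaussian window, and the names parzen_density and h (the window width) are illustrative assumptions and are not taken from the listed code.

```python
from math import exp, pi, sqrt


def parzen_density(x, samples, h):
    """Estimate the density at x by averaging a Gaussian window over samples."""
    total = 0.0
    for s in samples:
        u = (x - s) / h  # distance to the sample, scaled by the window width
        total += exp(-0.5 * u * u) / (sqrt(2 * pi) * h)
    return total / len(samples)


samples = [1.0, 1.2, 0.8, 5.0, 5.1]
near = parzen_density(1.0, samples, h=0.5)  # inside a dense group of samples
far = parzen_density(3.0, samples, h=0.5)   # in the gap between the groups
```

Here near should come out larger than far, because more samples fall close to x = 1.0. The window width h controls smoothing: a small h follows individual samples, while a large h blurs them together.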
Dbscan
- DBSCAN clustering source code. It can cluster multi-dimensional data.