资源列表
EM
- 对于混合高斯分布的情况,使用最大期望算法,通过不断计算每个样本的均值与方差,使得似然函数达到最大值。可以很好地处理满足一定概率分布的数据。 代码中通过mvnrnd()函数,设定其中的参数,产生符合混合高斯分布的一组数据集。-For the case of a mixed Gaussian distribution, using expectation-maximization algorithm, through continuous calculation of the mean and
zh_FFT
- 经典的数据处理技术,福利叶变换,研究数据在频域里的特性。-Classic data processing technology, the welfare transform, study the characteristics of data in frequency domain.
pca11
- 使用matlab编程实现PCA算法,此算法经过测试,没有问题,可以放心使用-Use matlab programming PCA algorithm that has been tested, no problem, you can rest assured that use
w27
- 适合处理海量数据的python源码,可读入txt和csv 格式文件。-Suitable for massive data processing python source, read into txt and csv Format files.
minimum_spanning_tree
- 本程序是基于最小洗漱树的分类器 可得到分类效果不错的分类器-This procedure is based on a minimum wash tree classifier good classification results obtained classifier
GAKMeans
- 由于Kmeans聚类分析是一个局部的搜索过程,因此加入遗传算法进行全局搜索选择最优的初始中心点使得Kmeans算法产生较大的改进-Since Kmeans Cluster analysis is a local search process, so join a global search for the genetic algorithm to the optimal initial centers such Kmeans algorithm produces greater improve
AKPCA
- 南京大学数据挖掘研究所提出的主动学习算法,对于改进KPCA有着重要的借鉴作用。-The active learning algorithm proposed data mining research institute of nanjing university, has an important reference for improving the KPCA.
k_nn
- kNN的思想:计算待分类的数据点与训练集所有样本点,取距离最近的k个样本;统计这k个样本的类别数量;根据多数表决方案,取数量最多的那一类作为待测样本的类别。距离度量可采用Euclidean distance,Manhattan distance和cosine。-kNN The idea is simple: the training set and calculated data points to be classified all sample points taken the neare
meanshift
- Clustering using Meanshift Algorithm
CF
- 这是用matlab写的协同滤波算法主程序,程序简单,易于理解。可以应用于推荐系统-It is used to write collaborative filtering algorithm matlab main program, the program is simple and easy to understand. Recommended system can be applied。。。。。。
FJIR.R
- 模糊聚类分析 - 建立模糊相似矩阵,以R语言实现。-build a fuzzy matrix
PCA
- 主成分分析法PCA,matlab源码,欢迎下载。-Principal component analysis PCA, matlab source code, welcome to download.