资源列表
处理缺失数据的高级方法
- 数据探索分析中处理缺失数据的高级处理方法(Advanced processing methods for missing data processing in data discovery analysis)
方差分析
- 方差分析又称“变异数分析”,是R.A.Fisher发明的,用于两个及两个以上样本均数差别的显著性检验(ANOVA, also called variance analysis, was invented by R.A.Fisher, which was used to test the significance of the mean difference between two and more than two samples)
重抽样与自助法
- 当数据抽样于非正态分布时,如未知或混合分布、样本量过小、存在离群点、基于理论分布设计合适的统计检验过于复杂且数学上难以处理等情况,这时基于随机化和重抽样的统计方法可派上用场。(When the sampling data in non normal distribution, such as the unknown or mixed distribution, the sample size is too small, there are outliers, based on the theor
主成分和因子分析
- 主成分分析是多元统计分析中用来分析数据的一种方法,它是用一种较少数量的特征对样本进行描述以达到降低特征空间维数的方法(Principal component analysis is a method of data used in multivariate statistical analysis, it is describing the samples with characteristics of a small number of methods to reduce the dimens
cubalagi (with function)
- MultiAgent for matlab. Still need to update the code to function better.
cuba psm
- Need to update to function better
clssifier
- 自带100个数据点,二分类,二属性,基于lda分类(Linear classifier based on LDA)
MATLAB的数值分析
- 本书以matlab为平台,介绍了数值分析预图形可视化,内容涉及matlab介绍,数学分析的数值基础,数值方法在工程科学数学问题中的应用以及绘图等内容。(This book is based on MATLAB, introduces the numerical analysis of pre visualization, covering MATLAB, numerical basis of mathematical analysis, numerical method in Mathemati
Zhihu_voters-master
- 爬知乎数据,转载某博客,用于投票信息获取,亲测可用。(python-Zhihu_voters-master)
cluster
- 快速搜索与发现密度峰值聚类方法来确定聚类中心(Clustering by fast search and find of density peaks)
apcluster.m
- ap算法完成ap聚类操作 需要输入参数为数据集 偏向参数 输出结果为聚类数目(The AP algorithm completes the AP clustering operation, the input parameter is the data set bias parameter, and the output result is the number of clusters)
kyfu
- 摩拜大赛数据挖掘大赛2017,简短示意代码(Mobell data mining contest contest 2017, a brief sketch of code)