Resource list
clssifier
- Includes 100 built-in data points with two classes and two attributes; a linear classifier based on LDA.
cuba psm
- Need to update to function better
cubalagi (with function)
- Multi-agent code for MATLAB. The code still needs updating to work better.
Principal component and factor analysis
- Principal component analysis is a method of multivariate statistical analysis that describes samples with a smaller number of features, thereby reducing the dimensionality of the feature space.
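Resampling and the bootstrap
- When data are sampled from a non-normal distribution (for example an unknown or mixed distribution), when the sample size is too small, when outliers are present, or when designing a suitable statistical test from a theoretical distribution is too complex and mathematically intractable, statistical methods based on randomization and resampling become useful.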
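
To illustrate the resampling idea described in this entry, here is a minimal sketch of a percentile bootstrap confidence interval, using only the Python standard library. It is not taken from the listed resource; the function name `bootstrap_ci`, the toy data, and all parameter defaults are assumptions made for illustration.

```python
import random
import statistics

def bootstrap_ci(sample, stat=statistics.median, n_resamples=2000,
                 alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for stat(sample).

    Hypothetical helper: resamples with replacement, recomputes the
    statistic each time, and reads the interval off the sorted estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n = len(sample)
    estimates = []
    for _ in range(n_resamples):
        # Draw n observations with replacement from the original sample.
        resample = [sample[rng.randrange(n)] for _ in range(n)]
        estimates.append(stat(resample))
    estimates.sort()
    lo_idx = int((alpha / 2) * n_resamples)
    hi_idx = int((1 - alpha / 2) * n_resamples) - 1
    return estimates[lo_idx], estimates[hi_idx]

# Small made-up sample containing one outlier (9.8).
data = [2.1, 2.4, 2.2, 9.8, 2.3, 2.6, 2.0, 2.5]
low, high = bootstrap_ci(data)
print(f"95% bootstrap interval for the median: [{low}, {high}]")
```

The median is used here because it is less sensitive to the outlier than the mean; any other statistic can be passed through the `stat` argument.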
Analysis of variance
- Analysis of variance (ANOVA), also called variance analysis, was developed by R. A. Fisher and is used to test the significance of differences among the means of two or more samples.
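
As a rough illustration of the test described in this entry, the sketch below computes the one-way ANOVA F statistic by hand in plain Python. It is not the code from the listed resource; the function name `one_way_anova_f` and the toy groups are assumptions. It returns only the F statistic and degrees of freedom, since turning F into a p-value needs an F-distribution routine that the standard library does not provide.

```python
def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    Hypothetical helper: `groups` is a list of lists of numbers,
    one inner list per sample.
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, means))
    # Variation of observations around their own group mean.
    ss_within = sum((x - m) ** 2
                    for g, m in zip(groups, means) for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Three made-up samples of three observations each.
f_stat, df_b, df_w = one_way_anova_f([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
print(f_stat, df_b, df_w)
```

A large F value indicates that the spread between group means is large relative to the spread within groups.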
Advanced methods for handling missing data
- Advanced methods for handling missing data in exploratory data analysis.
Generalized linear models
- This model takes the linear predictor function of the independent variables as the estimate of the dependent variable.
python-logistic
- For a binary (0-1) dependent variable, uses the logistic algorithm for regression prediction.
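
The kind of 0-1 prediction this entry describes can be sketched as below. This is not the listed resource's code: it is a minimal logistic regression fitted by batch gradient descent in plain Python, and the names `fit_logistic` and `predict`, the learning rate, the epoch count, and the toy data are all assumptions for illustration.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic(xs, ys, lr=0.5, epochs=3000):
    """Fit weights and bias by batch gradient descent on the log-loss."""
    n = len(xs)
    n_features = len(xs[0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(xs, ys):
            # Prediction error for this observation.
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) - y
            for j in range(n_features):
                grad_w[j] += err * x[j]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict(w, b, x):
    """Return 1 if the estimated probability is at least 0.5, else 0."""
    return 1 if sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) >= 0.5 else 0

# Made-up one-feature data: small values are class 0, large values class 1.
xs = [[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]]
ys = [0, 0, 0, 1, 1, 1]
w, b = fit_logistic(xs, ys)
print(predict(w, b, [1.0]), predict(w, b, [3.5]))
```

In practice a library implementation with regularization would usually be preferred; the loop above is only meant to show the mechanics.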
python-knn
- Uses Python and the KNN algorithm to classify spam email.
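
A minimal sketch of KNN classification in the spirit of this entry follows. It does not reproduce the listed resource: the function name `knn_predict`, the two toy features (assumed here to be counts of promotional words and of exclamation marks per message), and the labels are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train_x, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest neighbours."""
    # Pair each training point's Euclidean distance with its label.
    dists = sorted((math.dist(x, query), y)
                   for x, y in zip(train_x, train_y))
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]

# Hypothetical feature vectors: (promotional words, exclamation marks).
train_x = [(5, 3), (4, 4), (6, 2), (0, 0), (1, 0), (0, 1)]
train_y = ["spam", "spam", "spam", "ham", "ham", "ham"]

label_a = knn_predict(train_x, train_y, (5, 2))
label_b = knn_predict(train_x, train_y, (1, 1))
print(label_a, label_b)
```

An odd `k` avoids tied votes when there are two classes; real spam filtering would need proper text feature extraction before this step.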
GAM
- Uses R to fit a generalized additive model (GAM) for regression prediction.
House_price
- A simple descriptive statistical analysis of the relationship between the dependent variable (second-hand house price) and its related variables.