Resource list
emd1d
- Empirical mode decomposition (EMD) implemented in Python; supports both one-dimensional and two-dimensional decomposition.
Weather crawler
- Crawls historical weather data for various regions over the past 8 years. Suggestions on what else could be optimized are welcome.
Clustering-master
- Clustering algorithms with detailed program documentation: K-means clustering and the ISODATA clustering algorithm.
House_price
- Simple descriptive statistical analysis of the relationship between the dependent variable, the price of second-hand housing, and its related variables.
GAM
- Fits generalized additive models (GAM) in R for regression prediction.
python-knn
- Classifies spam email in Python using the KNN algorithm.
python-logistic
- Regression prediction for a binary (0/1) dependent variable using the logistic algorithm.
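Generalized linear models
- These models estimate the dependent variable from a linear predictor function of the independent variables; in the general case the linear predictor is mapped to the expected value of the dependent variable through a link function.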
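The idea in this entry can be sketched in a few lines of Python. This is only an illustration, not code from the listed resource: the coefficients, the input vector, and the function names `linear_predictor` and `glm_predict` are hypothetical, and no model fitting is shown. It computes the linear predictor and turns it into a prediction with three common inverse link functions.

```python
import math

def linear_predictor(x, coef, intercept):
    """eta = intercept + sum_j coef[j] * x[j]."""
    return intercept + sum(b * xi for b, xi in zip(coef, x))

# Inverse link functions: map the linear predictor to the predicted mean.
INVERSE_LINKS = {
    "identity": lambda eta: eta,                         # ordinary linear regression
    "logit": lambda eta: 1.0 / (1.0 + math.exp(-eta)),   # logistic regression
    "log": lambda eta: math.exp(eta),                    # Poisson regression
}

def glm_predict(x, coef, intercept, link="identity"):
    """Predicted mean of the dependent variable for one observation."""
    return INVERSE_LINKS[link](linear_predictor(x, coef, intercept))

# Hypothetical coefficients and observation, chosen so that eta = 0.
coef, intercept = [0.5, -0.25], 0.0
x = [2.0, 4.0]

pred_identity = glm_predict(x, coef, intercept, "identity")
pred_logit = glm_predict(x, coef, intercept, "logit")
pred_log = glm_predict(x, coef, intercept, "log")
```

With the identity link the prediction is the linear predictor itself, which is the case the description refers to; the other two links give logistic and Poisson regression.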
Advanced methods for handling missing data
- Advanced methods for handling missing data in exploratory data analysis.
Analysis of variance
- Analysis of variance (ANOVA), also called variance analysis, was invented by R. A. Fisher. It tests the significance of differences among the means of two or more samples.
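As a rough illustration of the test described in this entry, the following Python sketch computes the one-way ANOVA F statistic from its definition (between-group mean square divided by within-group mean square). The function name `one_way_anova_f` and the three groups are made up for the example; the p-value is not computed here, since that needs the F distribution (available, for instance, through `scipy.stats.f_oneway`).

```python
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA."""
    k = len(groups)                       # number of groups
    n = sum(len(g) for g in groups)       # total number of observations
    grand_mean = sum(sum(g) for g in groups) / n

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of the observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical data: three samples of three observations each.
groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [5.0, 6.0, 7.0]]
f_stat, df_b, df_w = one_way_anova_f(groups)
```

A large F relative to the critical value of the F distribution with `df_b` and `df_w` degrees of freedom suggests that at least one group mean differs from the others.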
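Resampling and the bootstrap
- Statistical methods based on randomization and resampling come in handy when the data are sampled from a non-normal distribution (for example an unknown or mixed distribution), when the sample size is too small, when outliers are present, or when designing a suitable statistical test from a theoretical distribution is too complex and mathematically intractable.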
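A minimal sketch of one resampling method, the percentile bootstrap, may make this entry more concrete. It is not taken from the listed resource: the function name `bootstrap_ci`, the sample values, and the defaults (2000 resamples, a 95% interval, a fixed seed) are assumptions chosen for illustration.

```python
import random
from statistics import mean

def bootstrap_ci(data, stat=mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for stat(data)."""
    rng = random.Random(seed)             # fixed seed keeps the sketch reproducible
    n = len(data)
    # Resample with replacement and recompute the statistic each time.
    estimates = sorted(
        stat(rng.choices(data, k=n)) for _ in range(n_resamples)
    )
    lower = estimates[int((alpha / 2) * n_resamples)]
    upper = estimates[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper

# Hypothetical small sample containing one outlier (9.8).
sample = [2.1, 2.4, 2.2, 9.8, 2.0, 2.3, 2.5, 2.2]
low, high = bootstrap_ci(sample)
```

No distributional assumption is made: the interval comes entirely from the empirical distribution of the resampled statistic, which is why the approach suits small or outlier-laden samples.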
Principal component and factor analysis
- Principal component analysis is a data analysis method from multivariate statistics. It describes the samples with a smaller number of features in order to reduce the dimensionality of the feature space.
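The dimensionality reduction described in this entry can be sketched with the standard library alone. The example below is an illustration under stated assumptions, not the method of the listed resource: it finds only the first principal component, uses power iteration on the sample covariance matrix (a swapped-in technique that avoids a linear algebra library), and assumes the starting vector is not orthogonal to that component. The function name and the data are hypothetical.

```python
import math

def first_principal_component(rows, n_iter=200):
    """Return (direction, scores) of the first principal component."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]

    # Sample covariance matrix (d x d).
    cov = [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]

    # Power iteration: repeatedly multiply by cov and renormalize.
    # Assumes the start vector is not orthogonal to the leading eigenvector.
    v = [1.0] * d
    for _ in range(n_iter):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]

    # Project each centered sample onto the direction: d features become one.
    scores = [sum(c[j] * v[j] for j in range(d)) for c in centered]
    return v, scores

# Hypothetical 2-D data lying on the line y = 2x.
rows = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]]
direction, scores = first_principal_component(rows)
```

Each two-feature sample is replaced by a single score along `direction`, which is the reduction of the feature space the description mentions. For real work a library routine such as `sklearn.decomposition.PCA` is the more usual choice.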