资源列表
regress
- 一个xgboost实现的回归模型预测,数据集来源于kaggle的taxi竞赛(Regression model prediction based on a xgboost implementation)
Crawler.tar
- 利用了python3.5编写了一个爬虫,爬取豆瓣上电影《声之形》的评论,并统计评论词的频率,制作了词云(Using python3.5 to write a crawler, climb the comments on the movie "sound shape", and statistics the frequency of the comment word, making the word cloud)
kmeans
- 对数据和图像进行聚类分析,k-means聚类方法多应用于模式识别,人工智能,机器学习等方面(Clustering analysis of data and images, K-means clustering method should be used in pattern recognition, artificial intelligence, machine learning and so on)
cc
- 可以执行矩阵的相关性子矩阵挖掘,代码开始部分的备注里包含实例矩阵,大家可以实验看看,代码原创,实验可以,但是如果用在商业或者学术里,请和我联系~(Relative matrix mining of matrices can be performed)
K_Means
- K-Means是聚类算法中的一种,其中K表示类别数,Means表示均值。顾名思义K-Means是一种通过均值对数据点进行聚类的算法。K-Means算法通过预先设定的K值及每个类别的初始质心对相似的数据点进行划分。并通过划分后的均值迭代优化获得最优的聚类结果。(K-Means is one of the clustering algorithms, in which K represents the number of classes, and Means means the mean. As t
emd1d
- 用python写的EMD分解,可以实现一维与二维分解(EMD decomposition written in Python)
天气爬虫
- 爬取各个地区近8年的天气历史数据,大家可以帮忙看看还有什么可以优化的。(Climb the historical weather)
Clustering-master
- 超级强大的聚类算法+详细的程序说明; Kmeans聚类+ISODATA聚类算法;(Super powerful clustering algorithm + detailed program descr iption; Kmeans clustering +ISODATA clustering algorithm;)
House_price
- 主要是对二手房房价的因变量房价和其相关的因变量之间的关系进行简单的描述统计分析(Mainly for the second-hand house price dependent variable housing prices and its related variables of the relationship between the simple descr iption of statistical analysis)
python-knn
- 主要利用Python软件,利用KNN算法对垃圾邮件进行分类(This paper mainly uses Python software to classify spam mail by using KNN algorithm)
python-logistic
- 对因变量为0-1属性变量,利用Logistic算法,对其进行回归预测(The dependent variable is the 0-1 attribute variable, and the Logistic algorithm is used to predict it)
广义线性模型
- 这种模型是把自变量的线性预测函数当作因变量的估计值(The model takes the linear prediction function of the independent variable as the estimate of dependent variable)