Resource list
Maltab
- MATLAB source code for a variety of classic data mining algorithms, especially suited to people who understand the principles but cannot write the code and want to do data modeling.
classification
- Data Mining and Classification for NLP
ROC
- ROC, a method for evaluating binary classification models, described here as a new evaluation method; implemented in R.
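
As a rough illustration of what ROC evaluation of a binary classifier involves, the sketch below sweeps a threshold over the scores and integrates the resulting curve. It is written in Python, not R, and is not the code in the archive; the function names `roc_points` and `auc` are hypothetical, and the sketch assumes labels are 0 or 1 with at least one of each.

```python
def roc_points(labels, scores):
    """Return the (false positive rate, true positive rate) points of the ROC curve."""
    pos = sum(1 for y in labels if y == 1)
    neg = len(labels) - pos  # assumption: both classes are present
    # Rank samples from the highest score to the lowest.
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        # Samples with the same score cross the threshold together.
        j = i
        while j < len(ranked) and ranked[j][0] == ranked[i][0]:
            if ranked[j][1] == 1:
                tp += 1
            else:
                fp += 1
            j += 1
        points.append((fp / neg, tp / pos))
        i = j
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


if __name__ == "__main__":
    labels = [1, 1, 0, 1, 0, 0]
    scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
    print(auc(roc_points(labels, scores)))
```

An area of 1.0 corresponds to a ranking that places every positive above every negative, while 0.5 is what a random ranking would give on average.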
yuce
- Forecasting algorithm for a first-order (single) exponential forecasting model, implemented in C++.
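
For readers unfamiliar with first-order exponential forecasting, a minimal sketch of single exponential smoothing follows. It is in Python and is not the C++ code in the archive; the function names and the choice to initialize the smoothed series with the first observation are assumptions made for illustration.

```python
def exponential_smoothing(series, alpha):
    """Single exponential smoothing: S_t = alpha * x_t + (1 - alpha) * S_{t-1}."""
    if not series:
        raise ValueError("series must not be empty")
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = [series[0]]  # assumption: start from the first observation
    for x in series[1:]:
        smoothed.append(alpha * x + (1 - alpha) * smoothed[-1])
    return smoothed


def forecast_next(series, alpha):
    """The one-step-ahead forecast is the last smoothed value."""
    return exponential_smoothing(series, alpha)[-1]


if __name__ == "__main__":
    print(forecast_next([10, 12, 14], alpha=0.5))
```

A larger `alpha` weights recent observations more heavily; a smaller one produces a smoother, slower-reacting forecast.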
datamining-sequentialpatterns-master
- Implementation of the PrefixSpan algorithm in C#.
Read_LibSVM_files_in_R
- Read LibSVM-format files in R. LIBSVM is a support vector machine library whose sparse text format is used to distribute many data sets, including ones from the UCI repository. This code converts such data sets for use in R.
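
To show what reading this format involves, here is a small sketch of parsing one line of the LIBSVM sparse text format, `<label> <index>:<value> ...`, with 1-based feature indices. It is in Python, not R, and is not the code in the archive; `parse_libsvm_line` and `to_dense` are hypothetical helper names.

```python
def parse_libsvm_line(line):
    """Parse '<label> <index>:<value> ...' into (label, {index: value})."""
    tokens = line.split()
    label = float(tokens[0])
    features = {}
    for token in tokens[1:]:
        index, value = token.split(":", 1)
        features[int(index)] = float(value)
    return label, features


def to_dense(features, n_features):
    """Expand a sparse {index: value} mapping into a dense row of length n_features."""
    row = [0.0] * n_features
    for index, value in features.items():
        row[index - 1] = value  # feature indices in the file start at 1
    return row


if __name__ == "__main__":
    label, features = parse_libsvm_line("+1 1:0.5 3:2")
    print(label, to_dense(features, 4))
```

Features that do not appear on a line are implicitly zero, which is why the dense row is filled with zeros first.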
Combination_prediction
- Combination forecasting models: one that combines five individual models and one that combines two individual models.
discriminant-analysis
- Methods for determining which class a sample belongs to, mainly Fisher discriminant analysis, naive Bayes discrimination, and distance-based discrimination.
outlier
- Outlier detection algorithms for MATLAB, including distance-based and LOF (local outlier factor) algorithms.
Cluster
- The K-means clustering algorithm commonly used in machine learning and data mining. Contains two files: kmeans.py is the Python implementation and bank-data.csv is the test data.
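
The following is a minimal sketch of the K-means procedure itself (assign each point to its nearest centroid, then recompute the centroids), not the contents of kmeans.py. The function name, the naive choice of the first `k` points as initial centroids, and the iteration cap are all assumptions made for illustration.

```python
def kmeans(points, k, iterations=100):
    """Cluster points (tuples of numbers) into k groups; returns (centroids, clusters)."""
    if not 0 < k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    centroids = [tuple(p) for p in points[:k]]  # assumption: naive initialization
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = [
            tuple(sum(dim) / len(cluster) for dim in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break  # assignments can no longer change
        centroids = new_centroids
    return centroids, clusters


if __name__ == "__main__":
    centroids, clusters = kmeans([(0, 0), (10, 10), (0, 1), (10, 11)], k=2)
    print(centroids)
```

Practical implementations usually pick the initial centroids at random (or with a scheme such as k-means++) and rerun several times, since the result depends on the starting centroids.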
prog-hive-1st-ed-data
- Source code for the Hive Programming Guide (Hive编程指南), including data sources such as stock information. Because I also have to spend points to download resources, I set the price at one point.
Hadoop-data-find
- Implementations of data mining algorithms on Hadoop using MapReduce.