资源列表
Drools_CleanData
- 使用Drools规则引擎,自定义规则来进行脏数据清洗的实例-Use the Drools rules engine to customize rules for dirty data cleaning instances
frequency_filters
- 本人在实际项目中使用的各种滤波算法,实践证明可用,效果好。-Use a variety of filters in their own projects to verify the good use.
python-fp-growth-master
- source code for fp_grouth algorithm by paython
Hadoop-data-find
- Hadoop数据挖掘算法 在mapreduce中的实现-Hadoop data mining algorithms implemented in the mapreduce
prog-hive-1st-ed-data
- Hive编程指南源代码,里面含有数据源,例如股票的信息。因为自己平时也要用积分下载资源,所以设置了一分。-Hive Programming Guide source code, which contains the data source, such as stock information. Because they usually have to use points to download resources, so we set up a point.
Cluster
- 机器学习和数据挖掘中常用的K-means聚类算法,包含两个文件,kmeans.py是Python实现代码,bank-data.csv是测试数据-Machine learning and data mining commonly used K-means clustering algorithm contains two files, kmeans.py is a Python implementation code, bank-data.csv test data
outlier
- 离群点检测的算法,适应于MATLAB,包含关于距离和LOF的聚类算法-Outlier detection algorithm, adapted to MATLAB, including distance and LOF clustering algorithm
discriminant-analysis
- 判别样本所属类别的方法,主要包括Fisher判别、朴素Bayes判别和距离判别等-The method for determining the sample belongs to the category of specific methods include Fisher discriminant, discriminant and Bayes discriminant distance
Combination_prediction
- 组合预测模型,五个单项模型的组合预测模型和两个单项模型组合的预测模型-Combination forecasting
Read_LibSVM_files_in_R
- Read LibSVM files using R. LibSVM is a MATLAB based library of UCI repository Data Sets. You can convert the data sets to R using this code
datamining-sequentialpatterns-master
- implementation of prefixspan algorithm in c#
yuce
- 预测算法,用于一次指数预测模型,C++实现-Forecasting algorithm for an exponential forecasting model, C ++ implementation