搜索资源列表
pc4.5.tar
- 此代码是用c语言编写的决策树的c4.5代码,它是数据挖掘分类算法中的一种,可以对给定数据集进行分类,挖掘出规则-this code is c language of the decision tree Bank code, which is data mining classification algorithm of a can of a given data set for classification, tapping rules
Clustering.zip
- 数据挖掘算法的实现,基于模糊聚类的最大树算法,数据集是darpa99,也就是KDD-CUP99中采用的数据集,The realization of data mining algorithms, based on fuzzy clustering of the largest tree algorithm, a data set is darpa99, which is used in KDD-CUP99 data set
Sort
- 对给定数据集合进行分类。是数据挖掘算法的KNN的分类实例。-For a given data set to classify. Data mining algorithms is the classification of examples of KNN.
agedata2001.ZIP
- its a good data set for data mining operation.
shujuwajuelunwen
- 这是数据挖掘的论文 关于决策树的 有测试集-This is the thesis of data mining on a test set of the decision tree
ENCLUS
- Entropy Based Subspace Clustering for Mining Data - ENCLUS - a new version of PROCLUS algorithm for clustering high dimensional data set.-Entropy Based Subspace Clustering for Mining Data- ENCLUS- a new version of PROCLUS algorithm for clustering hi
parallel
- 并行程序,并行离散化算法实现,粗糙集数据挖掘程序-Parallel programming, parallel discrete algorithms, rough set data mining program
Clustering.Algorithms.Research
- 软件学报 2008年论文《聚类算法研究》,作者孙吉贵, 刘杰, 赵连宇。pdf格式,14页。对近年来聚类算法的研究现状与新进展进行归纳总结.一方面对近年来提出的较有代表性的聚类算法,从算法思想、关键技术和优缺点等方面进行分析概括 另一方面选择一些典型的聚类算法和一些知名的数据集,主要从正确率和运行效率两个方面进行模拟实验,并分别就同一种聚类算法、不同的数据集以及同一个数据集、不同的聚类算法的聚类情况进行对比分析.最后通过综合上述两方面信息给出聚类分析的研究热点、难点、不足和有待解决的一些问题.上
K-MEANS
- 数据挖掘,K-means源码,数据集为iris-Data mining, K-means source code for the iris data set
cure
- 数据挖掘,CURE算法实现,数据集为iris-Data mining, CURE algorithm, the data set for the iris
DIANA
- 数据挖掘,DIANA算法实现,数据集为iris-Data mining, DIANA algorithm, the data set for the iris
lda-0.1-matlab
- lda是一个集合概率模型,主要用于处理离散的数据集合,目前主要用在数据挖掘(dm)中的text mining和自然语言处理中,主要是用来降低维度的-lda is a set of probability models, mainly for dealing with discrete data sets, the key used in data mining (dm) in the text mining and natural language processing, is mainly u
kddcup.data
- KDD cup 1999 Data. This is the data set used for The Third International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-99 The Fifth International Conference on Knowledge Discovery and Data Mining.
Wine-Quality-Data-Set
- 红酒、白酒质量数据集,可作为机器学习中的数据挖掘数据库-Red wine, white wine quality data sets can be used as data mining machine learning database
Forest-Fires-Data-Set
- 森林火灾数据集,可作为 数 据 挖 掘 的数据库-Forest fires dataset can be used as data mining database
data-set-public-affairs
- public affair data set for the matlab input to the data minig concept the data mining the input will be read by the user in the tex formet
mining-p.zip
- Data mining is the process of extracting potentially useful information from a data set. Clustering is a popular data mining technique which is intended to help the user discover and understand the structure or grouping of the data in the set accord
data--preprocessing-using-kdd-data-set
- Data Mining process model selected is KDD which starts selection of data.Initially the researcher has taken the Kddcup.data-10-perecnt which contains total of 311,027 records which includes both labeled and unlabeled records-Data Mining process model
Geolife Data 1.3
- Geolife GPS 轨迹数据集–用户指南 这一 GPS 轨迹数据集是在 (微软研究亚洲) Geolife 项目中收集的, 178 用户在四年 (2007年4月至 2011年10月) 期间。该数据集的 GPS 轨迹由一个时间戳点序列表示, 每一个都包含纬度、经度和高度信息。该数据集包含17621个轨迹, 总距离为1251654公里, 总持续时间为48203小时。该轨迹数据集可以应用于移动模式挖掘、用户活动识别、基于位置的社交网络、位置隐私和位置推荐等多个研究领域。(Geolife GPS t
sklearn-tree-BN-knn
- 分类器的性能比较与调优: 使用scikit-learn 包中的tree,贝叶斯,knn,对数据进行模型训练,尽量了解其原理及运用。 使用不同分析三种分类器在实验中的性能比较,分析它们的特点。 本实验采用的数据集为house与segment。(Performance comparison and optimization of classifiers: We use tree, Bayesian and KNN in scikit-learnpackage to train the dat