Resource list
1
- Detects similarity between Chinese articles: the text is first segmented into words, features are then extracted, and the angle between the feature vectors is computed to decide whether the articles are similar.
2
- Every writer has a distinctive style, and word usage differs considerably between authors. Based on how function words are used in each article, this computes the cosine between two articles to measure their similarity.
1
- Hypothesis tests for the mean and the variance of a normal distribution, written in R.
2
- Hypothesis tests for the mathematical expectation (mean) and the variance of a normal distribution, written in R.
1
- Constructs the empirical distribution function in R and plots it for analysis.
demmentia
- Vascular dementia data and classification. The data comes from the neurology and radiology departments of a Grade 3A hospital and is very valuable; please do not use it commercially.
code
- Implements three clustering algorithms: k-means, clustering by non-negative matrix factorization (NMF), and spectral clustering.
CARTandRFclass
- Implements CART and the random forest algorithm using decision trees. A good learning tool that is also usable in practice; its speed has been optimized following discussion with others.
libsvm-guide
- A document written for beginners with libsvm or SVM, with particular attention to the steps of data mining.
C45
- Visual Basic program for classification data mining with the C4.5 algorithm
Tiki-5-Okt
- Visual Basic data mining program
hadoop-book-master
- Companion source code for Hadoop: The Definitive Guide, containing all the examples from the book, provided for study.
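
The cosine-based similarity check described in the first two entries can be sketched as follows. This is only a minimal illustration, not the code contained in those resources: it assumes the articles have already been segmented into tokens, and the function-word list `FUNCTION_WORDS` and both function names are hypothetical choices made for the example.

```python
import math
from collections import Counter

# Hypothetical function-word list, chosen only for illustration.
FUNCTION_WORDS = ["的", "了", "也", "而", "之", "则"]


def function_word_vector(tokens, vocabulary=FUNCTION_WORDS):
    """Return the relative frequency of each function word in a token list."""
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero for empty input
    return [counts[word] / total for word in vocabulary]


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # treat a vector with no function words as dissimilar
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    # Two pre-segmented toy articles (assumed input format).
    article_a = ["他", "的", "书", "也", "好", "了"]
    article_b = ["我", "的", "笔", "也", "坏", "了"]
    vec_a = function_word_vector(article_a)
    vec_b = function_word_vector(article_b)
    print(cosine_similarity(vec_a, vec_b))
```

A value close to 1 means the two articles use the listed function words in similar proportions; the threshold at which two articles count as "similar" is left to the user.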