Resource list
2
- Every writer has a distinct style, and word usage differs considerably between authors. Based on how function words are used in each article, this computes the cosine between two articles to measure their similarity.
1
- Detects the similarity of Chinese articles: the text is first segmented into words, features are then extracted, and the angle between the feature vectors is computed to test whether the articles are similar.
PageRank_M
- Implements the basic PageRank algorithm, reading data from a file to build the matrix used in the computation.
SPPP3.0
- Contains programs for a variety of signal-processing algorithms, such as the traditional Fourier transform and other time-domain and time-frequency-domain analyses.
InfoRetri
- Text classification, including stop-word removal, word segmentation, feature extraction, and classification. The original Chinese description names naive Bayes as the classifier; the original English description names libsvm.
document
- Analysis of NSL-KDD Dataset for Fuzzy Based Intrusion Detection System
IJETAE_0313_100
- Intrusion Detection System Using Data Mining Technique: Support Vector Machine
fp_growth
- The FP-growth data mining algorithm, which obtains frequent itemsets quickly.
Apriori
- The Apriori data mining algorithm, which generates frequent itemsets.
ID3
- An implementation of the ID3 decision tree algorithm, including experimental data and a report.
KNN
- An example of a K-nearest-neighbor classifier that classifies data; includes test data.
the-APRIORI-algorithm
- Implements the Apriori algorithm and applies it to the accompanying data (with given parameters) to mine frequent itemsets.
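
The function-word comparison described in the first two entries can be sketched roughly as follows. This is only an illustration, not the code contained in those resources: the `FUNCTION_WORDS` list and the names `function_word_vector`, `cosine_similarity`, and `article_similarity` are assumptions made for this example, and the input is assumed to be already segmented into word tokens.

```python
import math
from collections import Counter

# Hypothetical, deliberately tiny function-word list for illustration only.
FUNCTION_WORDS = ["的", "了", "在", "和", "也", "而", "之", "于"]


def function_word_vector(tokens, vocabulary=FUNCTION_WORDS):
    """Return the relative frequency of each function word in a token list."""
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero for empty input
    return [counts[word] / total for word in vocabulary]


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def article_similarity(tokens_a, tokens_b):
    """Compare two pre-segmented articles by their function-word usage."""
    return cosine_similarity(
        function_word_vector(tokens_a),
        function_word_vector(tokens_b),
    )
```

A value near 1 suggests similar function-word usage and a value near 0 suggests little overlap. Any threshold for calling two articles "similar" would have to be chosen empirically.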