搜索资源列表
DocCluster
- 实现的文档聚类的JAVA程序,适宜入门。-Implementation document clustering JAVA program, suitable for entry.
javaSwingweb
- javaSwing实现的web文档聚类方法研究,不同权值与精度,直接输入新闻网址,可以自动解析并聚类web文档-web document clustering method implemented by javaSwing different weights with precision directly enter a news website, you can automatically parse and clustering web documents
Kmeans
- 算法思想:提取文档的TF/IDF权重,然后用余弦定理计算两个多维向量的距离来计算两篇文档的相似度,用标准的k-means算法就可以实现文本聚类。源码为java实现(Algorithm idea: extract the TF/IDF weight of the document, then calculate the distance between two multidimensional vectors by cosine theorem, calculate the similarity