搜索资源列表
JAVA实现文本聚类,用到TF/IDF权重
- JAVA实现文本聚类,用到TF/IDF权重,用余弦夹角计算文本相似度,用k-means进行数据聚类等数学和统计 知识。,JAVA realization of text clustering, using TF/IDF weight, calculated using cosine angle between the text of similarity, using k-means clustering for data such as mathematical and statistical
java-cluster.zip
- 用java语言实现文本聚类,包括聚类前的数据预处理:分词、降维、建立向量空间模型等,Implementation using java language text clustering, including clustering of the data pre-processing before: segmentation, dimensionality reduction, set up, such as Vector Space Model
LJClusterDemo
- 文本聚类是基于相似性算法的自动聚类技术,自动对大量无类别的文档进行归类,把内容相近的文档归为一类,并自动为该类生成特征主题词。适用于自动生成热点舆论专题、重大新闻事件追踪、情报的可视化分析等诸多应用。 灵玖Lingjoin(www.lingjoin.com)基于核心特征发现技术,突破了传统聚类方法空间消耗大,处理时间长的瓶颈;不仅聚类速度快,而且准确率高,内存消耗小,特别适合于超大规模的语料聚类和短文本的语料聚类。 灵玖文档聚类组件的主要特色在于: 1、速度快:可以处理海量规模
textcluster
- 文本聚类算法源码,包含tf.idf计算的实现,采用java语言编写-text cluster algorithm, including the computation of tf.idf ,written by Java
kmeans
- kmeans文本聚类算法的简单实现,是用JAVA实现的-kmeans a simple realization of the text clustering algorithm is implemented using JAVA
MyTest
- 利用wvtool实现文本分类的实例程序,自己写的,功能简单-Text Classification using wvtool instances of programs to achieve their own writing, feature a simple
TDIDF_Demo
- 基于k-means的文本聚类程序实现,希望对大家有帮助!-Based on the k-means clustering procedures for the realization of the text, I hope all of you help!
lingpipe-3.6.0
- 一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能,包括主题分类(Top Classification)、命名实体识别(Named Entity Recognition)、词性标注(Part-of Speech Tagging)、句题检测(Sentence Detection)、查询拼写检查(Query Spell Checking)、兴趣短语检测(Interseting Phrase Detection)、聚类(Clustering)、字符语言建模(Character
text-data-mining
- 此程序实现了如何在TXT或WORD文档中进行数据挖掘,在文本中提取有用信息-The realization of this procedure how to TXT or WORD document to carry out data mining, in the text to extract useful information
extract_document
- 这是一个提取 Reuter-21578 的程序, 用做自然语言处理, 文本分类聚类,和信息检索的测试集!-This is an extract of the Reuter-21578 procedure, used for natural language processing, text classification clustering, and information retrieval test collection!
Clustering
- 用于文本的聚类,里面包含两个java文件,需放在同一个包内-For text clustering, which contains two java files, need to be placed within the same package
TextClustering
- 文本聚类,VC编程实现,作为一种无监督的机器学习方法,聚类由于不需要训练过程,以及不需要预先对文档手工标注类别,因此具有一定的灵活性和较高的自动化处理能力-Text Clustering, VC programming, as an unsupervised machine learning method, clustering by eliminating the need for the training process, and do not need to manually pre-ma
som
- based text clustering algorithm, has a detailed analysis and examples
program
- 基于无监督学习的谱聚类算法的文本的聚类分类。-Unsupervised Learning Based on spectral clustering algorithm for text clustering classification.
src_2
- an another k means clustering fot clustering the text data
MyCluster
- 聚类算法中的k均值算法,里面已经包含一个文本聚类的实验。-Clustering algorithm k means algorithm, which already contains a text clustering experiments.
matlab
- 这是一些文本聚类的matlab函数,是遗传算法过程中的一些常用函数-This is some text clustering matlab function, genetic algorithm is the process of some of the commonly used functions
text-clustering
- 文本聚类及主题挖掘相关论文合集,包括了kmeans,层次聚类,ap聚类等等相关方法-Text clustering and topic mining related collection of papers
frequent-term-based-text-clustering
- 一篇很好的基于主题的聚类方法论文,可以用在文本分类等众多领域-frequent term-based text clustering
Large-scale-text-clustering-master
- java 实现文本聚类 java 实现文本聚类 -the code of text clustering the code of text clustering the code of text clustering