搜索资源列表
LSA.zip
- LSA中包含几篇有关文本挖掘的资料,介绍了lsi和svd算法,LSA contains several text mining of the information presented and svd algorithm lsi
lingpipe-3.6.0
- 一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能,包括主题分类(Top Classification)、命名实体识别(Named Entity Recognition)、词性标注(Part-of Speech Tagging)、句题检测(Sentence Detection)、查询拼写检查(Query Spell Checking)、兴趣短语检测(Interseting Phrase Detection)、聚类(Clustering)、字符语言建模(Character
simpack
- simple TF-IDF Algorithm for text mining
libsvm-2.9
- 文本分类工具libsvm-2.9.zip 信息检索和数据挖掘的中用到的工具包, 里面有C++、JAVA、Python等多个语言版本-Libsvm-2.9.zip text classification tool for information retrieval and data mining tools used in the package, inside C++, JAVA, Python and other languages
MinCutSeg
- 一组很直观的有关于文本算法的java代码。对Text Mining 的研究能起到一定的帮助。-A very intuitive method of java on the text of the code. Research on Text Mining can play some help.
text_data_mining
- java编写的数据挖掘方面的代码,里面包含有文本分类,作者身份识别方面的java源码,本人亲自参与编写-java code about data mining;include:text cluster ,authorship identification,
1111
- 文本挖掘-中文分类器搜索,可以挖掘出文本主干,利用贝叶斯算法。-Text mining
tpTextMining
- tp text mining in java eclipse
Indexing
- 典型的文本挖掘案例,用于java程序开发平台的插件,dragontool-A typical route for the development of text retrieval and mining applications is illustrated in Figure 1. First of all, it is required to prepare a collection of machine-readable documents.
java_stopword_code
- Java 数据挖掘进行文本预处理的一步,除去文本中包含的停用词-Java Data Mining for text preprocessing step to remove the text contains the word disabled
ictclas4j
- 中科院中文分词系统完成的java源码,能很好的实现中文的分词,为文本挖掘提供基础。-Chinese Academy of Sciences Chinese word segmentation system to complete the java source code, can achieve good word of Chinese, provide a basis for text mining.
kmeans
- k-means算法是文本聚类经典算法,也是数据挖掘十大经典算法之一。k-means算法Java实现。-k-means algorithm is a classical algorithm text clustering, data mining is one of the ten classic algorithms. k-means algorithm is implemented in Java.
cs224n-pa1-master
- java text mining ibm model
rseslib-3.0.4-src
- 包含很多知名算法实现,支持向量机,决策树,粗糙集,贝叶斯分类器等,适合学术研究,短评论意见挖掘,文本分类等(It includes many well-known algorithm implementation, support vector machine, decision tree, rough set, Bias classifier, etc., which is suitable for academic research, short comment mining, text c