搜索资源列表
SurveyTextMining
- 这是一本关于文本挖掘的书籍,包括聚类 分类 信息提取的内容
lingpipe-3.6.0
- 一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能,包括主题分类(Top Classification)、命名实体识别(Named Entity Recognition)、词性标注(Part-of Speech Tagging)、句题检测(Sentence Detection)、查询拼写检查(Query Spell Checking)、兴趣短语检测(Interseting Phrase Detection)、聚类(Clustering)、字符语言建模(Character
JiaoChaShang
- 文本挖掘中交叉熵算法实现,通过词汇左右出现的概率来计算交叉熵-Text mining cross entropy algorithm,The task of part2of2speech iden t if icat ion is to au tomat ically assign a part2of2speech tag to an unknow n wo rd w ith emp ty part2of2speech info r2 mat ion. A part2of2speech
1111
- 文本挖掘-中文分类器搜索,可以挖掘出文本主干,利用贝叶斯算法。-Text mining
TextMining-Tools
- 北大杨建武文本挖掘课件第15章,详细介绍了文本挖掘的工具和流程,可以在一天之内掌握文本挖掘的来龙去脉!-North Jian-Wu Yang text mining courseware Chapter 15, details the text mining tools and processes, can one day master the ins and outs of text mining!
Text-mining
- 10几篇文本挖掘方面的论文 例如 web内容挖掘综述 web内容挖掘技术研究.-Text mining,data mining,web mining.10 several text mining papers such as the web content mining Summary of Web Content Mining.
ARFFInputformat
- hadoop下自定义的读文件格式类,对于数据挖掘分类算法的训练测试文本的特殊格式有很大帮助.-hadoop read the file format class custom of great help for training in the special format of the test text data mining classification algorithms.
util
- 很多文本处理有用的工具,NLP,数据挖掘都能用到-A lot of useful text processing tools, NLP, data mining can be used
Indexing
- 典型的文本挖掘案例,用于java程序开发平台的插件,dragontool-A typical route for the development of text retrieval and mining applications is illustrated in Figure 1. First of all, it is required to prepare a collection of machine-readable documents.
NaiveBayes
- 基于朴素贝叶斯算法实现的文本分类程序,对数据挖掘的初学者具有很好的学习参考价值。-Based on Bayesian text classification algorithm procedures, data mining beginners a good learning reference value.
ictclas4j
- 中科院中文分词系统完成的java源码,能很好的实现中文的分词,为文本挖掘提供基础。-Chinese Academy of Sciences Chinese word segmentation system to complete the java source code, can achieve good word of Chinese, provide a basis for text mining.
TFIDF
- 很好用的程序,进行中文特征的提取!相信能帮到大家对文本特征的提取,数据的挖掘-It s very useful!
Datamining
- 数据挖掘特征选择,选择文本中的特征词语来分析文本-Data mining feature selection
MyApriori
- 使用Apriori算法进行关联规则挖掘,数据文本格式参阅例子文件夹-Association rule mining with Apriori
rseslib-3.0.4-src
- 包含很多知名算法实现,支持向量机,决策树,粗糙集,贝叶斯分类器等,适合学术研究,短评论意见挖掘,文本分类等(It includes many well-known algorithm implementation, support vector machine, decision tree, rough set, Bias classifier, etc., which is suitable for academic research, short comment mining, text c