搜索资源列表
Index
- 方便检索匹配等应用的,中文后序最大匹配算法的实现,数据库中词表和停用词自行更改即可-Convenient retrieval matching, Chinese applications such as after the realization of maximal matching algorithm sequences, the database tables and stop words seen to change can
DeleteStopWord
- 此源码组要用于中文文本预处理。源码首先进行文本分词,分词之后对文本中的停用词进行过滤。-text preprocessing
fenci
- 基于IKAnalyzer2012的中文分词java代码,可以去除停用词。-The Chinese word segmentation based IKAnalyzer2012 java code, you can remove stop words.
ExcludeStopWord
- 对一段中文文本经中文分词后,根据停用词表,去除文档中的停用词。-After a period of Chinese text by the Chinese word, according to the stop list, the removal of stop words in the document.
ReadFiles
- 对中文文本进行分词,去停用词以及计算tf-idf值-The Chinese text segmentation, excluding stop words and computing tf- idf values
ictclas
- 用java语言实现中文分词去停用词,中科院分词软件ICTCLAS-To achieve the Chinese word to stop word
FileDemo
- 对文件进行分词的例子.输出带词性的中文分词,已经去掉了停用词.-Examples of the file segmentation output of the Chinese word with POS, has been removed stop words.