Resource search results
stopword-list
- Before text can be classified or clustered, it must be preprocessed. The first preprocessing step is word segmentation, during which stop words are removed. This file is the stop-word list.
Text-classification
- Word-frequency statistics for text classification: word segmentation, stemming, stop-word removal, and word-frequency calculation, with a user interface.
segmentation
- Segments text into words and uses a stop-word list to remove stop words, punctuation, and similar tokens.
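
The preprocessing steps these entries describe (segmentation, stop-word removal, word-frequency counting) can be sketched roughly as follows. This is a minimal illustration and not the code of any listed resource: the `STOP_WORDS` set and the function names are hypothetical, the tokenizer is a plain regex that only handles space-delimited text such as English, and stemming is left out. Chinese text would need a dedicated word segmenter instead of `tokenize`.

```python
import re
from collections import Counter

# Hypothetical, tiny stop-word list for illustration only;
# a real list would normally be loaded from a stop-word file.
STOP_WORDS = {"the", "is", "a", "of", "and", "to", "in"}


def tokenize(text):
    """Lowercase the text and return runs of ASCII letters/digits.

    Punctuation is dropped because it never matches the pattern.
    """
    return re.findall(r"[a-z0-9]+", text.lower())


def remove_stop_words(tokens, stop_words=STOP_WORDS):
    """Keep only the tokens that are not in the stop-word set."""
    return [t for t in tokens if t not in stop_words]


def word_frequencies(text, stop_words=STOP_WORDS):
    """Tokenize, filter stop words, and count the remaining tokens."""
    return Counter(remove_stop_words(tokenize(text), stop_words))


freqs = word_frequencies("The cat sat in the hat, and the cat is happy.")
print(freqs.most_common(3))
```

In this sample sentence "cat" survives filtering twice, while "the", "in", "and", and "is" are removed before counting. Keeping the stop-word set as a parameter lets the same functions run against whichever stop-word list is downloaded.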