搜索资源列表
Word-Segmentation
- 分词程序,用于文本分词,可以进行分词,统计词频-Segmentation procedure, used for text segmentation
Complete-Training-of-TC
- 用贝叶斯模型实现文本分类,;里面包含分词,词频统计,去除停用词等模块,目前完成的是分类的训练阶段。-realize text categorization by using the NaiveBayes Model
Lucene
- Lucene中文词频统计,包括分词,统计,排序,运行高效,分词手段使用Lucene封装的类库,操作简便-Lucene Chinese word frequency statistics, including segmentation, statistics, sorting, efficient operation, word means using Lucene library package, easy to operate