搜索资源列表
中文停用词
- 中文停用词
stopwords.rar
- 中文词的停用词表,可以作为中文信息处理中停用词删除的索引词典使用。,Chinese Vocabulary words out, Chinese information processing can be used as stop words in the dictionary using the index to delete.
Index
- 方便检索匹配等应用的,中文后序最大匹配算法的实现,数据库中词表和停用词自行更改即可-Convenient retrieval matching, Chinese applications such as after the realization of maximal matching algorithm sequences, the database tables and stop words seen to change can
TextCategorizer
- 自己实现的中文分词器、贝叶斯文本分类器 附分词词典、中文停用词表 用于数据挖掘学习、交流 Visual Studio 2010 开发-Realize his Chinese word segmentation, Bayesian text classifier the attached word dictionary, the Chinese stop word table is used for data mining learning, exchange of the Visua
fencizf
- 基于java编程,采用最大匹配算法实现简单的中文分词,并过滤停用词-The maximum matching algorithm based on java programming, simple Chinese word segmentation and filtering stop words
stopwords-
- 中文停用词表,比较全面,有1208个,通用词就是的,是,呢,了这样的词-Chinese stop word table, more comprehensive, 1208, is a generic term, is that it, such a word
Engilsh-Chineas-StopWords
- 中文和英文的停用词词库,在信息检索方面能用到-this is the English and Chines Stop-words,you can use this in Information Searching program
DeleteStopWord
- 此源码组要用于中文文本预处理。源码首先进行文本分词,分词之后对文本中的停用词进行过滤。-text preprocessing
fenci
- 基于IKAnalyzer2012的中文分词java代码,可以去除停用词。-The Chinese word segmentation based IKAnalyzer2012 java code, you can remove stop words.
ExcludeStopWord
- 对一段中文文本经中文分词后,根据停用词表,去除文档中的停用词。-After a period of Chinese text by the Chinese word, according to the stop list, the removal of stop words in the document.
ReadFiles
- 对中文文本进行分词,去停用词以及计算tf-idf值-The Chinese text segmentation, excluding stop words and computing tf- idf values
ictclas
- 用java语言实现中文分词去停用词,中科院分词软件ICTCLAS-To achieve the Chinese word to stop word
FileDemo
- 对文件进行分词的例子.输出带词性的中文分词,已经去掉了停用词.-Examples of the file segmentation output of the Chinese word with POS, has been removed stop words.
ChineseChuLi
- 中文文本处理的python程序,包括分词、删除特殊字符、删除停用词、爬虫程序、PCA降维、Kmean聚类、可视化等(Python programs for Chinese text processing, including participle, deleting special characters, deleting disuse words, crawler programs, PCA dimensionality reduction, Kmean clustering, visuali
stopwords
- 中文停用词表(停用词是指在信息检索中,为节省存储空间和提高搜索效率,在处理自然语言数据(或文本)之前或之后会自动过滤掉某些字或词,这些字或词即被称为Stop Words(停用词)。这些停用词都是人工输入、非自动化生成的,生成后的停用词会形成一个停用词表。)(Chinese Stop Words List)
新建文件夹
- 文本处理,自然语言处理,包含中文和英文停用词(text processing,including chinese and english stopwords)
停用词表
- 在自然语言处理任务中常用的停用词表,可以去除中文停词(Frequently used stop lists in natural language processing tasks, Chinese stop words can be removed)
Python中文文本预处理
- 包括删除标点、分词、删除停用词、计算相似度、文本聚类等功能