搜索资源列表
NLP
- 自己写的NLP处理软件,带有去停用词,去空行,组合生成指定笛卡尔集合的功能。-Own the NLP processing software to write, with a go stop words, go blank line, generated by combining the functionality specified Cartesian collection.
TFIDF-master
- tf–idf, short for term frequency–inverse document frequency, is a numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus.[1]:8 It is often used as a weighting factor in information retrieval an
toolkit_for_words_En
- 处理英文中的停词、同词干词,不改变文章结构。适用于文本分类、文本聚类、推荐预处理。-Processing of stop words in English, with the stem word, does not change the structure of the article. Suitable for text categorization, text clustering, recommend pretreatment.