搜索资源列表
ProbWordSeg1
- 基于最大概率的分词,首先读入.mdb数据库(字典与其统计词频),然后读入你要分词的.txt-based on the maximum probability of the word, first read into. Mdb database (with dictionary word frequency statistics). Then you should read into the word. txt
wordpos
- 给定带有分词和词性标注信息语料,从中总结单词的词频,并按照出现次数排序输出-given with sub-term and part-of-speech tagging information corpus, it is concluded that the words and phrases, and in accordance with the order of the output frequency
seg_delphi
- delphi版的基于词频字典的分词算法,其中dict目录下为词频字典。对于未注册词也有一定的识别能力。
ictclas4j_0[1].9.1
- 基于java语言的分词系统,可以标注词性、词频等信息,可用于二次开发
MFC查词典、分词、词频统计程序
- MFC编程,功能是查词典(用户可自己导入文本),分词,统计词频,还可以保存结果!我们MFC课的期末作业,强烈推荐!-MFC programming function is to check dictionary (users can import their own version), participle, statistical, frequency, the results can be saved! We MFC class at the end operations, strongly
MFC编程,功能是查词典(用户可自己导入文本),分词,统计词频
- AppWizard has created this RMM application for you. This application not only demonstrates the basics of using the Microsoft Foundation classes but is also a starting point for writing your application. This file contains a summary of what you
TF/IDF 算法
- 统计词频,和对文档进行分词处理,计算tf-idf值,JAVA实现
ChineseSplit
- 一个基于VB.NET开发的中文分词及关键词提取系统,采用双向最大匹配、词频统计、快速排序等算法实现。-VB.NET developed based on Chinese word segmentation and Key Extraction System, the largest two-way matching, word frequency statistics, such as quick sort algorithm.
zhongwenfenci
- 使用字典的中文分词程序 输入: Dic.txt: 词典文件 Target.txt: 目标文件 输出: Result_back.txt:后向分词的结果 Result_front.txt:前向分词的结果 Result.txt: 双向扫描,对词频统计分析得到的结果 -The Chinese word dictionary program input: Dic.txt: dictionary file Target.txt: Target file output
2
- c#中文分词源码,基于词频,词性等,可提取自定义数量的关键词-Chinese word c# source code, based on word frequency, part of speech, can customize the number of keywords extracted
ictclaszyfc-v2009
- 中科院分词系统,包含添加词汇、统计词频等。-Chinese Academy of Sciences segmentation system, including adding vocabulary, word frequency and other statistics.
Src____wordcloud_src
- 基于词频的国外分词代码,这个可以和国内的一些代码做比较。-Frequency-based segmentation of the foreign code, the number of domestic can be compared to the code.
051_276_project3
- 根据分词程序结果用倒排文件对词频进行统计,并输出索引-Segmentation results of the proceedings in accordance with the inverted file used to compile the statistics on the frequency and the output index
WindowsApplication1
- 处理的对象是:完成分词和词性标注的语料,实现的结果是:统计出现词频完成降序排列。-Dealing with the object are: the completion of word segmentation and POS tagging of the corpus, the results achieved are: the completion of word frequency statistics appear in descending order.
1
- 最大概率分词法,词频词典用的是北语版的也有可能是词典的原因-Maximum probability sub-lexical, word frequency dictionary used in the North language version of the dictionary is also possible that the reasons for
ngrams
- 自然语言处理相关程序,有关分词的和词频统计-Natural language processing procedures, the statistical segmentation and word frequency
Word-Segmentation
- 分词程序,用于文本分词,可以进行分词,统计词频-Segmentation procedure, used for text segmentation
Complete-Training-of-TC
- 用贝叶斯模型实现文本分类,;里面包含分词,词频统计,去除停用词等模块,目前完成的是分类的训练阶段。-realize text categorization by using the NaiveBayes Model
课程设计作业
- 用分词包进行分词,并通过分词统计每个词频出现次数(use to seperate an article, and use the dictionary to find the frequency of each word)
wordseg
- 运用R语言进行中文分词处理,得到词频统计,并绘制词云图直观表示(Chinese word segmentation and word cloud drawing)