搜索资源列表
distinguishword
- 文字识别,输入一段文字可以统计和识别段落和字母。-character recognition, text can be imported for some statistics and identification paragraphs and letters.
FlexCRFs-0.3
- Hieu Xuan Phan & Minh Le Nguyen 利用CRF统计模型写的可用于英文命名实体识别、英文分词的工具(开放源码)。CRF模型最早由Lafferty提出,全名conditional random fields,该模型后来被广泛地应用在语言和图像处理领域,并随之出现了很多的变体。FlexCRF就是对CRF模型的一个实现应用工具,可用于文本信息处理
personNER
- 基于CRF(conditional random fields)统计模型的文本人名识别工具源代码,是Mallet开放源码项目的一部分-based on CRF (conditional random fields) statistical model of text my name recognition tools source code, open source Mallet is part of the project
HMM
- 基于统计的分词,采用隐马尔可夫模型,并有实验报告-Based on statistics segmentation using hidden Markov models, and there is experimental report
envec.py
- 识别中文,对中文词进行统计,打印出每个中文词的数目。-Identify Chinese, the Chinese word for statistics, print out the number of each Chinese word.
hanzitongji
- 汉字字频统计的小程序,使用前需要修改源码中的读取路径,另外txt必须Unicode编码才能识别。-Chinese characters word frequency statistics applets, you need to modify the source code (the read path) before using. Additionally, the Unicode encoding txt must be identified.