资源列表
SV
- IBM Model 1 Expectation Algorithm which takes two pieces of texts in different languages, and outputs the text alignment in a table, as well as the Viterbi alignment
82c55adatasheet
- 82c55芯片资料的详细功能、包含内容说明,和用法-82c55 chip of detail features, including the content descr iption, and usage, etc.
HanziStatics.rar
- 汉字统计程序
svmlight-6.01
- svm(支持向量机)是著名的分类算法,svmlight是其中的一种实现的最新版本。完全开源。
maxent
- 运用最大熵对一个文本中的类进行训练模型,然后可用模型进行预测,结果返回类名,是机器学习语言的重要部分,支持汉字分类-Use of maximum entropy of a text in class training model, the model can then be used to predict the results returned class name is an important part of machine learning languages, support for
CRFPP-0.53-
- CRF++-0.53,条件随机场命名实体识别,0.53版本,顺利通过测试运行--0.53 CRF, conditional random field named entity recognition, 0.53 version, successfully passed the test run
keyword-chouqu
- 基于逆向最大匹配算法的分词及基于HMM模型的词性标注系统,包括了未登录词的识别、数据库的添加等内容。(需要手动修改数据库的路径才可以运行)-Reverse Maximum Matching Algorithm Based on the sub-word HMM-based model and part of speech tagging system, including the unknown word identification, such as the contents of the d
SogouW.tar
- sougou在2006年统计的互联网词库,据说统计量有一亿网页。-sougou
At_functions
- Document contents AT-Functions for Siemens tc65 GSM terminal
PDFlib7UM
- PDFlib7中文用户手册,调用PDFlib库函数直接输出中文PDF文件
WordSeg
- 这是一个中文分词程序。用户将中文文件(.txt)打开,点分词后可看到分词结果。开源。
Windows-7-Loader
- dosya iste napı can yeter da