资源列表
ICTCLAS50_Windows_32_C
- ICTCLAS_5.0中文分词库,有中科院开发。目前最好的中文分词系统,此为最新版。 -ICTCLAS 5.0 for Windows(32 bits)
GBK2EBCDIC
- 中文GBK汉字与EBCDIC内码的对照表,还有unicode的码表对照。以GBK顺序排列。本人原创。互联网上应该目前是最完整,最全。最正确的资料。-chinese GBK code list and EBCDIC GBK code list,order by GBK code
SnakeYAML-all-1.9
- Java SnakeYAML: parser emitter for java
89346469Chinesemiraclelanguage
- rom修改工具,开始的了附件阿哥见阿福开始发见佛ie 发的时间哦附件奥飞爱神的箭-shoidaskegnlkageoigjvioa
CRFTagger-1.0.tar
- 一个利用条件随机场(CRF)开发的词性标注工具包
head_first_programming
- Easy to Learn Document for Applications Development. Python Programming with visual help.
syntax3.0
- 无监督的句法学习系统,用java开发的,采用简单的规则实现的句法分析器-Syntactic unsupervised learning system developed with java, using simple rules of implementation Parser
ok
- web2project加入中文语言, 亲测可用,-web2project with chinese simple language
ictclaszyfc分词接口
- 中文分词接口
abner
- 一个命名实体识别工具,是Mallet开放源码项目的一部分,可用于识别文本中的人名、地名等信息-a named entity recognition tools, Mallet OSS part of the project, Text can be used to identify the names, places and other information
xwrapelite.rar
- html页面在线抽取器的源代码,java编写,可实现在线自动抽取实体,Extractor online html page' s source code, java development, can be automatically extracted entities online
tse_segment
- tse_segment分词程序,它对中文分词进行了初步的尝试-tse_segment segmentation procedure, which the Chinese word for a preliminary attempt