资源列表
Dictory
- 用B-树作为查找存储结构,对中文单词进行哈希,用最长逆向匹配算法进行中文句段分词,实现中文词典以及分词。中文词典规模为十万八千多词汇。分词的正确率在90 以上。-Use the B-tree as storage structure , and hash the Chinese word while storing or searching. Use the longest reverse matching algorithm to split Chinese sentence to word
wenben
- 文本分类文本的选择似乎是根据内容来的, 而非一般所采用的语体分类-Text classification is based on the text of the choice seems to be content, rather than the general language used in Classification
windows_c_32(1)
- 中国科学院的分词软件,正确率98 以上,目前最好的分词工具-Chinese Academy of Sciences word software, over 98 accuracy rate is currently the best tool for sub-word
untitled
- 十字按实现滤波器 数字信号处理 实现滤波器 数字信号处理-MATLAB MATLAB MATLAB MATLAB MATLAB MATLAB
DIS
- ucdos矢量字库标准读程序,只是差填充函数-ucdos vector font standard reading program, but poor filled function
Multilizer_2007_Enterprise_v7.1.8.774
- Multilizer 2007 Enterprise - a powerful program designed for transfer programs. Understand the immense number of formats, can distinguish between the type of programming language used in the program. It is possible to edit files for different OS plat
huanhang
- 主要实现对句子进行分词。也就是说,实现一个词语一行。-The main achievement of the sentence to word. In other words, to achieve a word line.
IKAnalyzer3.1.1_userguide
- java分词程序,能够精确分词,包含词库等-java word program, word accurately, including the thesaurus, etc.
VTVectorSubtract
- 字符串列表相减器 工作中常遇到重复处理事务,比如求职过程中,经过一段时间后以前考虑过的职位又被重新考虑造成时间浪费。该工具帮助挑选出重复内容、不同内容,应用在多种场合,帮助减少重复工作: 给出A,B列表,计算A-B, (A-(A-B)), (B-(A-B))。-Strings list subtraction program. Frequently encountered in the work of double handling of affairs, such as job s
happy
- 正是一个程序,点开以后会出现一片雪花,祝大家元旦快乐,-It is a program will appear one after opening the snow, I wish you all a happy New Year s Day, He He
ngramtool
- This is a small program to extract and display user-definable ngrams in a collection of English texts.
GB2312ToUnicode
- GBK 转 unicode 提供二分法查询-translate the gbk to the unicode,with the bianary search way