搜索资源列表
语料库
- 一份很重要的语料库,为你的分词程序是一个很好用的资料库文件-a very important corpus, as your segmentation procedure is a very good use of the database file
复件 语料库试验程序
- 处理语料库信息的小程序-Corpus information handling procedures small
分词语料库
- 文本分词\分类的语料库
chinese
- 中文信息处理基础 第一讲VC环境编程简介 第二讲文件处理 第三讲字符编码 第四讲字频统计 第五讲文本断句 第六讲语料库-Basic information first deal with English-speaking environment for programming VC brief introduction stresses the second file handle character encoding the third stresses t
SogouC.reduced.20061127
- 搜狗语料 关于文本分类语料库的问题搜狗实验室搜狗实验室(Sogou Labs)是搜狗搜索核心研发团队对外交流的窗口,期望通过这个平台,展现搜狗研发团队强大的研发-Sogou corpus corpus corpus on the issue of text categorization Sogou Sogou Lab Lab (Sogou Labs) is the core of R & D team Sogou search window for foreign exchanges,
chinese-text
- 文本分类语料库,经过编辑手工整理与分类的新闻语料与对应的分类信息。其分类体系包括几十个分类节点,网页规模约为十万篇文档-Text classification corpus, edited manually compiled and classification of news corpus and the corresponding classification information. Their classification system includes dozens of classi
yuliaoku
- 对语料库的一些文献和资料集合,有一定的参考价值-Some of the corpus of literature and information collection, has some reference value
fenci
- 自己下载一个语料库,根据程序,计算权重,然后对语料库进行分词-Download a corpus itself, according to the procedures for calculating the weights, and then carried out on sub-word corpus
reuters21578
- 这是一个英文的语料库,可以用于进行文本的分类与聚类。是文本分类领域共用的一个语料库。-This is a corpus of English, can be used for text classification and clustering. The field of text classification is a common corpus.
tc-corpus-train
- 语料库训练集 , 适用于文本分类中的训练-ts-corpus-training
11
- 关于语音识别中语料库的建立与整理,以及分析统计-Speech Recognition Corpus on the establishment and finishing, as well as the analysis of statistical
22
- 关于语音识别中语料库的建立与整理,以及分析统计-Speech Recognition Corpus on the establishment and finishing, as well as the analysis of statistical
3
- 关于语音识别中语料库的建立与整理,以及分析统计-Speech Recognition Corpus on the establishment and finishing, as well as the analysis of statistical
4
- 关于语音识别中语料库的建立与整理,以及分析统计-Speech Recognition Corpus on the establishment and finishing, as well as the analysis of statistical
corpus
- 语料库,蒙语同音同形词管理与维护工具。c++builder + access结合的产品。算法经典-corpus
Chinese-Names-Corpus-master
- 中国人名语料库,txt类型文件,分词练习(Personal Name Corpus)
语料库检索工具
- 一个语料库检索工具,可以对文本形式的英汉词典(包含常用英文词汇)进行检索,是开发大型语料库工具的原型(the assistance of statistical package and computer programs)
icwb2-data
- NLP中文语料库,backoff语料库,可以用来训练(A wiki (Listeni/ˈ wɪ ki/ WIK-ee) is a website that provides collaborative modification of its content and structure directly the web browser. In a typical wiki, text is written using a simplified markup language an
aclImdb_v1.tar
- 英文影评语料库,用于英文情感分析。包含训练集和测试集,均为标注数据。(English movie reviews corpus)
文本处理高级语料库
- 自然语言处理语料库代码,能够提供大量方向基础入门信息。