Search resource list
textsegment
- A Chinese word segmentation program written in Java, with a GUI.
fenci
- A Chinese word segmentation program with Java support.
IKAnalyzer3.2.8-bin
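- IKAnalyzer is an open-source, lightweight Chinese word segmentation toolkit developed in Java. Since version 1.0 was released in December 2006, IKAnalyzer has gone through three major versions. It began as a Chinese word segmentation component built around the open-source project Lucene, combining dictionary-based segmentation with grammar analysis algorithms. The newer IKAnalyzer 3.0 has become a general-purpose segmentation component for Java, independent of the Lucene project, while still providing a default optimized implementation for Lucene.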
wordsegment
- A Chinese word segmentation system that offers a choice of two segmenters, IKAnalyzer and MMAnalyzer. It includes a GUI that makes it easy to compare the characteristics of the two; each has its own strengths.
je-analysis-1.5.3.jar
- The JE segmenter, a Chinese word segmentation package for search engine development. A must-have for developers.
lingpipe-3.6.0
- An open-source Java toolkit for natural language processing. LingPipe already offers a rich set of features, including topic classification, named entity recognition, part-of-speech tagging, sentence detection, query spell checking, interesting phrase detection, clustering, character language modeling (Character
LTP
- A Java example of calling the HIT (Harbin Institute of Technology) LTP natural language processing tool. It calls the DLL through JNI to perform Chinese word segmentation, part-of-speech tagging, dependency tree construction, and so on.
phpanalysis
- A component-free PHP word segmentation algorithm I wrote some time ago. Algorithms of this kind are relatively rare; it can be used for SEO, front-end segmentation for search, and similar purposes.
windows_JNI_32bit
- ICT word segmentation program interface, for Chinese text word segmentation and part-of-speech tagging.
PERL
- Word segmentation in Perl using a forward matching algorithm: the word list is loaded into a hash, and the text is segmented by matching against it.
ChineseWordsDemo
- A Java program for Chinese word segmentation using LingPipe (an open-source Java toolkit for natural language processing).
heritrix-1.14.3-src
- A high-performance word segmentation algorithm implemented in Java. It can automatically perform minimal segmentation, and users can filter by segmentation category.
qygl
- A common Chinese word segmentation component for Lucene that neatly encapsulates Chinese word segmentation for search engine development.
ictclas4j.doc
- Detailed documentation of the ictclas4j Chinese word segmentation technique.
IRSplit_new
- Chinese word segmentation implemented in Java, built on HIT's IRSplit.
lucene-2.9.1
- Lucene, presumably the latest version. Its tokenization and retrieval features are particularly strong; for Chinese word segmentation, it works better when combined with Paoding.
chinese_segment
- A Java implementation of a Chinese word segmentation algorithm; the dictionary is stored as a text file.
fenci
- Chinese word segmentation code written in Java, intended for search.
php_programming_smallest_compound_word_segmentatio
- PHP code for a compound word segmentation algorithm based on minimal segmentation.
word
- A simple word segmentation algorithm implemented in Java, with automatic matching and detailed code comments.
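
For readers unfamiliar with the forward matching approach mentioned in the PERL entry above (a word list held in a hash, matched against the text from left to right), here is a minimal sketch in Python. It is not taken from any of the listed packages: the function name `forward_max_match`, the `max_word_len` parameter, and the toy dictionary are all illustrative assumptions, and it uses the common "longest match first" variant of forward matching.

```python
def forward_max_match(text, dictionary, max_word_len=None):
    """Greedy left-to-right segmentation that prefers the longest dictionary word.

    Illustrative sketch only; names and defaults are assumptions,
    not the API of any package in the list above.
    """
    words = set(dictionary)  # hash-based lookup, as in the Perl entry's description
    if max_word_len is None:
        max_word_len = max((len(w) for w in words), default=1)
    max_word_len = max(1, max_word_len)  # always try at least one character

    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a dictionary word matches.
        for size in range(min(max_word_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is emitted as a fallback when nothing matches.
            if size == 1 or candidate in words:
                tokens.append(candidate)
                i += size
                break
    return tokens


if __name__ == "__main__":
    toy_dictionary = ["中文", "分词", "中文分词", "程序"]  # hypothetical word list
    print(forward_max_match("中文分词程序", toy_dictionary))
```

Because the longest candidate is tried first, the result depends heavily on dictionary coverage; real segmenters such as those listed above typically add further disambiguation on top of this kind of matching.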