搜索资源列表
Chinese-Word-Segment-And-POS-Tagger
- 实现了中文分词和词性标注程序。分词方法采用“三词正向最长匹配”。词性标注使用HMM方法,用Viterbi算法实现。“三词正向最长匹配”保持了“正向最长匹配算法”快速的特点,同时提高了分词的准确性。-Chinese word segmentation and implemented procedures for POS tagging. Segmentation Methods, " the longest three-match positive words." POS tag
ChineseSplit
- 一个基于VB.NET开发的中文分词及关键词提取系统,采用双向最大匹配、词频统计、快速排序等算法实现。-VB.NET developed based on Chinese word segmentation and Key Extraction System, the largest two-way matching, word frequency statistics, such as quick sort algorithm.
ChineseSegment
- VC6.0实现的中文自动分词,包括最大正向匹配和最大逆向匹配切分,可运行-VC6.0 achieve automatic Chinese word segmentation, including the largest positive match and the maximum matching segmentation reverse, can run
proWordSegment
- 正向最大匹配中文分词c++源程序,在visual studio 2008中调试通过。-Chinese are the largest sub-word match c++ source code, visual studio 2008 in debug through.
SearchEngine
- 搜索引擎模块是一个很主要且常见的功能。一个好的搜索引擎能给用户使用带来方便。在“站内”选项卡下的搜索框中输入欲查询的关键词,如“PHP 字符串,处理技术!”,单击“极速搜索”按钮,系统会首先过滤欲查询关键词的标点符号,然后对过滤标点符号的关键词进行分词操作,接下来在数据库中检索与之匹配的信息资源。同时实现搜索计时及查询结果汇总功能,并在结果集中高亮显示查询关键词,最后以分页形式输出匹配结果给用户。-Search engine module is one of the main and commo
77433632mmseg-v0.1
- 使用最大正向匹配算法,实现中文自动分词,对于初学者有很高的参考价值。-The biggest being the use of matching algorithm, the realization of Chinese word segmentation, for beginners, has high reference value.
KSeg4J.1.0
- 简体中文机械分词模块,实现正逆向最大匹配消除歧义,jar封装,可以导入后直接使用-a simplified Chinese segmentation Jar
Lucene
- Lucene的基于正向最大匹配分析器,用于中文分词,可将查准率提高-Lucene chinese analyzer based by FMM algorithm
segment
- 基于n元语法的分词。先训练后,再通过前后向最大匹配初步分词,在通过2元语法来消岐。-Based on the n-gram sub-word. The first training and then through to the maximum matching before and after the initial word, in through a 2-gram to eliminate qi.
java
- java最大正向匹配算法 帮助大家了解分词原理-java maximum matching algorithm to help you understand the positive word principle
Segmenter
- 正向匹配正向最大分词算法,仅仅是个很初级的算法-Being the largest positive word matching algorithms
ChineseWordSegmentation1
- 中文分词-前向最大匹配方法;VS2008环境下实现-Chinese word- the former to the maximum matching method VS2008 environment to achieve
MainFrm
- 是设计并实现一个汉语自动分词系统。在分析了自动分词面临的主要困难和难点的基础上,旨在降低分词难度和提高分词精度,设计并实现了一个基于正向最大匹配的汉语自动分词系统。-The design and implementation of a Chinese word segmentation system. In the analysis of word segmentation and difficulty of the major difficulties facing based on the
fencisuanfa
- 用正向最大匹配发实现句子的分词。是基于词典的分词算法。该算法的特点是速度快,准确率高。-Made to achieve a positive match with a maximum sentence segmentation. Dictionary-based segmentation algorithm. The algorithm is characterized by fast and accurately.
SF
- 中文分词在中文信息处理中是最最基础的,无论机器翻译亦或信息检索还是其他相关应用,如果涉及中文,都离不开中文分词,因此中文分词具有极高的地位。中文分词入门最简单应该是最大匹配法-Chinese word segmentation in Chinese information processing is the most basic, whether in machine translation Yihuo information retrieval, or other related applic
sse4j
- SSE4J(Spatial Search Engine for Java)是针对地理信息数据源构建的垂直搜索引擎应用接口,是基于Lucene+JTS Topology Suite开源库设计的框架。 规划的SSE4J包含: 1)SSE4J应用开发包 2)SSE4J Webservice应用接口 3)SSE4J工具集 规划的SSE4J包含的功能: 1) POI搜索(名称、地址、类型等关键字搜索;周边搜索) 2)道路搜索、区域搜索 3)地址匹配、
Web-Chinese
- 中文自动分词技术是中文 Web信息处理的基础。文中以最大匹配法(MM)为基础 ,充分 考虑上下文(MMC) ,在内存中采用二分法进行分词匹配 ,有效地提高了分词的准确率和时效。-Chinese automatic segmentation technology is Chinese Web information processing foundation. The maximal matching method (MM) as the foundation, fully Conside
Chinese_wordspliter
- 使用正向最大匹配法的中文分词器 分别读入待分词文本文件及字典文件后,将经处理后的结果输出至另一文件予以保存。所用待分词文件样例约有字符184万个,处理时间在11s左右。程序使用C++语言编写。-Using the forward maximum matching device of Chinese words segmentation, respectively, to be read into the text file and dictionary file, the result will
EasySeg
- 实现中文分词的最大正向匹配算法,提前输入好文本。-Achieve maximum positive Chinese word matching algorithms, a good text input in advance
WordSegment
- 用C++开发的分词系统 运用基于哈希的逆向最大匹配算法 基于词典-Word in C development system uses a hash-based reverse maximum matching algorithm is based on dictionary