搜索资源列表
supervisedWSD
- 利用贝叶斯分类原理实现多义词的消歧。首先利用训练语料进行训练,然后基于机器已经获取的知识的基础上对生语料进行词义标注。
SIFT
- SIFT程序,用于图像识别,对于各种变换有很强的鲁棒性,是图像标注方面的精典算法-SIFT program for image recognition, for a variety of transformation has a strong robustness, is the classical algorithm for image annotation terms
HmmPos
- 基于HMM的中文词性标注代码,内有详细注释,并附有练习样本-Chinese POS tagging HMM-based code, with detailed notes, along with sample exercises
fortran
- 物性计算的核心程序,可用于各种程序的调用,由美国标注协会编写-The core of the calculation of the program, can be used for a variety of procedures, by the American Association for labeling
HanLP-1.2.7
- HanLP是一个致力于向生产环境普及NLP技术的开源Java工具包,支持中文分词(N-最短路分词、CRF分词、索引分词、用户自定义词典、词性标注),命名实体识别(中国人名、音译人名、日本人名、地名、实体机构名识别),关键词提取,自动摘要,短语提取,拼音转换,简繁转换,文本推荐,依存句法分析(MaxEnt依存句法分析、神经网络依存句法分析)。-HanLP is a dedicated to popularize NLP technology to production environment of
download_tweets
- 能够进行词性标注、词典匹配、否定词匹配,能够进行CRF之前的模型准备工作(Can do part of speech tagging, dictionary matching, negative word matching)