资源列表
textFCM
- 应用FCM(模糊c均值聚类)算法到文本聚类 采用两种方法计算文本相似度 采用ShootSeg分词 采用sogou互联网词库简化特征值计算-err
Taobao1
- 多语言淘宝客,程序可以根据域名不同自动翻译成多国语言。例如http://en.test.com就是英文,http://ru.test.com就是俄罗斯文。-Multilingual Taobao off, the program can automatically translate the domain into many different languages. For example http://en.test.com is English, http://ru.test.com is
ISN
- 常见的中文内码一般有GB2312,GBK和台湾那边用的BIG5,有时候看一些台湾编程里的资料,都是乱码-the exchage of the ISN
CharCodeConversion
- 字符集的UTF8、ANSI、Unicode编码转换-Character set UTF8, ANSI, Unicode encoding conversion
stop_wordslk
- 这是一个中文停用词汇表,适合于做学术研究,软件开发-this is a Chinese stop words table which is suitable for studying research and so on
JDBC-
- JDBC讲解PPT,JDBC知识点概述,JDBC框架-JDBC explain the PPT, JDBC overview of knowledge points, JDBC framework
big5togb
- 一个big5转换gb的例子--A VC code transform BIG5 to GB
pinyin
- 将汉字转换为拼音全拼,用C语言编写,iphone开发可以使用。-Chinese characters into pinyin spelling, written in C, iphone development can use.
PLSA
- PLSA 的Java实现,可以用于图像处理,文本分类,文本聚类等-code of PLSA in JAVA
proWordSegment
- 正向最大匹配中文分词c++源程序,在visual studio 2008中调试通过。-Chinese are the largest sub-word match c++ source code, visual studio 2008 in debug through.
VIPS
- 基于视觉的web页面分割算法(vips)-VIPSa Vision-based Page Segmentation Algorithm
freetype-doc-2.3.9
- 免费的中文文字显示接口,能够利用其显示各种中文字体-Chinese free text display interface, to take advantage of their shows a variety of Chinese fonts