搜索资源列表
libcharguess-src-1.0b.tar
- 判断一串字符是属于什么字符集的程序,如判断是否属于utf-8,gb2312
segment
- segment,一个简单的中文分词程序,命令行如下: java -jar segmenter.jar [-b|-g|-8|-s|-t] inputfile.txt -b Big5, -g GB2312, -8 UTF-8, -s simp. chars, -t trad. chars Segmented text will be saved to inputfile.txt.seg
GB2312_TO_UTF8
- 将gb2312编码的字符转换为UTF-8编码的字符。-convert gb2312 char to utf-8 char