资源列表
kgramjac
- 计算两个字符串的k-gram的jaccard系数,是信息检索理论判断两个字符串相似度的应用。-To calculate the jaccard value of the two strings, in terms of the k_gram theory.
ICTCLAS50_Windows_32_C
- 中科院分析系统 ICTCLAS的主要功能有:中文分词;词性标注;命名实体识别;新闻识别;用户词典-ICTCLAS segementword
LuceneInActionSRC.tar
- 搜索引擎Lucene的一本书的源码,对于看那本书确实很有帮助-Lucene search engine, a source book for Look at this book really helpful
LuceneInAction_SourceCode
- lucene是用在搜索引擎的开源工具,可以对所抓爬到的网页进行索引写入,对做好的索引可以进行快速的搜索。-Lucene is used in the open-source search engine tool, which can grasp onto to the website indexing write, the index can do rapid searches.
1
- 自己动手写搜索引擎第三章代码,随书光盘中的内容,整个太大,只能分别上传-Chapter code search engine to write himself, with the contents of the CD-ROM, the whole is too big, we were only able to upload
sxt_Lucene.rar
- 尚学堂的一个很不错的搜索引擎开发案例,内有详细开发文档及源码.,The school is still a very good search engine development case, which detailed the development documentation and source code.
MTGV
- 1, completely solve the source Website thumbnail problems. 2, increase the webmaster often use super chain tool. 3, increase the custom page perfect use, and in the home layout tool column, help to optimize. 4, increase classified catalogue, label
News Search3.01
- 一款新闻搜索软件-new information search software
heritrix-1.6.0-src
- 非常优秀的搜索引擎 LInux下 java版本的 robot-excellent search engine LInux under java version of the robot
bolangjiaoyu
- 一款功能强大的教育门户网站源码,asp+access,很适合参考-A powerful educational portal source asp+access very suitable for reference
clucene-0.9.8
- clucene是lucene的C版本。这是一个建立索引、搜索的函数库。-clucene lucene is the C version. This is an established index, search the libraries.
PDFBox-0.6.7a
- 采用java编写的处理PDF文档的程序,可从PDF文档中抽取txt文本,可与lucene搜索引擎相结合。-adopting the java programs compiled to dispose the PDF document, taking out the txt text from the PDF document, and combining with the lucene searcher.