资源列表
WebCrawler
- 本源码简单易懂,便于JAVA初学者参考编程,适合研究搜索引擎-the source straightforward, easy reference beginners JAVA programming, for the study of search engine
crawl
- 本模块是我自己开发的网络爬虫工具的核心代码,希望对大家学习搜索引擎有帮助-This module is developed my own web crawler tools, the core code, we want to learn search engine help
cgoogle_src
- 一个象google一样的搜索引擎的源代码
seek
- lapc中对生成矩阵中存在的短环进行搜索,可以搜4,6,8,10环!
stop-words-list
- 在搜索中的无效词等,包括中文,英文两个文档。基本包含了见的所有无效词-Invalid words in the search, including the English and Chinese documents. See all basically contains invalid word
ssdfile.zip
- 简单的全路径全文搜索的程序
Searcher
- 一个搜索引擎的demo,可以进行简单的搜索, -a demo of the search engine can perform simple searches
BlueSearch
- 搜索数据取自百度网站,可实现站内搜索和互联网搜索,速度超快.-The data of searching comes from www.baidu.com. The software can search not only the site,but the internet.And the speed is quit high!
TikaTest
- 关于Tika组件的使用示例,自己平时测试时用的可以支持各种文件到String的转换-About Tika components using the sample, usually used when testing can suppo
src
- PageRank算法, 包括 standard PageRank 以及 simple PageRank-PageRank algorithm, including the standard PageRank and simple PageRank
guangduyouxiansuosou
- 此算法是广度优先搜索的算法实现,已通过测验。-This algorithm is a breadth-first search algorithm, has passed the test.
Spider
- search engine spider