Resource list
google_hacker.pdf
- Google hacking exposed: using Google to break into systems.
ssdfile
- A simple full-path, full-text search program.
wininet-spider
- Web crawler that demonstrates multithreaded fetching of web page data with a configurable crawl depth. The Win32 API supports pre-emptively multithreaded applications, a very useful and powerful feature when writing MFC Internet spiders. The spider project is a ...
textcluster
- Source code for a text clustering algorithm, including an implementation of TF-IDF computation; written in Java.
rsSearch
- A simple full-path, full-text search program.
tspider
- TSpider is an application source code library that you can use in your own programs to scrape information from websites. It can be used to download whole websites, or just selected information from specific pages. Source code is in Delphi.
Soukey
- Soukey's open-source spider program. All of the source code is open, and it has a good graphical interface. This package is the runnable build; if you like it, you can download the source code from the official site.
ContentAnalyzer
- Main-content extraction program for search engines. It uses HTML analysis and regular expressions to strip the HTML code and keep the page's body text. As written it works only for Chinese; it can be used for English with minor modifications.
speder
- A simple implementation of a general-purpose search engine: full-featured, with short, practical code.
JSearchEngine
- Lucene search engine with PageRank.
GOOGLE
- A detailed introduction to using the Google search engine and to its development.
selectjava
- A search engine written in Java; a simple example, suitable for beginners to learn from.