搜索资源列表
Lucene2.0Heritrix
- 是对网络爬虫Heritrix的介绍 ,Heritrix是一个由java开发的 开源的web网络爬虫 -Is an introduction to Heritrix Web crawler, Heritrix is an open-source web development java web crawler
AMR
- 讲述概念格算法和本体算法,用于过滤URLs,指导爬虫进行搜索。-A concept lattice algorithm and ontology algorithm are used to filter the URLs and guide the crawler to search.
fwdthesisreport
- Migrating Parallel Web Crawler used for information retrival a review and complete thesis work-Migrating Parallel Web Crawler used for information retrival a review and complete thesis work