Search resource list
Nutch
- Apache-Nutch1.3 study notes: very complete, with comprehensive content.
Hadoop-based-distributed-crawler
- This article discusses the basic techniques of search engines and the basic principles of web crawlers, and analyzes Nutch, a technical prototype for distributed crawlers.
Nutch-Teach
- A tutorial on the Nutch search engine architecture. Anyone who needs to build a crawler can learn from its design ideas.