Search results: resource list
hadoop-0.7.1.tar
- hadoop: the cluster platform for Nutch. Its distributed programming model lets Nutch run automatically in parallel across a cluster of ordinary machines.
je-analysis-1.5.3
- Word segmentation source code developed in Java. It can be called from Lucene and Nutch to segment Chinese text.
Lucene+Nutch
- All source code from the book Lucene+Nutch, along with test code and several simple projects.
Lucenechapter11
- A small Nutch application. Reading through it should be quite helpful for learning the principles of retrieval systems.
nutchjar
- The two packages that are missing when the Nutch search engine source code is run in Eclipse. Import them and the code is ready to use.
search
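- A Lucene sample application containing the complete code from index building through web search. The database it uses is the dedecms one, which you can download yourself. config.xml is the configuration file, where you set the index directory and the user name and password for the data connection. The example can serve directly as a reference for building full-text search with Lucene.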
shy
- A search engine based on Lucene and Nutch that provides the functions of an ordinary search engine.
NUTCHseconddeveopment
- Secondary development on top of Nutch. Sure to be useful to anyone building a search engine.
Nutch-Web
- Building on a comparative analysis of representative open-source web crawling software (Nutch, Heritrix, WCT, and Web-Harvest), the paper proposes a Nutch-based targeted collection system for websites and focuses on the key issues: selecting seed sites, managing the crawl process, removing noise from web pages, and discovering new seed sites.
NUTCH_RM
- A detailed guide and introduction to Nutch, covering environment setup and testing, plus a fairly detailed description of the Nutch system architecture.
MyWordSpliter1
- A word segmentation program implemented in Java, for Chinese word segmentation in Nutch.
NutchAnalysis
- Fixes the problem of Nutch being unable to parse Korean. The file is a .jj file that must be processed with JavaCC. As anyone who has used Nutch will know: generate the five files and replace the existing ones, re-crawl, then run ant to package a new nutch-1.0.jar and replace the copy under Tomcat.
Detailed-Nutch-command
- A detailed guide to Nutch commands, systematically covering the various commands for crawling, querying, indexing, and so on.
apache-nutch-2.1-src
- Nutch 2.1 source code; application code for a distributed search engine.
ddh_v1.0
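- DDH vertical search engine, commercial edition. Billed as currently the only vertical search engine system on the Internet that can be operated commercially, it is written in Java and is a web information integration system that can run on large-scale clusters. DDH integrates Nutch (an open-source search engine system), UCI (a web page information extraction system), and SOLR (an enterprise search application server). In terms of scalability, performance, and stability, the description ranks DDH among the top vertical search engine systems.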
Lucene+nutch Search Engine Development
- Lucene search code for developing a search engine.