资源列表
ModernInformationRetrieval
- Google写的,信息检索方面的文章非常好-verygood,perfect,and you love it
K-Means
- 一个很好的C均值聚类算法!通过运行此文件可以很好的进行数据的分类。-K-means
hibase-0.1.0.tar
- 一个使用的搜索引擎例子,可以在linux下运行-One example of the use of search engine, you can run linux
KARP_RAB
- karp rabbin searching algorithm
spider
- 网络爬虫,能实现基于关键词的抓取,是网络收索的好助手-spider
mifluz-0.24.0.tar
- mifluz 的目的是提供一个存储倒排索引c++库,允许存放关键词以便事后进行搜索。-The purpose of mifluz is to provide a C++ library to store a full text inverted index. To put it briefly, it allows storage of occurrences of words in such a way that they can later be searched. The basic id
interleaver
- interleaver research
Hadoop
- 基于Hadoop集群的分布式日志分析系统研究-Distributed Hadoop clusters based on log analysis system
Nutch
- 一种新型的基于Nutch的搜索引擎技术,时下热门研究方向-A new search engine based on Nutch technology research nowadays popular
Design
- 软件名称:基于主题的Web爬行器 运行环境:Windows 2000/XP/2003 实现环境:Eclipse 编程语言:Java 功能:实现主题网页的抓取 -Software name: theme-based Web crawler operating environment: Windows 2000/XP/2003 achieve environmental: Eclipse programming language: Java features: realizati
seoseach
- 一个适合SEO的自动检索点击源码,可实现自动检索 自动下翻10页-An automatic search for SEO click source, can be automatically retrieved automatically turned 10 next
webcrawler
- 一个java 开发的网络爬虫,采集功能比较强大-Development of a java web crawler, collecting more powerful features