资源列表
spider_engine
- 分析网页代码,提取url进行散列处理,提交客户端程序进行排重处理,然后存入客户机数据库,随后根据数据库中的url列表遍历整个网络。-Analysis of web code, extract the hashed url, submit re-schedule the client program to deal with, and then stored in the client database, and then the url list in the database through
mysearch
- 硕士读书时候,写的一个校园内网搜索引擎的程序,功能比较简单,但基本框架完整,可以提供初学者一些帮助。-When master reading, writing, web search engines within a campus program, function relatively simple, but the basic framework of integrity, can provide some help for beginners.
getEmailAddress
- 自动从论坛用户页面获取邮件地址,可自动翻页获取-User page automatically from the forum to get e-mail address, can automatically flip to get
sou
- 排序优化的搜索引擎--详细的搜索引擎介绍-Search engine ranking optimization- detailed descr iption of search engine
SSYQYHMFS
- 搜索引擎优化魔法书,帮助你的网站快速的被搜索引擎收录-Magic search engine optimization to help your site quickly indexed by search engines
test
- 一个小的爬虫程序,可以利用正则表达式匹配字符串,提取有用信息-spider program
WinSpider_src
- 网页爬虫。用于搜集,获取网页,并保存下来,供搜索使用-web-spider
larbin-2.6.3
- 一个高效的网络爬虫,可以自行修改配置文件,为linux下工作环境,很具有参考意义-An efficient Web crawler that can modify configuration files for linux work environment, it is a reference value
IndexerAndRetriever
- text indexer and retrieval search engine
197s
- 搜索引擎源代码,以任意2个搜索引擎对比显示的方式,让网友们在查找信息的时候一键比较和筛选,既为大家节约了时间,也为其提供了便携的搜索结果比较。-Search engine source code to compare any two search engines display, which allows users to find information when in their comparison and selection of a button, saving time both f
fenci
- 帮组我们实现中文分词,程序较为粗糙,请见谅,-Help us to achieve Chinese word group, the program is more rough, please forgive me,
mySegment
- 类库程序,基于词典的简单分词,可分中英文混合的情况-Simple dictionary-based segmentation procedure