资源列表
HostConfig
- PRogram to resolve dns to ip and update yourhost config.
internet
- 图书管理系统的选项页面,internet动态网页制作。-Options page of the library management system, internet dynamic web page.
Char04
- 网络搜索引擎代码,内涵各种爬行算法和相关子程序-This program code designed an eDonkey network crawling system which could avoid being added to the blacklist of the central server and break the count restriction of the results when crawler search something from the server.Af
UpdateAddrIndex
- 电信行业,编写的地址搜索引擎的代码,功能是更新索引库的类-Telecommunications industry, to write the address of the search engine code update the index library class
GenWordlib
- 电信行业,编写的地址搜索引擎的代码,功能是产生词典,用于分词。-Telecommunications industry, the address written in the code of the search engine, the function is to generate dictionary for word.
GenAddrIndexAll
- 电信行业,编写的地址搜索引擎,此类是用于建立地址库的源代码-Telecommunications industry, write the address search engine, such is used to establish the source code of the address database
GenAddrSegmIndex
- 电信行业,地址搜索的程序,此代码功能是根据区域,对更新索引库-Telecom industry, Address Search program, this code function is based on the region, the index is updated library
TokenTest
- 电信行业,此代码是地址搜索程序的一部分,该代码的功能是分词的测试程序。-Telecommunications industry, address search program, the function of this code is written in the sub-word test.
MemCache
- memcache缓存使用,能够减轻数据库压力,接口非常简单-memcache cache use database can reduce pressure, the interface is very simple.
this-is-search-engine
- 一本关于搜索引擎的书籍,强调原理而不纠缠技术细节-Books of a search engine, stressed that the principle of not entangled technical details
Spider-Java
- 网络爬虫的简要介绍及一点源代码,分享给想要学习爬虫的人-The web crawler brief introduction and point-source code
wordbag
- 根据一个人物名单文件,查找wekipedia上相应网页,读取网页文本,并统计每个人物在每个网页上出现的次数,最终形成word bag,人物500人,运行时间6分钟左右。-from a namelist making a word bag