Resource list
movieSE
- A web crawler dedicated to scraping movie data. It consolidates the movie information and presents it through a graphical interface.
fenci
- Code that uses machine learning to learn Chinese word segmentation from existing text, then segments new text.
searchengine
- A simple offline search engine built with Lucene. It builds an index over a local collection of web pages, searches it, and returns the addresses of the matching pages.
python_sina_crawl
- A crawler for Sina Weibo. To run it: save all the code, open Main.py, and set LoginName to your Sina Weibo account and PassWord to your password. Run Main.py; the program creates a CrawledPages folder in the current directory and saves every crawled file in it.
search
- ASP website module development example: a site search system (fuzzy query). Site search is implemented as a fuzzy query, which leads into more advanced SQL statements. This program belongs to an advanced ASP tutorial and covers one of the most commonly used website features, a must-learn for building dynamic sites.
35dirpj_v2.2
- Program description: the 35dir website directory program is developed with PHP+MYSQL. This release is the v2.2 commercial edition.
googlem
- Source code for embedding Google satellite maps in a web page. If you need it, add it to the source of your own site's inner pages to call the map.
search-eginee
- Source code from a book on developing your own search engine with Lucene 2.0 + Heritrix.
baidusousuo
- Source code for 搜猫 ("search cat") V9.0, official commercial edition, a close imitation of the Baidu search engine.
hao123_5.0
- Source code for the hao123 site navigation page.
qhelper-4-10
- Java source code for downloading from Tencent Qzone.
3--blog_move-4-18
- Java source code for a simple crawler system covering Sina Blog, CSDN Blog, and Tencent Qzone.