Resource list
operaApi-(1)
- SDK for the Douban open platform, already packaged and compiled with Maven; it can be run directly once imported into a project.
test1
- Strips the HTML tags from fetched web page text to obtain the main body content.
Web-search-tools
- URL retrieval tool for web search, retrieval, and queries.
MyGoodSearch
- A complete search engine package written in C# for the .NET environment, including a crawler, an indexer, and a front-end user interface page. Offered as a base for further development.
masm_smb
- Scanner utility for shared resources. The remote password cracker can be customized. This is only a blank skeleton and contains errors. Operation is simple: specify IP addresses with port 445 open and you get a list of all shared resources if t
google-http-java-client-1.17.0-rc
- Google's HTTP client tool, written in Java, usable for Google HTTP applications.
pan_sou
- Search engine that aggregates resources from the major network disk (cloud storage) services; powerful, with a customizable interface.
spider
- Web crawler that extracts information from mainstream news sites.
Crawler
- Small Java-based program for crawling data; source code only.
id3
- High-performance, high-efficiency ID3 decision tree classification based on the vector space model.
Search-Engine
- Introduction to search engines for beginners, covering the nature of search, developments in the search industry, and more.
02_lucene_searcher
- Lucene study notes, source code, annotations, and more, for anyone who wants to learn.