搜索资源列表
htmlparser
- Csharp编写网页分析源代码!对于做搜索引擎有一定的帮助。-Csharp prepared analysis website source code! Search engines for so helpful to a certain extent.
sharpictclas
- sharpictclas分词系统_1.0,一个用CSHARP编写的分词系统
CSharpSpider
- 一个用Csharp做的网络蜘蛛,请值的去研究一下.
CSharpSpider
- csharp的蜘蛛程序,比较好,推荐使用
spider_demo.rar
- C#多线程网络爬虫,使用线程池来控制线程,效率不错。,C# multi-threaded network reptiles, use the thread pool to control the thread, good efficiency.
WebSpider_src.rar
- 一个非常好的 C# 网络爬虫程序源码清晰,A very good C# Web crawler program source code clearly
ContentAnalyzer
- 搜索引擎正文提取程序,通过html分析和正则,去掉html代码,保留网页正文,只针对中文有效。英文稍加修改即可使用。-The body of the search engine extraction process, through analysis and regular html remove html code to retain the page text, only effective against the Chinese. Slightly modified to use Engl
OpenWebSpiderCS_v0.1.3
- 一个web爬虫 CSharp开发的,很小很不错,是个开放源代码的项目-CSharp developed a web crawler, very small and very good open source projects is
Web-Crawler-Cpp
- 网页抓取,可以实现网页的下载,并过滤出想要的内容。很实用-Web crawling, Web page downloads can be achieved, and to filter out unwanted content. Very practical
TwitterData-csharp
- 爬社交网络数据程序, 用C#编写,比较基本,适用于初学者学习交流。-It is used to crawl data from online social networks. Realized basic functions such as making API connection, request data, etc.
ASP.NET 数据库搜索引擎
- 简单的数据搜索引擎-simple data search engine
google-maps-static
- google map 的api的使用,大家可以参考一下 -google map of the use of the api, we can refer to
GooglePageRankQuery
- 查询Google PageRank 破解全过程 1. 装个 google工具条 开启pagerank 2. 找个网络 sniffer 软件, 运行浏览器随便打开个网站, 3. sniffer将记录 google工具条发给 google的数据包 分析可得,传输协议是 http, 数据包内除了 有访问网站的地址, 关键还有个 ch参数 , ch参数根据网站地址不同 发生变化(看来关键是 ch怎么计算出来的!) 4.分析google工具条,得到计算 ch的汇编代码,然后翻
ESP
- 使用dotnet + 多线成的爬虫程序。 主要用于sina , 163 等大型论坛。 后台搭配数据库, 已经实现了 下载后的搜索, 图片已经实现下载在分类目录。 -Using dotnet+ Multi-line program into the reptiles. Mainly used sina, 163 and other large forums. Background with a database, has become a reality after downloa
NLuke0.12
- 这是一个基于网络的,扩展了lunce的一个搜索分词工具-This is a web-based, expanded lunce participle of a search tool
NewsCollection
- 新闻采集,可配置成采集任意新闻.包括图片自动下载,过滤HTML等功能-news collection
SearchBiDui
- 可以对搜索网页信息进行抓取,包括地址,关键字描述等-Information on the web page can crawl
CSharpSpider
- csharp 网络爬虫,升级版,适合初学者-CSharp Network reptiles, upgrade version, suitable for beginners
AnalyzerViewer_source
- Lucene.Net is a high performance Information Retrieval (IR) library, also known as a search engine library. Lucene.Net contains powerful APIs for creating full text indexes and implementing advanced and precise search technologies into your programs.
SearchEngine
- 用CSharp编写的源程序,开发环境是VS2005,这是一个小型搜索引擎系统。-Prepared using CSharp source, development environment is visual studio 2005, this is a small search engine system.