Resource list
200090681
- A Chinese word segmentation method used in web page classification. A very useful reference; everyone can …
delphi_searchengine
- Searches over 200 internet search engines, then launches the user's default browser and shows the results. This source uses TLinkLabel by Vitaly Zayko on a few of the tabs. It is not needed by the search engine itself; however, it is included in …
htdig-3.1.6.tar
- A fairly large web search engine implemented in C++. Unfortunately, it only supports Unix systems.
CourseCrawler_1_0_0_final
- A crawler for specialist terminology: given a list of specialist websites, it searches them for web pages related to the specialist terms.
lucene_cn
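- Lucene Chinese search package, used for building search. Lucene is not a complete full-text indexing application; it is a full-text indexing engine toolkit written in Java that can easily be embedded in all kinds of applications to provide application-specific full-text indexing and retrieval. About the author: Lucene's contributor Doug Cutting is a veteran full-text indexing and retrieval expert. He was the main developer of the V-Twin search engine (one of the achievements of Apple's Copland operating system), later served as a senior system architect at Excite, and currently researches low-level Internet architecture. The goal of the Lucene he contributed is to …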
xunlong0.6
- A complete .NET search engine that uses LUCENE.net as its indexing core, with a distributed architecture. Includes WordNet, word segmentation, a spider, a simple web server, and more.
spider_demo
- A spider demo written in C#. It mainly implements multithreaded web page fetching and extraction of URLs from page content.
lucenesegment
- Source code for Lucene Chinese word segmentation; a handy component when building a search engine.
WebSearch(.NET)
- Xunlong Chinese Web search engine (.NET). Download the full code at http://gforge.osdn.net.cn/projects/xunlong/ Released under the LGPL. Author: Zhang Dong (张冬), Ningxia University, zd4004@163.com. Technical discussion is welcome at http://blog.163.com/zd4004/ 2007.2.26
searchEngineArticle
- Several articles introducing search engine technology, broadly covering a search engine's component modules and the related techniques.
htmlparser
- An HTML parser, part of the Majestic-12 distributed search engine. Author: Alex Chudnovsky, Majestic-12 Ltd (UK). This is version 3.0; its performance has been optimized many times and the documentation is fairly complete. It can also be downloaded from http://www.majestic12.co.uk.
zilverline-src-1.5.0
- Desktop search engine code, free for everyone to download. For details, refer to the website.