搜索资源列表
jspider-0.5.0-dev
- 一个由java实现的搜索引擎代码。实现对网页内容的分析和采集功能-a realization by the search engine code. Achieving the right Web content collection and analysis functions
weblech-0.0.3.tar
- WebLech is a fully featured web site download/mirror tool in Java, which supports many features required to download websites and emulate standard web-browser behaviour as much as possible. WebLech is multithreaded and comes with a GUI console.-WebLe
spider(java)
- 网页抓取器又叫网络机器人(Robot)、网络爬行者、网络蜘蛛。网络机器人(Web Robot),也称网络蜘蛛(Spider),漫游者(Wanderer)和爬虫(Crawler),是指某个能以人类无法达到的速度不断重复执行某项任务的自动程序。他们能自动漫游与Web站点,在Web上按某种策略自动进行远程数据的检索和获取,并产生本地索引,产生本地数据库,提供查询接口,共搜索引擎调用。-web crawling robots - known network (Robot), Web crawling,
webharvest1-project
- Web-Harvest是一个Java开源Web数据抽取工具。它能够收集指定的Web页面并从这些页面中提取有用的数据。Web-Harvest主要是运用了像XSLT,XQuery,正则表达式等这些技术来实现对text/xml的操作
jse
- 搜索引擎 说明:用JAVA实现的网络搜索引擎-search engine Note : JAVA Web search engine
使用Java搜索Internet
- Search Crawler 是用于Web搜索的一个基本的搜索程序,它展示了基于搜索程序的应用程序的基础框架。-Search Crawler Web search for a basic search procedures, it features based on the search application's basic framework.
Webloup
- WebLoupe is a java-based tool for analysis, interactive visualization (sitemap), and exploration of the information architecture and specific properties of local or publicly accessible websites. Based on web spider (or web crawler) technology. 开源搜索爬
heritrix.rar
- web 网络爬虫 用户可以使用它从网络上抓取想要得资源,开发者还可以扩展它的各个组件,来实现自己的抓取逻辑。,Reptile web network users can use it from the network you want to crawl resources, developers can also extend its various components, to achieve their own logic crawl.
searchenginecode.rar
- 主要工作是对web搜索程序进行研究;并且利用java语言实现了search crawler的搜索程序界面.,The main work is to study procedures for web search and the use of java language to achieve a search crawler search program interface.
Crawler
- 本人用c++开发的搜索引擎的网络爬虫 蜘蛛程序 欢迎参考。-I am using c++ developer' s Web crawler search engine spider welcome reference.
heritrix-1.14.4
- heritrix-1.14.4 纯JAVA开发的,开源的Web网络爬虫-heritrix-1.14.4 pure JAVA development, open source Web crawler
SearchCrawler
- java编写的网络爬虫程序用于检索网站资源和信息,多线程实例-java web crawler program written for searching website resources and information ,a multi-threaded example
crawler
- 一个针对分主题的网页分析和下载系统,能主动下载信息详细页-Automatically analyze and download classified web pages
Lucene
- 基于Lucene的Web站内信息搜索系统-Lucene-based Web site of the information search system
Search_Engine
- 描述了搜索引擎的系统结构,从网络机器人、索引引擎、Web服务器三个方面进行详细的说明,并通过实现一个新闻搜索引擎来进行例证。-Describes the search engine' s system structure, from the network robot, indexing engine, Web server three areas detailed explanation, and through the realization of a news search engi
spidering.tar
- spidering the web, work like crawler, and has visualization links. It is java
Design
- 软件名称:基于主题的Web爬行器 运行环境:Windows 2000/XP/2003 实现环境:Eclipse 编程语言:Java 功能:实现主题网页的抓取 -Software name: theme-based Web crawler operating environment: Windows 2000/XP/2003 achieve environmental: Eclipse programming language: Java features: realizati
webcrawler
- 一个java 开发的网络爬虫,采集功能比较强大-Development of a java web crawler, collecting more powerful features
JavaNetSpider
- Java网络爬虫(蜘蛛)源码 本程序利用java技术通过IP/TCP技术去捕捉网络数据。-Java web crawler (spiders) the source code The program use Java technology through the IP/TCP technology to capture network data.
Spider-Java
- 网络爬虫的简要介绍及一点源代码,分享给想要学习爬虫的人-The web crawler brief introduction and point-source code