文件名称:Java爬虫软件
-
所属分类:
- 标签属性:
- 上传时间:2019-09-17
-
文件大小:6.54mb
-
已下载:0次
-
提 供 者:
-
相关连接:无下载说明:别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容来自于网络,使用问题请自行百度
Java爬虫软件,爬取网站的URL后保存到Mongo数据库里面,并记录哪些爬过了,哪些没爬
(系统自动生成,下载前可以参看下载内容)
下载文件列表
压缩包 : Crawler4.zip 列表 Crawler/.classpath Crawler/.project Crawler/.settings/.jsdtscope Crawler/.settings/org.eclipse.core.resources.prefs Crawler/.settings/org.eclipse.jdt.core.prefs Crawler/.settings/org.eclipse.wst.common.component Crawler/.settings/org.eclipse.wst.common.project.facet.core.xml Crawler/.settings/org.eclipse.wst.jsdt.ui.superType.container Crawler/.settings/org.eclipse.wst.jsdt.ui.superType.name Crawler/WebContent/Index.jsp Crawler/WebContent/JS/jquery-1.8.0.js Crawler/WebContent/META-INF/MANIFEST.MF Crawler/WebContent/WEB-INF/lib/assertj-core-1.5.0.jar Crawler/WebContent/WEB-INF/lib/commons-codec-1.6.jar Crawler/WebContent/WEB-INF/lib/commons-collections-3.2.1.jar Crawler/WebContent/WEB-INF/lib/commons-io-1.3.2.jar Crawler/WebContent/WEB-INF/lib/commons-lang-2.6.jar Crawler/WebContent/WEB-INF/lib/commons-lang3-3.1.jar Crawler/WebContent/WEB-INF/lib/commons-logging-1.1.3.jar Crawler/WebContent/WEB-INF/lib/commons-pool-1.5.5.jar Crawler/WebContent/WEB-INF/lib/fastjson-1.1.37.jar Crawler/WebContent/WEB-INF/lib/guava-15.0.jar Crawler/WebContent/WEB-INF/lib/hamcrest-core-1.3.jar Crawler/WebContent/WEB-INF/lib/httpclient-4.3.3.jar Crawler/WebContent/WEB-INF/lib/httpcore-4.3.2.jar Crawler/WebContent/WEB-INF/lib/jedis-2.0.0.jar Crawler/WebContent/WEB-INF/lib/json-path-0.8.1.jar Crawler/WebContent/WEB-INF/lib/json-smart-1.1.1.jar Crawler/WebContent/WEB-INF/lib/jsoup-1.7.2.jar Crawler/WebContent/WEB-INF/lib/junit-4.11.jar Crawler/WebContent/WEB-INF/lib/log4j-1.2.17.jar Crawler/WebContent/WEB-INF/lib/mongo-2.7.2.jar Crawler/WebContent/WEB-INF/lib/slf4j-api-1.7.6.jar Crawler/WebContent/WEB-INF/lib/slf4j-log4j12-1.7.6.jar Crawler/WebContent/WEB-INF/lib/webmagic-core-0.5.2.jar Crawler/WebContent/WEB-INF/lib/webmagic-extension-0.5.2.jar Crawler/WebContent/WEB-INF/lib/xsoup-0.2.4.jar Crawler/WebContent/WEB-INF/web.xml Crawler/build/classes/crawler.properties Crawler/build/classes/log4j.xml Crawler/build/classes/mongo.properties Crawler/build/classes/pers/ghost/mongo/dao/AbstractBaseMongoTemplete.class Crawler/build/classes/pers/ghost/mongo/dao/MongoCrawerHtmlDao.class Crawler/build/classes/pers/ghost/mongo/dao/MongoCrawerUrlDao.class Crawler/build/classes/pers/ghost/mongo/dao/MongoCrawerUrlStatusDao.class Crawler/build/classes/pers/ghost/url/bean/CrawlerResult.class Crawler/build/classes/pers/ghost/url/bean/UrlResult.class Crawler/build/classes/pers/ghost/url/configure/CrawlerConfiguration.class Crawler/build/classes/pers/ghost/url/configure/MongoConfiguration.class Crawler/build/classes/pers/ghost/url/crawler/Crawler.class Crawler/build/classes/pers/ghost/url/crawler/DownLoadFilePipeline.class Crawler/build/classes/pers/ghost/url/crawler/StopQueueScheduler.class Crawler/build/classes/pers/ghost/url/crawler/UrlCrawlerProcessor.class Crawler/build/classes/pers/ghost/url/crawler/UrlDisplayProcessor.class Crawler/build/classes/pers/ghost/url/crawler/UrlDownLoadProcessor.class Crawler/build/classes/pers/ghost/url/server/CrawlerServer.class Crawler/build/classes/pers/ghost/url/server/CrawlerStopServer.class Crawler/build/classes/pers/ghost/url/server/CrawlerUrlGetServer.class Crawler/build/classes/pers/ghost/url/utils/UrlRegex.class Crawler/config/crawler.properties Crawler/config/mongo.properties Crawler/error.log Crawler/resource/log4j.xml Crawler/src/pers/ghost/mongo/dao/AbstractBaseMongoTemplete.java Crawler/src/pers/ghost/mongo/dao/MongoCrawerHtmlDao.java Crawler/src/pers/ghost/mongo/dao/MongoCrawerUrlDao.java Crawler/src/pers/ghost/mongo/dao/MongoCrawerUrlStatusDao.java Crawler/src/pers/ghost/url/bean/CrawlerResult.java Crawler/src/pers/ghost/url/bean/UrlResult.java Crawler/src/pers/ghost/url/configure/CrawlerConfiguration.java Crawler/src/pers/ghost/url/configure/MongoConfiguration.java Crawler/src/pers/ghost/url/crawler/Crawler.java Crawler/src/pers/ghost/url/crawler/DownLoadFilePipeline.java Crawler/src/pers/ghost/url/crawler/StopQueueScheduler.java Crawler/src/pers/ghost/url/crawler/UrlCrawlerProcessor.java Crawler/src/pers/ghost/url/crawler/UrlDisplayProcessor.java Crawler/src/pers/ghost/url/crawler/UrlDownLoadProcessor.java Crawler/src/pers/ghost/url/server/CrawlerServer.java Crawler/src/pers/ghost/url/server/CrawlerStopServer.java Crawler/src/pers/ghost/url/server/CrawlerUrlGetServer.java Crawler/src/pers/ghost/url/utils/UrlRegex.java
本网站为编程资源及源代码搜集、介绍的搜索网站,版权归原作者所有! 粤ICP备11031372号
1999-2046 搜珍网 All Rights Reserved.