文件名称:cobra
介绍说明--下载内容来自于网络,使用问题请自行百度
有js逻辑的页面,对网络爬虫的信息抓取工作造成了很大障碍。DOM树,只有执行了js的逻辑才可以完整的呈现。而有的时候,有要对js修改后的dom树进行解析。在搜寻了大量资料后,发现了一个开源的项目cobra。cobra支持Javascr ipt引擎,其内置的Javascr ipt引擎是mozilla下的 rhino,利用rhino的API,实现了对嵌入在html的Javascr ipt的解释执行-There js a logical page, the information on the Web crawler to crawl, caused a significant obstacle. DOM tree, only the implementation of the js logic can complete the presentation. And sometimes, there js want to modify the dom tree after parsing. A lot of information in the search and found an open source project cobra. cobra support Javascr ipt engine, which is mozilla Javascr ipt engine built under the rhino, the use of rhino' s API, allowing for the Javascr ipt embedded in the html interpreted
(系统自动生成,下载前可以参看下载内容)
下载文件列表
本网站为编程资源及源代码搜集、介绍的搜索网站,版权归原作者所有! 粤ICP备11031372号
1999-2046 搜珍网 All Rights Reserved.