搜索资源列表
wwwclusing
- In this paper, we propose a hierarchical clustering method using visual, textual and link analysis. By using a vision-based page segmentation algorithm, a web page is partitioned into blocks, and the textual and link information of an image c
TextExtract
- * 在线性时间内抽取主题类(新闻、博客等)网页的正文。 * 采用了<b>基于行块分布函数</b>的方法,为保持通用性没有针对特定网站编写规则。-Web text extraction code,* in linear time extract topic class (news, blogs, etc.) the body of the page. * using the < b > </b > line based on block