搜索资源列表
doubanzhuaqu
- 可以自动去豆瓣妹子网页抓取所有的妹子照片并保存到本地-Can automatically crawl all pages go watercress sister sister photo and save it to local
Douban
- scrapy爬虫,爬取豆瓣评分大于8.5分的电影名单,结果存储于MySql数据库。-scrapy reptiles, crawling watercress score greater than 8.5 of the list of films, the result is stored in the MySql.
getmovie
- 利用python爬虫爬取豆瓣电影评论并分类评论类型。(get the comment of some movies and classify the comment)
DoubanMovie250DataMining
- 用于抓取豆瓣电影前250位信息,可增加或修改需要抓取的信息(To crawl the information of Top250 movies in www.douban.com, if you need ,you can edit file to add or change the information you need.)
Crawler.tar
- 利用了python3.5编写了一个爬虫,爬取豆瓣上电影《声之形》的评论,并统计评论词的频率,制作了词云(Using python3.5 to write a crawler, climb the comments on the movie "sound shape", and statistics the frequency of the comment word, making the word cloud)
spider
- 豆瓣影评的爬虫,不依赖任何库,一定程度反反爬虫(a spider of douban's film critic,without any releases)
hearder.py
- 利用python抽取单个电影的豆瓣影评信息(use python to get all user reviews from douban movie site)
爬取豆瓣电影Top250
- 通过python语言,利用爬虫、词云等模块,爬取豆瓣电影评分前250(Climbing the top 250 of Douban Movie)
豆瓣
- 使用爬虫从豆瓣官网获得影评TOP250的电影,以Excel文件的形式存储在本地。(Use the crawler to get the top 250 movie reviews from Douban official website and store them locally as Excel files.)