python爬虫框架:Scrapy爬取网站数据案例
实战
1.自动模拟登陆豆瓣
(1).douban.py
(2).setting.py
USER_AGENT ='Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36'
2.爬取当当网数据入Linux中的mysql
(1)items.py
(2)dd.py
(3)pipelines.py
(4)setting.py中添加
ROBOTSTXT_OBEY = False
ITEM_PIPELINES = {
'dangdang.pipelines.DangdangPipeline': 300,
}