爬虫之爬取网页表格数据（三）

技术2022-07-10 168

爬虫之爬取网页数据（三）

爬虫之爬取网页表格数据（二）只是爬取到一条数据，如果想爬取多条数据该怎么做呢？

只需要修改spider下example.py文件的代码即可

如下：`

def parse(self, response): # housename = response.xpath('//*[@id="content"]/div[1]/ul/li[1]/div[1]/div[1]/a/text()').extract_first() # price = response.xpath('//*[@id="content"]/div[1]/ul/li[1]/div[1]/div[6]/div[1]/span/text()').extract_first() # item = FirstspiderItem() #创建一个Item实例 # item["housename"] = housename # item["price"] = price # return item ti_list = response.xpath('//*[@id="content"]/div[1]/ul/li') for tr in ti_list: item = FirstspiderItem() # .表示从当前位置开始 #这里从li开始 item["housename"] = tr.xpath('./div[1]/div[1]/a/text()').extract_first().strip() item["price"] = tr.xpath('./div[1]/div[6]/div[1]/span/text()').extract_first().strip() yield item`

其余地方不改动，重新运行项目，得到如图所示结果

Processed: 0.008, SQL: 9