当前位置:   article > 正文

python爬取孔夫子旧书网某一店铺所有书籍的书名 出版社 品相_python采集孔夫子网

python采集孔夫子网
  1. import requests
  2. from lxml.html import etree
  3. import time
  4. for i in range(1,6):
  5. print('http://shop.kongfz.com/34186/new/0_50_0_0_{}_sort_desc_0_0/'.format(i))
  6. url='http://shop.kongfz.com/34186/new/0_50_0_0_{}_sort_desc_0_0/'.format(i)
  7. res = requests.Session().get(url, headers={'User-Agent': 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/48.0.2564.116 Safari/537.36'})
  8. #print(res.text)
  9. html = etree.HTML(res.text)
  10. div_books = html.cssselect('a.row-name')
  11. print("hello")
  12. print(len(div_books))
  13. #print(div_books)
  14. i=0
  15. for book in div_books:
  16. print(html.cssselect('a.row-name')[i].text)
  17. print(html.cssselect('div.row-author')[i].text)
  18. print(html.cssselect('div.row-quality')[i].text)
  19. print()
  20. i=i+1
  21. print(i)
  22. time.sleep(3)

声明:本文内容由网友自发贡献,转载请注明出处:【wpsshop博客】
推荐阅读
相关标签
  

闽ICP备14008679号