
推薦閱讀??點擊標(biāo)題可跳轉(zhuǎn)這篇文章主要介紹了python 如何用爬蟲爬取京東ps4售賣情況,幫助大家更好的利用python爬取自己想要的數(shù)據(jù),感興趣的朋友可以了解下
代碼
#!/usr/bin/env python
# -*- coding: utf-8 -*-
# @File : HtmlParser.py
# @Author: 趙路倉
# @Date : 2020/3/17
# @Desc :
# @Contact : [email protected]
import?json
?
from?lxml?import?etree
import?requests
from?bs4?import?BeautifulSoup
?
?
url="https://search.jd.com/Search?keyword=ps4&enc=utf-8&wq=ps4&pvid=cf0158c8664442799c1146a461478c9c"
head={
'authority':?'search.jd.com',
'method':?'GET',
'path':?'/s_new.php?keyword=%E6%89%8B%E6%9C%BA&enc=utf-8&qrst=1&rt=1&stop=1&vt=2&wq=%E6%89%8B%E6%9C%BA&cid2=653&cid3=655&page=4&s=84&scrolling=y&log_id=1529828108.22071&tpl=3_M&show_items=7651927,7367120,7056868,7419252,6001239,5934182,4554969,3893501,7421462,6577495,26480543553,7345757,4483120,6176077,6932795,7336429,5963066,5283387,25722468892,7425622,4768461',
'scheme':?'https',
'referer':?'https://search.jd.com/Search?keyword=%E6%89%8B%E6%9C%BA&enc=utf-8&qrst=1&rt=1&stop=1&vt=2&wq=%E6%89%8B%E6%9C%BA&cid2=653&cid3=655&page=3&s=58&click=0',
'user-agent':?'Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/66.0.3359.139 Safari/537.36',
'x-requested-with':?'XMLHttpRequest',
}
?
def?page(page):
print("開始")
url?=?"https://search.jd.com/Search?keyword=ps4&enc=utf-8&qrst=1&rt=1&stop=1&vt=1&wq=ps4&page="+page+"&s=181&click=0"
r=requests.get(url,timeout=3,headers=head)
r.encoding=r.apparent_encoding
# print(r.text)
b=BeautifulSoup(r.text,"html.parser")
#print(b.prettify())
_element?=?etree.HTML(r.text)
datas?=?_element.xpath('//li[contains(@class,"gl-item")]')
print(datas)
for?data?in?datas:
p_price?=?data.xpath('div/div[@class="p-price"]/strong/i/text()')
p_comment?=?data.xpath('div/div[5]/strong/a/text()')
p_name?=?data.xpath('div/div[@class="p-name p-name-type-2"]/a/em/text()')
p_href?=?data.xpath('div/div[@class="p-name p-name-type-2"]/a/@href')
comment=' '.join(p_comment)
name?=?' '.join(p_name)
price?=?' '.join(p_price)
href?=?' '.join(p_href)
print(name,price,p_comment,href)
?
if?__name__=="__main__":
page("5")
爬取結(jié)果

以上就是python 爬蟲爬取某東ps4售賣情況的詳細內(nèi)容
歡迎大家點贊,留言,轉(zhuǎn)發(fā),轉(zhuǎn)載,感謝大家的相伴與支持
萬水千山總是情,點個【在看】行不行
*聲明:本文于網(wǎng)絡(luò)整理,版權(quán)歸原作者所有,如來源信息有誤或侵犯權(quán)益,請聯(lián)系我們刪除或授權(quán)事宜