Scraping the names and prices of a specified Taobao product with Python

License: CC 4.0 BY-SA

Tags: #Taobao products #python #web scraping

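This article presents a simple crawler written in Python that scrapes information about a specified product from Taobao, namely each listing's name and price, and then organizes and displays it. The program fetches the source of the search results page, extracts the key fields with regular expressions, and uses the PrettyTable library to print all listings in a table ranked by price.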
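The extraction step can be tried offline before touching the network. The following is a minimal sketch, assuming the page embeds item data as JSON-like text with `raw_title` and `view_price` fields (the field names used by the script); `SAMPLE_HTML` and `extract_items` are hypothetical names introduced only for illustration, and the sample values are made up.

```python
import re

# Hypothetical fragment imitating the item data embedded in a results page.
SAMPLE_HTML = (
    '{"raw_title":"iPhone case","view_price":"19.90"},'
    '{"raw_title":"iPhone cable","view_price":"35.00"}'
)

NAME_RE = re.compile(r'"raw_title":"(.+?)"')
# Accept integer or decimal prices such as "35" or "19.90".
PRICE_RE = re.compile(r'"view_price":"(\d+(?:\.\d+)?)"')


def extract_items(html):
    """Return a list of (name, price) tuples found in html."""
    names = NAME_RE.findall(html)
    prices = PRICE_RE.findall(html)
    # zip stops at the shorter list, so a missing field cannot raise IndexError.
    return [(name, float(price)) for name, price in zip(names, prices)]


items = extract_items(SAMPLE_HTML)
for name, price in sorted(items, key=lambda item: item[1]):
    print(price, name)
```

Pairing the two result lists by position relies on every listing carrying both fields; if a page omits one, the pairs after that point would be misaligned, so checking that both lists have the same length is a reasonable safeguard.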

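#!/usr/bin/env python
# -*- coding:utf-8 -*-
# Author: feng
import re

import requests
from colorama import Back, Fore, init
from prettytable import PrettyTable


def getHtmlText(url):
    """Fetch a page and return its text, or "" if the request fails."""
    try:
        r = requests.get(url, timeout=10)
        r.raise_for_status()
        r.encoding = r.apparent_encoding
        return r.text
    except requests.RequestException:
        print("Wrong!!")
        return ""


def getProductPrice(itl, html):
    """Append [name, price] pairs found in html to itl and return itl."""
    name_regx = re.compile(r'"raw_title":"(.+?)"')
    bag_name = re.findall(name_regx, html)
    # Accept any decimal price, not only values ending in ".00"
    price_regx = re.compile(r'"view_price":"(\d+(?:\.\d+)?)"')
    price_name = re.findall(price_regx, html)
    # zip stops at the shorter list, so a missing price cannot raise IndexError
    for name, price in zip(bag_name, price_name):
        itl.append([name, float(price)])
    return itl


def PrintAllPrice(plist):
    """Print every item in a table sorted by ascending price."""
    header = ["No.", "Price", "Product"]
    pt = PrettyTable()
    pt.field_names = header
    plist = sorted(plist, key=lambda x: x[1])
    for index, i in enumerate(plist):
        pt.add_row([
            Fore.RED + str(index) + Fore.RESET,
            Fore.LIGHTCYAN_EX + str(i[1]) + Fore.RESET,
            Back.GREEN + i[0] + Back.RESET,
        ])
    print(pt)


def main():
    init()  # let colorama handle ANSI color codes on Windows consoles
    good = "iphone"  # search keyword (the product name spelled out in full)
    depth = 2  # crawl depth: number of result pages to fetch
    start_url = "https://s.taobao.com/search?q=" + good
    infoList = []
    for i in range(depth):
        # the "s" parameter is the item offset, advanced by 44 per page
        url = start_url + "&s={}".format(44 * i)
        Text = getHtmlText(url)
        getProductPrice(infoList, Text)
    PrintAllPrice(infoList)


if __name__ == "__main__":
    main()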
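
The script builds each page URL by string concatenation, which works for an ASCII keyword such as "iphone". As a hedged alternative, the sketch below builds the same URLs with `urllib.parse.urlencode` from the standard library, which also percent-encodes non-ASCII keywords. The helper name `build_search_urls` and the constant names are hypothetical; the base URL and the step of 44 are taken from the script above.

```python
from urllib.parse import urlencode

BASE_URL = "https://s.taobao.com/search"
PAGE_STEP = 44  # offset step between result pages, as used in the script


def build_search_urls(keyword, depth):
    """Return one search URL per result page for the given keyword."""
    urls = []
    for page in range(depth):
        # urlencode escapes the keyword and joins the parameters with "&".
        query = urlencode({"q": keyword, "s": PAGE_STEP * page})
        urls.append(BASE_URL + "?" + query)
    return urls


for url in build_search_urls("iphone", 2):
    print(url)
```

Keeping URL construction in its own function makes the paging logic easy to check without sending any requests.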