以下是一个简单的20行Python代码,用于获取各种书本的信息:
import requests
from bs4 import BeautifulSoup
def get_book_info(url):
response = requests.get(url)
soup = BeautifulSoup(response.text, 'html.parser')
title = soup.find('h1').text.strip()
author = soup.find('span', {'itemprop': 'author'}).text.strip()
description = soup.find('div', {'class': 'description'}).text.strip()
print('书名:', title)
print('作者:', author)
print('描述:', description)
book_urls = [
'https://www.example.com/book1',
'https://www.example.com/book2',
'https://www.example.com/book3',
'https://www.example.com/book4',
'https://www.example.com/book5'
]
for url in book_urls:
get_book_info(url)
print('-----------------')
请将book_urls
列表中的URL替换为实际的书本页面URL。此代码使用requests
库发送HTTP请求,使用BeautifulSoup
库解析HTML页面,并获取书本的标题、作者和描述。然后,将这些信息打印出来。您可以根据需要进行进一步的处理,例如将信息存储到数据库或写入文件中。