Practical Python项目笔记：Python容器类型实战指南-优快云博客

本文链接：https://blog.youkuaiyun.com/gitblog_00310/article/details/148416126

Practical Python项目笔记：Python容器类型实战指南

practical-python Practical Python Programming (course by @dabeaz) 项目地址: https://gitcode.com/gh_mirrors/pr/practical-python

在Python编程中，容器(Containers)是存储和组织数据的基础工具。本文将深入探讨Python中的三种核心容器类型：列表(List)、字典(Dict)和集合(Set)，并通过实际金融数据分析案例展示它们的应用。

容器类型概述

Python提供了多种容器类型来满足不同场景下的数据存储需求：

列表(List)：有序元素集合，适合保持数据顺序的场景
字典(Dict)：键值对集合，提供快速查找能力
集合(Set)：无序唯一元素集合，适合去重和成员测试

列表(List)实战应用

列表是最常用的序列类型，特别适合处理有序数据集。在金融分析中，我们常用列表来存储股票组合数据。

创建股票组合列表

portfolio = [
    ('GOOG', 100, 490.1),  # (股票代码, 数量, 价格)
    ('IBM', 50, 91.3),
    ('CAT', 150, 83.44)
]

动态构建列表

实际项目中，我们通常从文件或数据库动态构建列表：

records = []  # 初始化空列表

with open('portfolio.csv', 'rt') as f:
    next(f)  # 跳过标题行
    for line in f:
        row = line.split(',')
        records.append((row[0], int(row[1]), float(row[2])))

专业建议：对于CSV文件处理，使用csv模块比手动分割更健壮，能自动处理字段引号和转义字符。

字典(Dict)高效查询

字典提供了O(1)时间复杂度的查找能力，非常适合作为内存数据库使用。

股票价格字典示例

prices = {
   'GOOG': 513.25,
   'CAT': 87.22,
   'IBM': 93.37,
   'MSFT': 44.12
}

安全字典访问技巧

直接通过prices['IBM']访问可能引发KeyError，推荐使用：

# 方法1：成员测试
if 'IBM' in prices:
    print(prices['IBM'])

# 方法2：get()方法提供默认值
price = prices.get('SCOX', 0.0)  # 不存在返回0.0

复合键应用

字典键可以是任何不可变类型，如元组：

holidays = {
  (1, 1): '元旦',
  (3, 14): '圆周率日',
}

print(holidays[3, 14])  # 输出"圆周率日"

注意：列表、字典等可变类型不能作为字典键。

集合(Set)高级操作

集合提供了高效的成员测试和集合运算能力。

基本集合操作

tech_stocks = {'IBM', 'AAPL', 'MSFT'}

# 成员测试
print('IBM' in tech_stocks)  # True

# 去重示例
names = ['IBM', 'AAPL', 'IBM', 'GOOG']
unique_names = set(names)  # {'IBM', 'AAPL', 'GOOG'}

集合运算

s1 = {'a', 'b', 'c'}
s2 = {'c', 'd'}

print(s1 | s2)  # 并集 {'a', 'b', 'c', 'd'}
print(s1 & s2)  # 交集 {'c'}
print(s1 - s2)  # 差集 {'a', 'b'}

实战项目：股票组合分析

让我们将这些容器类型应用于实际的股票组合分析。

练习2.4：元组列表实现

def read_portfolio(filename):
    """读取CSV文件到元组列表"""
    portfolio = []
    with open(filename, 'rt') as f:
        rows = csv.reader(f)
        headers = next(rows)
        for row in rows:
            holding = (row[0], int(row[1]), float(row[2]))
            portfolio.append(holding)
    return portfolio

练习2.5：字典列表改进

使用字典提高代码可读性：

def read_portfolio(filename):
    """读取CSV文件到字典列表"""
    portfolio = []
    with open(filename, 'rt') as f:
        rows = csv.reader(f)
        headers = next(rows)
        for row in rows:
            holding = {
                'name': row[0],
                'shares': int(row[1]),
                'price': float(row[2])
            }
            portfolio.append(holding)
    return portfolio

练习2.6：价格字典构建

def read_prices(filename):
    """读取价格CSV到字典"""
    prices = {}
    with open(filename, 'r') as f:
        rows = csv.reader(f)
        for row in rows:
            try:
                prices[row[0]] = float(row[1])
            except IndexError:  # 处理空行
                continue
    return prices