[Pandas] The meaning of as_index in DataFrame groupby

License: CC 4.0 BY-SA

Tags: #Python #Pandas

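This article looks at how the as_index parameter affects the result of groupby on a Pandas DataFrame. With as_index=True (the default), the grouping column 'even' becomes the index of the result, so it can no longer be accessed as a column by name. With as_index=False, 'even' is kept as a regular column and the result gets a new default integer index, so rows can be accessed with loc using integer labels such as loc[0].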

An example:

import numpy as np
import pandas as pd  # needed for pd.DataFrame below

values = np.array([1, 3, 2, 4, 1, 6, 4])
example_df = pd.DataFrame({
    'value': values,
    'even': values % 2 == 0,
    'above_three': values > 3
}, index=['a', 'b', 'c', 'd', 'e', 'f', 'g'])

print(example_df)

Output:

   above_three   even  value
a        False  False      1
b        False  False      3
c        False   True      2
d         True   True      4
e        False  False      1
f         True   True      6
g         True   True      4

first_even = example_df.groupby('even').first()  # as_index defaults to True here

print(first_even)

Output:

       above_three  value
even                     
False        False      1
True         False      2

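At this point, print(first_even['even']) raises a KeyError, and print(first_even.loc['a']) raises a KeyError as well.

This is because with as_index=True the even column becomes the index of the result, so the new DataFrame no longer contains it as a column. The original index ['a', 'b', 'c', ...] no longer exists either.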
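The two failures described above, and the fact that the group keys are still reachable through the index, can be sketched as follows. This is only a minimal illustration: it rebuilds example_df so that it runs on its own, and the try/except wrappers and the variable names column_lookup_failed, label_lookup_failed, group_keys and odd_row are additions made for demonstration.

```python
import numpy as np
import pandas as pd

values = np.array([1, 3, 2, 4, 1, 6, 4])
example_df = pd.DataFrame({
    'value': values,
    'even': values % 2 == 0,
    'above_three': values > 3,
}, index=['a', 'b', 'c', 'd', 'e', 'f', 'g'])

first_even = example_df.groupby('even').first()  # as_index=True by default

# 'even' is now the index, not a column, so looking it up as a column fails.
try:
    first_even['even']
    column_lookup_failed = False
except KeyError:
    column_lookup_failed = True

# The original labels 'a'..'g' are gone, so looking up 'a' fails too.
try:
    first_even.loc['a']
    label_lookup_failed = False
except KeyError:
    label_lookup_failed = True

# The group keys are still available, but through the index.
group_keys = first_even.index.tolist()
odd_row = first_even.loc[False]  # first row of the group where even is False

print(column_lookup_failed, label_lookup_failed)
print(first_even.index.name, group_keys)
print(odd_row)
```

So the grouping values are not lost with as_index=True; they have just moved from the columns into the index, and loc has to be given a group key (False or True) instead of an original row label.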

If we instead write

first_even = example_df.groupby('even', as_index=False).first()  # as_index is the boolean False here, not the string 'False'

print(first_even)

Output:

    even  above_three  value
0  False        False      1
1   True        False      2

print(first_even.loc[0])

Output:

even           False
above_three    False
value              1
Name: 0, dtype: object
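One way to see the relationship between the two settings is to compare the as_index=False result with the default result after calling reset_index(). The sketch below rebuilds example_df so that it runs on its own. The names flat, indexed and even_column are chosen here for illustration, and the dtype check is relaxed purely as a precaution, since the dtype of the key column is not the point of the comparison.

```python
import numpy as np
import pandas as pd

values = np.array([1, 3, 2, 4, 1, 6, 4])
example_df = pd.DataFrame({
    'value': values,
    'even': values % 2 == 0,
    'above_three': values > 3,
}, index=['a', 'b', 'c', 'd', 'e', 'f', 'g'])

flat = example_df.groupby('even', as_index=False).first()
indexed = example_df.groupby('even').first()

# With as_index=False, 'even' is an ordinary column and can be read by name.
even_column = flat['even'].tolist()

# The row labels are now the default integers 0 and 1.
print(flat.index.tolist())
print(even_column)

# reset_index() moves the 'even' index of the default result back into a
# column, which should give the same table as as_index=False.
# assert_frame_equal raises an AssertionError if the two frames differ.
pd.testing.assert_frame_equal(flat, indexed.reset_index(), check_dtype=False)
```

In other words, as_index=False can be read as a shortcut for grouping with the default setting and then calling reset_index() on the result.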