A DataFrame Is Made Up of a Collection of Series (pandas, Titanic)

License: CC 4.0 BY-SA

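This article explores the Series and DataFrame data structures in the pandas library, using examples on the Titanic dataset that cover data loading, data cleaning, feature engineering, and initial data exploration.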

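## A DataFrame is made up of a collection of Series
import numpy as np
import pandas as pd
## read_csv treats the first row as the header by default; pass header=None if the file has no header row
df = pd.read_csv("E:/py_learning/day2/titanic.csv", sep=",")  ## read the CSV file into a DataFrame
df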
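
The effect of the header setting can be checked without the Titanic file. The following is a minimal sketch that assumes a tiny in-memory CSV; the text in `csv_text` and the passenger names are made up for illustration and are not rows from the real dataset.

```python
import io

import pandas as pd

# Hypothetical two-row CSV with a header line (not real Titanic records).
csv_text = "Name,Age\nPassenger A,22\nPassenger B,38\n"

# Default behaviour: the first line becomes the column names.
with_header = pd.read_csv(io.StringIO(csv_text))

# header=None: the first line is treated as data and columns are numbered.
no_header = pd.read_csv(io.StringIO(csv_text), header=None)

print(with_header.columns.tolist())       # ['Name', 'Age']
print(no_header.columns.tolist())         # [0, 1]
print(len(with_header), len(no_header))   # 2 3
```

With `header=None` the header line is counted as an ordinary row, which is why the second frame has one more row than the first.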

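pd.__version__  ## the installed pandas version, '0.25.1' here; by convention, module metadata such as __version__ uses a name that begins and ends with two underscores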

dir(pd)  ## list every name (functions, classes, submodules) defined in the package; works for any module
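
Because `dir(pd)` returns a plain list of strings, it can be filtered like any other list. The sketch below is one possible way to do that; the variable name `public_names` is chosen here for illustration only.

```python
import pandas as pd

# dir() returns attribute names as strings, including private and dunder names.
all_names = dir(pd)

# Keep only the names that do not start with an underscore.
public_names = [name for name in all_names if not name.startswith("_")]

print("__version__" in all_names)      # True
print("__version__" in public_names)   # False
print("read_csv" in public_names)      # True
print("DataFrame" in public_names)     # True
```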

df.head(2)  ## return the first 2 rows of the DataFrame

type(df["Age"])  ## pandas.core.series.Series
## each column of a DataFrame is a Series
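
The relationship also works in the other direction: several Series can be put side by side to form a DataFrame. This is a small sketch using `pd.concat` with made-up values (the names and ages are placeholders, not real Titanic records), assuming the Series share the default integer index.

```python
import pandas as pd

# Two hypothetical Series; their names become the column names.
names = pd.Series(["Passenger A", "Passenger B", "Passenger C"], name="Name")
ages = pd.Series([22.0, 38.0, 26.0], name="Age")

# axis=1 places the Series next to each other as columns.
df_demo = pd.concat([names, ages], axis=1)

# Pulling a column back out gives a Series again.
age_col = df_demo["Age"]

print(type(age_col).__name__)                 # Series
print(age_col.name)                           # Age
print(age_col.index.equals(df_demo.index))    # True
```

Each column carries the same row index as the DataFrame it came from, which is what keeps the columns aligned row by row.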

## select columns with a list of names to keep the result a DataFrame, even for a single column
df[["Name","Age"]]
type(df[["Age"]])  ## pandas.core.frame.DataFrame
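
The difference between single and double brackets shows up in the shape of the result as well as in its type. A minimal sketch, assuming a small stand-in DataFrame called `df_demo` with made-up values:

```python
import pandas as pd

# Hypothetical stand-in for the Titanic DataFrame.
df_demo = pd.DataFrame({
    "Name": ["Passenger A", "Passenger B", "Passenger C"],
    "Age": [22.0, 38.0, 26.0],
})

as_series = df_demo["Age"]     # a column label -> one-dimensional Series
as_frame = df_demo[["Age"]]    # a list of labels -> two-dimensional DataFrame

print(type(as_series).__name__, as_series.shape)   # Series (3,)
print(type(as_frame).__name__, as_frame.shape)     # DataFrame (3, 1)
print(as_frame.columns.tolist())                   # ['Age']
```

The one-column DataFrame keeps its column label, which is convenient when the result is passed to methods that label their output by column, such as plotting methods.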

## draw a box plot
df[["Age"]].boxplot()

## draw a histogram
df[["Age"]].hist()
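
Both plots are summaries of the same column, so the numbers behind them can be inspected directly. The sketch below is an illustration under stated assumptions, not the plotting code itself: it uses a made-up `age` Series containing one missing value, computes the quartiles that form the box of a box plot, and counts values per bin with `np.histogram`. It uses 4 bins to keep the output short, whereas `DataFrame.hist` defaults to 10.

```python
import numpy as np
import pandas as pd

# Hypothetical ages with one missing value (not real Titanic records).
age = pd.Series([22.0, 38.0, np.nan, 2.0, 54.0, 26.0], name="Age")

# Quartiles: the box in a box plot spans the 25th to 75th percentile,
# with a line at the median. quantile() skips missing values.
quartiles = age.quantile([0.25, 0.5, 0.75])
print(quartiles.tolist())   # [22.0, 26.0, 38.0]

# Histogram counts: drop missing values first, then count values per bin.
counts, edges = np.histogram(age.dropna(), bins=4)
print(counts.tolist())      # [1, 2, 1, 1]
print(edges.tolist())       # [2.0, 15.0, 28.0, 41.0, 54.0]
```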

Author: CHEN_BR