Merging data
import numpy as np
import pandas as pd
Load the four files and the train table, then look at how they relate to each other
text_left_up = pd.read_csv("C:/Users/Administrator/LC python/datawhale/data analysis/unit 2/data/train-left-up.csv")
text_left_down = pd.read_csv("C:/Users/Administrator/LC python/datawhale/data analysis/unit 2/data/train-left-down.csv")
text_right_up = pd.read_csv("C:/Users/Administrator/LC python/datawhale/data analysis/unit 2/data/train-right-up.csv")
text_right_down = pd.read_csv("C:/Users/Administrator/LC python/datawhale/data analysis/unit 2/data/train-right-down.csv")
text_left_up.head()
text_left_down.head()
text_right_up.head()
text_right_down.head()
df.head()  # assumption: df holds the full train table and was loaded before this section
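Comparing the outputs shows that the four datasets are the four quadrants obtained by cutting the train table with a cross-shaped split: upper left, lower left, upper right, and lower right.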
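To make the cross-shaped split concrete, here is a minimal sketch on a toy table. The frame `toy` and the cut positions `row_cut` and `col_cut` are made up for illustration; the source does not say where the real files were cut.

```python
import pandas as pd

# Hypothetical 4x4 table standing in for the full train table.
toy = pd.DataFrame(
    {
        "a": [1, 2, 3, 4],
        "b": [5, 6, 7, 8],
        "c": [9, 10, 11, 12],
        "d": [13, 14, 15, 16],
    }
)

row_cut, col_cut = 2, 2  # assumed cut positions, chosen arbitrarily

# Cut the table into four quadrants with positional slicing.
left_up = toy.iloc[:row_cut, :col_cut]
right_up = toy.iloc[:row_cut, col_cut:]
left_down = toy.iloc[row_cut:, :col_cut]
right_down = toy.iloc[row_cut:, col_cut:]

# Each quadrant keeps its own slice of the row labels and column names.
print(left_up.shape, right_up.shape, left_down.shape, right_down.shape)
```

The two upper quadrants share the same row labels, and the two left quadrants share the same column names, which is what makes it possible to put the pieces back together.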
Concatenating data
Using the concat method: join train-left-up.csv and train-right-up.csv horizontally into a single table and store it as result_up.
# Concatenate the data
list_up = [text_left_up,text_right_up]
result_up = pd.concat(list_up, axis=1)  # join the two "up" tables side by side
result_up.head()
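One thing worth checking with a horizontal concat is that pandas lines rows up by index label, not by position. The sketch below uses small made-up frames (`left`, `right_same`, `right_shifted` are hypothetical names, not part of the tutorial data) to show what happens when the labels match and when they do not.

```python
import pandas as pd

# Hypothetical frames; the values are illustrative only.
left = pd.DataFrame({"id": [1, 2, 3]}, index=[0, 1, 2])
right_same = pd.DataFrame({"fare": [7.25, 71.28, 7.92]}, index=[0, 1, 2])
right_shifted = pd.DataFrame({"fare": [7.25, 71.28, 7.92]}, index=[1, 2, 3])

# Matching index labels: rows pair up one to one.
aligned = pd.concat([left, right_same], axis=1)

# Different index labels: the default outer join keeps every label
# and fills the cells that have no partner with NaN.
misaligned = pd.concat([left, right_shifted], axis=1)

print(aligned)
print(misaligned)
```

If result_up shows unexpected NaN values or more rows than either input, mismatched index labels are a likely cause.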
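pd.concat(objs, axis=0, join='outer', ignore_index=False, keys=None, levels=None, names=None, verify_integrity=False, copy=True)
(Older pandas releases also accepted join_axes; it was removed in pandas 1.0.)
objs: the objects to concatenate, e.g. [df1, df2]
axis: axis = 1 concatenates horizontally (tables are placed side by side, adding columns); axis = 0 concatenates vertically (tables are stacked, adding rows)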
join out
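The effect of the axis and join parameters is easier to see on a small example. This is a minimal sketch with made-up frames `df1` and `df2` that share only the column `y`; it is not taken from the tutorial data.

```python
import pandas as pd

# Hypothetical frames that overlap in one column ("y").
df1 = pd.DataFrame({"x": [1, 2], "y": [3, 4]})
df2 = pd.DataFrame({"y": [5, 6], "z": [7, 8]})

# axis=0 stacks the rows; join="outer" (the default) keeps the union of columns.
stacked = pd.concat([df1, df2], axis=0)

# join="inner" keeps only the columns present in every input.
stacked_inner = pd.concat([df1, df2], axis=0, join="inner")

# axis=1 places the frames side by side, matching rows by index label.
side = pd.concat([df1, df2], axis=1)

# ignore_index=True discards the original labels and renumbers from 0.
renumbered = pd.concat([df1, df2], axis=0, ignore_index=True)

print(stacked.shape, stacked_inner.shape, side.shape)
```

Note that stacking with axis=0 keeps the original row labels, so `stacked` contains the labels 0 and 1 twice; `ignore_index=True` is one way to avoid that.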