Pandas IO读取文件
pandas官网IO Tools
- read_csv,读取带分隔符的数据,默认逗号’,’
- read_table,读取带分隔符的数据,默认制表符’\t’
- read_fwf,读取没有分隔符的数据,固定列宽(想想Excel的分列功能)
- read_cliboard,读取剪贴板的数据
1.文件路径
- pd.read_csv(‘f:/test/demo.csv’),注意‘/’
- pd.read_csv(r’f\test\demo.csv’),注意‘\’前加r
- pd.read_table('f/:test/demo.csv, sep = ‘,’)
2.read_csv分析
* pd.read_csv('demo_gbk.csv',encoding = 'gbk')
* pd.read_csv('demo_utf8.csv,encoding = 'utf8')
pd.read_csv('f:/test/demo.csv',header = None)
pd.read_csv('f:/test/demo.csv',names = ['a','b','c','d'])
pd.read_csv('f:/test/demo.csv',names = ['a','b','c','d'],index_col = 'd')
pd.read_csv('f:/test/demo.csv',names = ['a','b','c','d'],index_col = ['c','d'])
pd.read_csv('f:/test/demo.csv',nrows = 10)
3.用“正则表达式”读取
pd.read_table('f:/test/demo1.csv',sep = '\s+')
保存
1. df.to_csv('测试1.csv')
2. df.to_html('测试2.html')
3. df.to_excel('测试3.xlsx')
详见pandas官网IO Tools