
python
文章平均质量分 53
rabeen
这个作者很懒,什么都没留下…
展开
-
遍历文件夹下压缩文件并解压至指定文件夹
# -*- coding:utf-8 -*-import Cons as csimport osimport zipfile######### Extract all files in Dirdef extract_to(dir): #os.chdir(dir) files = os.listdir(dir) for file in files:原创 2017-11-08 17:41:04 · 2254 阅读 · 0 评论 -
python解压TAR文件至至指定文件夹
######### Extract all files from src_dir to des_dirdef extract_tar_files(src_dir,des_dir): files = os.listdir(src_dir) for file in files: dir_tmp = os.path.join(src_dir, file)原创 2018-01-26 10:59:10 · 5677 阅读 · 0 评论 -
Python异常处理实例
######### File is too big, read file line by line(if file is small, we can use pandas)def get_valid_inventory(src_dir,tmp_file,des_dir): if not os.path.exists(des_dir): os.mkdir(des_dir)原创 2018-01-26 11:01:47 · 2160 阅读 · 0 评论 -
python时戳转换成字符串日期,并形成时间序列文件
######### Get all performance data of one objectdef get_one_object_perfor_data(object_id,dst_dir,src_file_name): df = pd.read_csv(src_file_name,delimiter=',',header=0) df_tmp1 = df[df['res_i原创 2018-01-26 14:42:17 · 1203 阅读 · 0 评论 -
pandas解决“pandas.parser.CParserError: Error tokenizing data. C error: Expected 2 fields in line 3, s”
df_status0_invertory = pd.read_csv(inventory_dir + inventory_status0_file_name, delimiter=',', header=None, error_bad_lines=False)解决方法:加入参数error_bad_lines=False...原创 2018-03-08 11:34:57 · 55432 阅读 · 0 评论 -
pandas set_value用法
原始代码:df_valid_inventory_sel_tmp['col1']=-9999for index in df_valid_inventory_sel_tmp.index: time_stamp = time.time() df_valid_inventory_sel_tmp['col1'][index] =su.sta_mean(index) ##传统拷贝 40ms...原创 2018-03-09 11:54:42 · 7490 阅读 · 0 评论 -
巧用groupby解决Dataframe筛选分组效率慢问题
原代码:for name in list_valid_perfor_inventory: time_stamp = time.time() df_tmp1 = df_all_performance[df_all_performance['res_ins_id'] == name] ###170万行,该语句大约需要2S if df_tmp1.empty: co...原创 2018-03-06 11:36:40 · 5126 阅读 · 0 评论 -
Python List清空操作
待补充原创 2018-03-20 20:13:40 · 1788 阅读 · 0 评论