txt几百MB之后,基本就无法使用现有工具打开了,只能变成来读了,python很好用,代码如下,建议使用64位的版本
本例打开7.5G的txt文件,并做了适当的归纳处理,也很快
如果只是print那就更快了
共统计49278874行
时间花费231.32秒
# -*- coding: UTF-8 -*- import time file = r"d:\分析数据.txt".decode('utf-8').encode('GB2312') start_time = time.time() select_number = 0.0 all_number = 0.0 with open(file) as f: # 先去除掉前两行 data = f.next() data = f.next() index = 0 while(1): try: data = f.next() index += 1 except: print "共统计%d行" % index break data = data.split("|") if len(data)>10: if data[11] == "443" or data[11] == "465" or data[11] == "995" or data[11] == "993" : select_number += float(data[12]) all_number += float(data[12]) print "select_number是%.2fGB" % (select_number/1024.0/1024.0) print "all_number是%.2fGB" % (all_number/1024.0/1024.0) print "关注的流量占比%.2f%%" % (select_number/all_number * 100) end_time = time.time() print "时间花费%.2f秒"% (end_time-start_time)