- 博客(6)
- 收藏
- 关注
原创 C#数据转化+BFS拷贝
using System;using System.Collections.Generic;using System.Linq;using System.Text;using System.Threading.Tasks;using System.Windows;using System.Windows.Controls;using System.Windows.Data;using System.Windows.Documents;using System.Windows.Input;.
2021-11-25 16:14:40
165
原创 合并表一表二保留公共项
import pandas as pddf1=pd.read_excel('input1.xlsx')df2=pd.read_excel('input2.xlsx')df1.columns=['name','information1','information2']df2.columns=['name','information3','information4']df3 = pd.merge(df1, df2, on="name", how="left")df3.dropna(axis=0, .
2021-09-25 16:15:51
158
原创 预处理归一化
import pandas as pddf=pd.read_csv('energy.csv')orilist=[]for i in df['999']: if i[:4] not in orilist: orilist.append(i[:4])dffinal = pd.DataFrame(columns=('999', '0', '1', '2', '3', '4', '5', '6'))for j in orilist: df1 = pd.DataFr.
2021-08-31 14:57:43
219
原创 中药 爬虫
# -*- coding:utf-8 -*-# coding = GBKimport jsonfrom bs4 import BeautifulSoup as bsimport urllib.requestimport refrom selenium import webdriverimport requestsimport os# html_doc = "https://old.tcmsp-e.com/browse.php?qc=herbs"# req = urllib.req.
2021-08-23 14:32:44
767
原创 rcsb爬虫
# -*- coding:utf-8 -*-import osimport timeimport requestsimport reimport pandas as pdimport numpy as np# retval = os.getcwd()# os.chdir(retval+"/temp")filename='1R4L'# filename = '1RL'url=f"http://files.rcsb.org/download/{filename}.pdb"headers.
2021-08-08 16:01:50
886
1
原创 多文件脚本
# -*- coding:utf-8 -*-import osimport numpy as npimport reimport pandas as pddef traverse(filepath): files = os.listdir(filepath) for fi in files: fi_d = os.path.join(filepath, fi) if os.path.isdir(fi_d): # 判断是否为文件夹 .
2021-06-11 19:37:38
180
空空如也
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人
RSS订阅