Crawler Diary: Sandvik

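This post looks at the data analysis process for cutting-tool products: the technical details of fetching product information from the Sandvik Coromant website, the meaning of raw data fields such as grade, substrate, and coating, and the technical difficulties of handling large volumes of data and saving images.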


Viewing all tools

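# Get all products
https://www.sandvik.coromant.com/zh-cn/_vti_bin/tibp/coromant/search.svc/getgroupedleaves 
# Get a subset of products, as in the screenshot below
https://www.sandvik.coromant.com/zh-cn/_vti_bin/tibp/coromant/search.svc/getresults
# Get individual products (lookup by ordering code; several codes can be sent at once, separated by semicolons)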
https://tibp-api.azurewebsites.net/api/v1/coromant/productavailability/batch?orderingCodesSemicolonSeparated=CCGX09T3L020-15FXA%20%207125;CNGX1204L025-18HXA%20%207125;WNMX%2015%2009%2031-MM%20%20%20%204335;WNMX%2021%2012%2051-MM%20%20%20%204335;WNMX%2015%2009%2031-MM%20%20%20%202220;CCGX09T3L020-15FXA%20%207105;CCGX09T3L020-15FXA%20%207115;CNGX1204L025-18HXA%20%207105;CNGX1204L025-18HXA%20%207115;CCGX09T3L020-15FXA%20%207015

[Image: image.png]
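The third URL above packs several ordering codes into one query string, separated by semicolons, with the spaces inside each code percent-encoded. A minimal sketch of building such a URL follows. The endpoint and parameter name are copied from the URL above; the helper name `build_batch_url` is hypothetical, and whether the service limits the number of codes per request is not known.

```python
from urllib.parse import quote

# Endpoint and parameter name copied from the URL listed above.
BATCH_URL = "https://tibp-api.azurewebsites.net/api/v1/coromant/productavailability/batch"


def build_batch_url(ordering_codes):
    """Percent-encode each ordering code and join the codes with ';'.

    Hypothetical helper: the ';' separators are left unencoded, matching
    the URL captured from the site.
    """
    encoded = [quote(code, safe="") for code in ordering_codes]
    return BATCH_URL + "?orderingCodesSemicolonSeparated=" + ";".join(encoded)


# Ordering codes keep their internal spaces, exactly as in CopORDCODE.
url = build_batch_url(["CCGX09T3L020-15FXA  7125", "WNMX 15 09 31-MM    4335"])
print(url)
```

The resulting URL can then be passed to whatever HTTP client the crawler already uses.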

Product data analysis

Raw data

      {
        "CopTMC1ISO": "H",
        "CopCBMD": "XA",
        "CopCUTINTSIZESHAPE": "CC09T3",
        "CopLE": 2.3,
        "CopAPMX": 0.2,
        "CopKCH": 14,
        "CopCEDC": 2,
        "CopBS": null,
        "CopRE": 0,
        "CopWEP": true,
        "CopKRINS": 95,
        "CopBN": 0.15,
        "CopGB": 15,
        "CopHAND": "N",
        "CopGRADE": "7125",
        "CopSUBSTRATE": "BC",
        "CopCOATING": "PVD AlTiCrN",
        "CopWT": 0.003,
        "CopSEP": 0,
        "CopLCS": "20",
        "CopRELEASEPACK": "18.2",
        "CopPRODFAM": "CoroTurn 107",
        "CopSSCM": "09",
        "TIBPAvailability": "Available",
        "CopId": null,
        "CopMNEMONICID": "INSTRNKAP_COR",
        "CopITEMTYPE": "Insert",
        "CopNotReplenishedAfter": null,
        "CopDRAWCHAR": "160600.jpg",  # image data
        "CopDRAWCHAR2": null,
        "CopDRAWCHAR3": null,
        "CopDRAWDETAIL": null,
        "CopDRAWEXPLVIEW": null,
        "CopDRAWFUNCT": null,
        "CopEAN": "7323223812230",
        "CopDXFFILE": null,
        "CopMODELSIMULANTICOLL": null,
        "CopPRODUCT3DMODELBASIC": null,
        "CopPRODUCT3DMODELDETAILED": null,
        "CopLF": null,
        "CopLPR": null,
        "CopMaterialID": 7586828,
        "CopStartValueRec": "H#ap: 0.12 mm(0.05-0.2) fn: 0.3 mm/r(0.2-0.4) vc: 135 m/min(150-125)",
        "CopPRODUCTLISTPIC": "150967.jpg", # image data
        "CopPICT3DVIEW": "150967.jpg", # image data
        "CopORDCODE": "CCGX09T3L020-15FXA  7125",
        "CopORDCODEUSA": "CCGX09T3L020-15FXA  7125",
        "CopMatchingBOM": null,
        "CopMatchingCUTINTMASTER": null,
        "CopMatchingADINTMS": null,
        "CopTSYC": "CCGX-15FXA (A)",
        "CopReplacementProduct": null,
        "CopReplacementProductInfo": null,
        "CopPRODUCTCATEGORY": null,
        "TIBPContentSourceLocation": "Products/Standard",
        "CopTIBPTMApproved": false,
        "CopTIBPTMProApproved": false,
        "CopIsTailorMade": 0,
        "CopCAPPFamilyID": "I415",
        "CopPackageQuantity": 5,
        "TIBPSupportedForAssemblyBuilding": true,
        "CopPRODDESCR": "CoroTurn® 107车削刀片"
      }
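Several fields in the record above hold image file names, and the same file can appear under more than one field. The sketch below collects the distinct names from one record. The list `IMAGE_FIELDS` is an assumption: it contains the three annotated fields plus `CopDRAWCHAR2` and `CopDRAWCHAR3`, which are null in this sample. The helper name `image_files` is hypothetical.

```python
import json

# Assumed list of image-bearing fields: the three fields annotated in the
# record above, plus CopDRAWCHAR2/3, which are null in the sample.
IMAGE_FIELDS = (
    "CopDRAWCHAR",
    "CopDRAWCHAR2",
    "CopDRAWCHAR3",
    "CopPRODUCTLISTPIC",
    "CopPICT3DVIEW",
)


def image_files(record):
    """Return the distinct, non-null image file names in a record, in field order."""
    names = []
    for field in IMAGE_FIELDS:
        name = record.get(field)
        if name and name not in names:
            names.append(name)
    return names


# A trimmed copy of the sample record, as valid JSON (no inline comments).
sample = json.loads(
    '{"CopORDCODE": "CCGX09T3L020-15FXA  7125",'
    ' "CopDRAWCHAR": "160600.jpg", "CopDRAWCHAR2": null,'
    ' "CopPRODUCTLISTPIC": "150967.jpg", "CopPICT3DVIEW": "150967.jpg"}'
)
print(image_files(sample))
```

Deduplicating up front avoids downloading the same picture twice when the list picture and the 3D view share a file.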

Field analysis

[Image: image.png]

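Field reference
CopGRADE: grade, i.e. the insert material (GRADE)
CopSUBSTRATE: substrate (SUBSTRATE)
CopCOATING: coating (COATING)
CopWT: part weight (WT)
CopSEP: sensor embedded property (SEP)
CopLCS: life cycle state (LCS)
CopRELEASEPACK: release pack ID (RELEASEPACK)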
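The field reference above can be kept as a small lookup table so that raw records are easier to read during analysis. This is only a sketch: the labels are the ones listed above, while the names `FIELD_LABELS` and `label_fields` are hypothetical.

```python
# Raw field name -> readable label, taken from the field reference above.
FIELD_LABELS = {
    "CopGRADE": "Grade",
    "CopSUBSTRATE": "Substrate",
    "CopCOATING": "Coating",
    "CopWT": "Part weight",
    "CopSEP": "Sensor embedded property",
    "CopLCS": "Life cycle state",
    "CopRELEASEPACK": "Release pack ID",
}


def label_fields(record):
    """Return {label: value} for the known fields; missing fields map to None."""
    return {label: record.get(field) for field, label in FIELD_LABELS.items()}


# Values copied from the sample record in the raw data section.
sample = {
    "CopGRADE": "7125",
    "CopSUBSTRATE": "BC",
    "CopCOATING": "PVD AlTiCrN",
    "CopWT": 0.003,
    "CopSEP": 0,
    "CopLCS": "20",
    "CopRELEASEPACK": "18.2",
}
for label, value in label_fields(sample).items():
    print(f"{label}: {value}")
```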

The data we actually retrieve is more detailed than this, and all of it can be saved.

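Technical challenges

  1. How to save the images
  2. How to store a large volume of data
  3. Anti-crawling measures [none so far, as far as I can tell]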
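For the first two points, one possible approach is sketched below. It is not necessarily what this crawler ended up using. Records go into SQLite keyed by `CopORDCODE`, with the full record stored as JSON, and image bytes are written to disk only when the file is not already there. The table name, function names, and folder layout are all hypothetical, and downloading the image bytes is left out because the image base URL does not appear in the captured data.

```python
import json
import sqlite3
from pathlib import Path


def open_store(path=":memory:"):
    """Open a SQLite database and create the products table if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS products ("
        "ord_code TEXT PRIMARY KEY, payload TEXT NOT NULL)"
    )
    return conn


def save_records(conn, records):
    """Insert or overwrite records in one transaction, so re-runs are safe."""
    rows = [(r["CopORDCODE"], json.dumps(r, ensure_ascii=False)) for r in records]
    with conn:  # commits on success, rolls back on error
        conn.executemany("INSERT OR REPLACE INTO products VALUES (?, ?)", rows)
    return len(rows)


def save_image(folder, file_name, data):
    """Write image bytes to folder/file_name; return False if it already exists."""
    target = Path(folder) / file_name
    if target.exists():
        return False
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_bytes(data)
    return True
```

A possible usage: call `open_store("sandvik.db")` once, pass each page of results to `save_records`, and call `save_image("images", name, data)` for every downloaded picture. Storing the whole record as JSON keeps every field without committing to a schema early; individual columns can be split out later once the analysis settles.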