问题描述:自制数据集时的数据标签名在后续过程中发现不合适,需要批量修改xml文件
原标签名membrane,rust,foreign→film_defect,electrode_defect,foreign_material
'''
批量修改xml文件中的缺陷类别名称
当有多个物体时,多个物体的名称均能被修改
'''
from lxml.etree import Element, SubElement, tostring, ElementTree
from xml.dom import minidom
import xml.etree.ElementTree as ET
import os
# 修改自己的路径
template_file = r'H:\master\data_for_deep_learning\complex_background\AOI_image1 _xml' #这里是存放xml文件的文件夹
xmllist = os.listdir(template_file)
for xml in xmllist:
#print(xml)
tree = ET.parse(os.path.join(template_file,xml))
root = tree.getroot() # 获取根节点
for child in root:
print(child.tag,child.attrib)
if child.tag == 'object':
name=child.find('name').text
# print(name)
if name == 'membrane':
child.find('name').text='film_defect'
tree=ET.ElementTree(root)
elif name == 'rust':
child.find('name').text = 'electrode_defect'
tree = ET.ElementTree(root)
elif name == 'foreign':
child.find('name').text = 'foreign_material'
tree = ET.ElementTree(root)
tree.write(os.path.join(template_file,xml))
上述代码在单个xml文件中出现多个标注物体时也能同时修改多个名称。