批量删除VOC的XML中的某些节点

最新推荐文章于 2024-01-11 09:16:12 发布

转载最新推荐文章于 2024-01-11 09:16:12 发布 · 823 阅读

本博客介绍了一段Python代码，用于处理VOC数据集中的XML标注文件，通过遍历并移除不属于预定义类别的对象标签，实现数据集的类别过滤。代码涉及XML解析、文件操作等技术。

import xml.etree.cElementTree as ET
import os
path_root = ['E:\data-VOC0712\VOC2007\Annotations',
             'E:\data-VOC0712\VOC2012\Annotations']

CLASSES = [
           "bottle",
           "cat", "chair",  "diningtable",
           "dog", "motorbike", "person",
           "pottedplant","sofa",
           "tvmonitor"]
for anno_path in path_root:
    xml_list = os.listdir(anno_path)
    for axml in xml_list:
        path_xml = os.path.join(anno_path, axml)
        tree = ET.parse(path_xml)
        root = tree.getroot()

        for child in root.findall('object'):
            name = child.find('name').text
            if not name in CLASSES:
                root.remove(child)

        tree.write(os.path.join('E:\data-myVOC0712\Annotations', axml))

转载：https://blog.youkuaiyun.com/qq_36735489/article/details/80732346