Converting XML files annotated with labelImg into the TXT files required by YOLOv5

Deepcong

Last modified: 2022-05-14 22:10:56

License: CC 4.0 BY-SA

First published: 2022-05-14 22:10:00

Article link: https://blog.csdn.net/DeepCBW/article/details/124775153

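This post presents a Python script that converts XML annotations produced by labelImg (stored here under the coco128 label directory) into the TXT label format that YOLOv5 requires. The script walks every XML file in the given directory, reads each object's class name and bounding-box coordinates, normalizes the coordinates by the image size, and writes the result to a new TXT file. It then moves the original XML file into a backup directory. YOLOv5 needs labels in this format to train an object detection model.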

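The label format required by YOLOv5 is shown in the figure below.
Each line starts with the class index, followed by the normalized box center x, y and the box width and height w, h.
[Figure: coco]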
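
As a worked example of this format, the sketch below converts one pixel-space bounding box into a label line. The function name `voc_box_to_yolo_line` and the sample image size and box are illustrative assumptions, not part of the original post; the arithmetic is the same normalization used by the conversion script.

```python
def voc_box_to_yolo_line(class_index, xmin, ymin, xmax, ymax, img_w, img_h):
    """Return one YOLO label line for a pixel-space (xmin, ymin, xmax, ymax) box."""
    x_center = (xmin + xmax) / 2 / img_w  # box center x, normalized by image width
    y_center = (ymin + ymax) / 2 / img_h  # box center y, normalized by image height
    width = (xmax - xmin) / img_w         # box width, normalized by image width
    height = (ymax - ymin) / img_h        # box height, normalized by image height
    return "%d %.6f %.6f %.6f %.6f" % (class_index, x_center, y_center, width, height)


# Hypothetical 640x480 image with a box from (100, 120) to (300, 360), class index 0.
line = voc_box_to_yolo_line(0, 100, 120, 300, 360, 640, 480)
print(line)  # 0 0.312500 0.500000 0.312500 0.500000
```

All four box values land in the range 0 to 1, which makes the label independent of the image resolution.
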
The conversion code is as follows.
Put all of the XML files under the GT_PATH directory:

# -*- coding: utf-8 -*-
"""
Time    : 2022/5/14 17:18
Author  : cong
"""
import os
import glob
import xml.etree.ElementTree as ET

# Class names; a name's position in this list is the class index written to the label file.
names = ['hatch', 'cargo', 'aeroplane']
GT_PATH = 'datasets/coco128/labels/train2017/'
# print(GT_PATH)
os.chdir(GT_PATH)
xml_list = glob.glob('*.xml')
os.makedirs("backup", exist_ok=True)
for tmp_file in xml_list:
    # print(tmp_file)
    # 1. create the new label file (YOLO format)
    root = ET.parse(tmp_file).getroot()
    size = root.find('size')
    # The image size is the same for every object, so read it once per file.
    image_w = int(size.find('width').text)
    image_h = int(size.find('height').text)
    # "w" overwrites an existing label file, so a rerun does not duplicate lines.
    with open(os.path.splitext(tmp_file)[0] + ".txt", "w") as new_f:
        for obj in root.findall('object'):
            obj_name = obj.find('name').text
            # names.index raises ValueError if the XML uses a class missing from `names`.
            obj_index = names.index(obj_name)
            bndbox = obj.find('bndbox')
            # float() accepts both integer and decimal pixel coordinates.
            x_min = float(bndbox.find('xmin').text)
            x_max = float(bndbox.find('xmax').text)
            y_min = float(bndbox.find('ymin').text)
            y_max = float(bndbox.find('ymax').text)
            # Normalize the box center and size by the image dimensions.
            x = ((x_min + x_max) / 2) / image_w
            y = ((y_min + y_max) / 2) / image_h
            w = (x_max - x_min) / image_w
            h = (y_max - y_min) / image_h
            new_f.write("%d %.6f %.6f %.6f %.6f\n" % (obj_index, x, y, w, h))
    # 2. move old file (xml format) to backup
    os.rename(tmp_file, os.path.join("backup", tmp_file))
print("Conversion completed!")
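
After the conversion it can be worth sanity-checking the generated TXT files before training. The helper below is a minimal sketch of such a check; `check_label_line` and its messages are hypothetical names introduced here for illustration, not part of the original script or of YOLOv5.

```python
def check_label_line(line, num_classes):
    """Return a list of problems found in one YOLO label line (empty if none)."""
    fields = line.split()
    if len(fields) != 5:
        return ["expected 5 fields, got %d" % len(fields)]
    try:
        class_index = int(fields[0])
        values = [float(v) for v in fields[1:]]
    except ValueError:
        return ["non-numeric field"]
    problems = []
    # The class index must point into the `names` list used during conversion.
    if not 0 <= class_index < num_classes:
        problems.append("class index %d out of range" % class_index)
    # Normalized center and size values must lie between 0 and 1.
    for name, value in zip(("x", "y", "w", "h"), values):
        if not 0.0 <= value <= 1.0:
            problems.append("%s=%s outside [0, 1]" % (name, value))
    return problems


print(check_label_line("0 0.3125 0.5 0.3125 0.5", num_classes=3))
print(check_label_line("5 0.3125 1.2 0.3125 0.5", num_classes=3))
```

To check a whole directory, one could loop over `glob.glob('*.txt')`, call the helper on every line, and pass `len(names)` as `num_classes`.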