Pascal VOC XML
to
YOLOv8 PyTorch TXT
The annotation format originally created for the Visual Object Challenge (VOC) has become a common interchange format for object detection labels. It's well-specified and can be exported from many labeling tools including CVAT, VoTT, and RectLabel.
Unfortunately, no known models directly consume VOC XML labels. That's where Roboflow comes in; it's a universal computer vision format converter that can convert PASCAL VOC into any other format so your data is ready to train in a jiffy.
With Roboflow, you can deploy a computer vision model without having to build your own infrastructure.
The
YOLOX
,
models all use the
data format.
<annotation>
<folder></folder>
<filename>000001.jpg</filename>
<path>000001.jpg</path>
<source>
<database>roboflow.ai</database>
</source>
<size>
<width>500</width>
<height>375</height>
<depth>3</depth>
</size>
<segmented>0</segmented>
<object>
<name>helmet</name>
<pose>Unspecified</pose>
<truncated>0</truncated>
<difficult>0</difficult>
<occluded>0</occluded>
<bndbox>
<xmin>179</xmin>
<xmax>231</xmax>
<ymin>85</ymin>
<ymax>144</ymax>
</bndbox>
</object>
<object>
<name>helmet</name>
<pose>Unspecified</pose>
<truncated>0</truncated>
<difficult>0</difficult>
<occluded>0</occluded>
<bndbox>
<xmin>112</xmin>
<xmax>135</xmax>
<ymin>145</ymin>
<ymax>175</ymax>
</bndbox>
</object>
</annotation>