In this guide, we show you how to convert data between the
VOCand
CLIPformats for free. You can use your converted data to train
models and other models that support the
CLIP format.
Roboflow is a universal conversion tool for computer vision annotation formats. The Public plan is the best way for those exploring personal projects, class assignments, and other experiments to try Roboflow. To convert your dataset, start by creating a free workspace on the Public plan.
Once your account has been created, click Create New Project.
Upload your data to Roboflow by dragging and dropping your Pascal VOC XML images and annotations into the upload space.
Next, click "Generate New Version" to generate a new version of your dataset:
You can then apply any preprocessing or augmentation steps to your dataset:
After generating, you will be prompted to Export your dataset. You can choose to receive your dataset as a .zip file or a curl download link. Choose OpenAI CLIP Classification when asked in what format you want to export your data. You will see a dropdown with various options like this:
Congratulations, you have successfully converted your dataset from Pascal VOC XML format to OpenAI CLIP Classification format!
Ready to use your new CLIP dataset? Great!
Once you've got your object detection or classification dataset into our CLIP format, you'll want to use the "Get Link" option to grab your code snippet and use it with our CLIP colab notebook as described in our OpenAI CLIP tutorial.
Yes! It is free to convert Pascal VOC XML data into the OpenAI CLIP Classification format on the Roboflow platform.
If you have between a few and a few thousand images, converting data between these formats will be quick. But, the time it takes to convert between data formats increases with the more images you have.