Qwen2.5-VL is a vision-language model that understands both images and text. You can use it for optical character recognition (OCR), visual question answering (VQA), and other vision tasks.
You can now use Qwen2.5-VL in Roboflow Workflows with a Dedicated Deployment. You can then deploy the Workflow on your own hardware or on your Dedicated Deployment.
To use this feature, open a Workflow in Roboflow, then add the "Qwen2.5-VL" block. You can then configure the prompts you want to send to the model.
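Once the Workflow is saved, you can call it from code. The following is a minimal sketch, assuming the `inference-sdk` Python package is installed and that your Workflow exposes an image input named `image` and a text parameter named `prompt`. The workspace name, Workflow ID, and parameter names below are hypothetical placeholders; replace them with the values shown for your own Workflow.

```python
def build_workflow_request(image_path, prompt,
                           workspace="your-workspace",
                           workflow_id="your-qwen-workflow"):
    """Assemble keyword arguments for InferenceHTTPClient.run_workflow.

    The workspace, workflow_id, and the "image" / "prompt" input names are
    placeholders (assumptions); they must match your own Workflow.
    """
    return {
        "workspace_name": workspace,
        "workflow_id": workflow_id,
        "images": {"image": image_path},
        "parameters": {"prompt": prompt},
    }


def run_qwen_workflow(image_path, prompt, api_url, api_key):
    """Run the Workflow against a server such as your Dedicated Deployment."""
    # Imported lazily so build_workflow_request works without the SDK installed.
    from inference_sdk import InferenceHTTPClient

    client = InferenceHTTPClient(api_url=api_url, api_key=api_key)
    return client.run_workflow(**build_workflow_request(image_path, prompt))
```

To try it, call `run_qwen_workflow("receipt.jpg", "Read all text in this image.", api_url=..., api_key=...)` with the URL of your Dedicated Deployment and your Roboflow API key. The shape of the returned result depends on the outputs you defined in the Workflow.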