No items found.
Use the widget below to experiment with PaliGemma Image Captioning. You can detect COCO classes such as people, vehicles, animals, household items.
You can use the set of PaliGemma weights trained on the COCO Captions dataset for zero-shot image captioning.
See the official PaliGemma model card for more information about PaliGemma.
PaliGemma Image Captioning
is licensed under a
license.
You can use Roboflow Inference to deploy a
PaliGemma Image Captioning
API on your hardware. You can deploy the model on CPU (i.e. Raspberry Pi, AI PCs) and GPU devices (i.e. NVIDIA Jetson, NVIDIA T4).
Below are instructions on how to deploy your own model API.