Products
Platform
Universe
Open source computer vision datasets and pre-trained models
Annotate
Label images fast with AI-assisted data annotation
Train
Hosted model training infrastructure and GPU access
Workflows
Low-code interface to build pipelines and applications
Deploy
Run models on device, at the edge, in your VPC, or via API
Solutions
By Industry
Aerospace & Defence
Agriculture
Automotive
Banking & Finance
Government
Healthcare & Medicine
Manufacturing
Oil & Gas
Retail & Ecommerce
Safety & Security
Telecommunications
Transportation
Utilities
Developers
Resources
Documentation
User Forum
Computer Vision Models
Blog
Convert Annotation Formats
Learn Computer Vision
Inference Templates
Weekly Product Webinar
Pricing
Docs
Blog
Sign In
Get Started
Models
Florence-2 vs. Grounding DINO
Florence-2 vs. Grounding DINO
Both Florence 2 and are commonly used in computer vision projects. Below, we compare and contrast Florence 2 and .
Florence 2
Models
Florence 2
Florence-2 is a lightweight vision-language model open-sourced by Microsoft under the MIT license.
Learn more about Florence 2
Learn more about
Model Type
Object Detection
--
--
Model Features
Item 1 Info
Item 2 Info
Architecture
--
--
Frameworks
--
--
Annotation Format
Instance Segmentation
Instance Segmentation
GitHub
--
View Repo
--
View Repo
GitHub Stars
--
--
License
MIT
--
--
Paper
--
View Paper
--
View Paper
Training Notebook
Train on Colab
--
Train on Colab
--
Deploy Model
--
Deploy with Roboflow
--
Deploy with Roboflow
Compare Alternatives
--
Compare with...
OneFormer
DETR
DETIC
RTMDet
Kosmos-2
LLaVA-1.5
GPT-4o
4M
YOLOv9
YOLOv5
YOLOv7
MT-YOLOv6
YOLOv8
SegFormer
Mask RCNN
EfficientNet
Detectron2
Faster R-CNN
YOLOX
YOLOS
YOLOR
Scaled YOLOv4
YOLOv4 Tiny
YOLOv4 PyTorch
YOLOv4 Darknet
YOLOv3 PyTorch
YOLOv3 Keras
PaliGemma
--
Compare with...
No items found.
Compare Florence 2 and with Autodistill