Florence-2 vs. 4M

Both Florence-2 and 4M are commonly used in computer vision projects. Below, we compare the two models.

Models

Florence-2

Florence-2 is a lightweight vision-language model open-sourced by Microsoft under the MIT license.

4M

4M is a multimodal Transformer model developed by EPFL and Apple that handles a range of vision and language tasks.
Feature            Florence-2             4M
Model Type         Object Detection       Multimodal Model
Architecture       --                     --
Frameworks         --                     PyTorch
Annotation Format  Instance Segmentation  Instance Segmentation
GitHub Stars       --                     1.1k
License            MIT                    Apache 2.0
Training Notebook  --                     --

Compare Florence-2 and 4M with Autodistill
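
This page does not spell out a comparison procedure, so the following is only a minimal sketch of one way to compare two models on the same image: measure how many boxes from one model are matched by the other at a given IoU threshold. It assumes each model's output has already been converted to pixel-coordinate boxes of the form (x_min, y_min, x_max, y_max). The names `iou`, `agreement`, `florence_boxes`, and `fourm_boxes` are hypothetical, the box values are made up for illustration, and nothing here calls Autodistill, Florence-2, or 4M.

```python
from typing import List, Tuple

# A box is (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def agreement(boxes_a: List[Box], boxes_b: List[Box], threshold: float = 0.5) -> float:
    """Fraction of boxes in boxes_a greedily matched to a distinct box in boxes_b."""
    if not boxes_a:
        return 0.0
    unmatched = list(boxes_b)
    matched = 0
    for box in boxes_a:
        # Pick the remaining candidate with the highest overlap, if any.
        best = max(unmatched, key=lambda c: iou(box, c), default=None)
        if best is not None and iou(box, best) >= threshold:
            unmatched.remove(best)  # each box in boxes_b can be matched once
            matched += 1
    return matched / len(boxes_a)


# Hypothetical boxes standing in for each model's predictions on one image.
florence_boxes = [(10, 10, 110, 110), (200, 50, 300, 150)]
fourm_boxes = [(12, 8, 108, 112), (400, 400, 450, 450)]

print(agreement(florence_boxes, fourm_boxes))
```

The matching is greedy and order-dependent, which keeps the sketch short; a stricter comparison would also match on class labels and report agreement in both directions.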
