Models
Florence-2 vs. LLaVA

Florence-2 vs. LLaVA

Both Florence 2 and LLaVA-1.5 are commonly used in computer vision projects. Below, we compare and contrast Florence 2 and LLaVA-1.5.

Models

icon-model

Florence 2

Florence-2 is a lightweight vision-language model open-sourced by Microsoft under the MIT license.
icon-model

LLaVA-1.5

LLaVA is an open source multimodal language model that you can use for visual question answering and has limited support for object detection.
Model Type
Object Detection
--
Object Detection
--
Model Features
Item 1 Info
Item 2 Info
Architecture
--
--
Frameworks
--
--
Annotation Format
Instance Segmentation
Instance Segmentation
GitHub Stars
--
16,000
--
License
MIT
--
Apache-2.0
--
Training Notebook

Compare Florence 2 and LLaVA-1.5 with Autodistill

Models

Florence-2 vs. LLaVA

.

Both

Florence 2

and

LLaVA-1.5

are commonly used in computer vision projects. Below, we compare and contrast

Florence 2

and

LLaVA-1.5
  Florence 2 LLaVA-1.5
Date of Release Jun 19, 2024 Oct 05, 2023
Model Type Object Detection Object Detection
Architecture
GitHub Stars 16000

Florence 2

Florence-2 is a lightweight vision-language model open-sourced by Microsoft under the MIT license.

How to AugmentHow to LabelHow to Plot PredictionsHow to Filter PredictionsHow to Create a Confusion Matrix

LLaVA-1.5

LLaVA is an open source multimodal language model that you can use for visual question answering and has limited support for object detection.

How to AugmentHow to LabelHow to Plot PredictionsHow to Filter PredictionsHow to Create a Confusion Matrix

Compare Florence 2 to other models

Compare LLaVA-1.5 to other models

Deploy a computer vision model today

Join 250,000 developers curating high quality datasets and deploying better models with Roboflow.

Get started