We ran seven tests across five state-of-the-art Large Multimodal Models (LMMs) on November 23rd, 2023. GPT-4V passed at four of seven tests and BakLLaVA passed at one of seven tests.
Here are the results:
Based on our tests, GPT-4V performs better than BakLLaVA at multimodal tasks.
Download the raw image results from our analysis.
.
Both
and
are commonly used in computer vision projects. Below, we compare and contrast
and
We ran seven tests across five state-of-the-art Large Multimodal Models (LMMs) on November 23rd, 2023. GPT-4V passed at four of seven tests and BakLLaVA passed at one of seven tests.
Here are the results:
Based on our tests, GPT-4V performs better than BakLLaVA at multimodal tasks.
Download the raw image results from our analysis.
BakLLaVA is an LMM developed by LAION, Ontocord, and Skunkworks AI. BakLLaVA uses a Mistral 7B base augmented with the LLaVA 1.5 architecture.
How to AugmentHow to LabelHow to Plot PredictionsHow to Filter PredictionsHow to Create a Confusion MatrixGPT-4 with Vision is a multimodal language model developed by OpenAI.
How to AugmentHow to LabelHow to Plot PredictionsHow to Filter PredictionsHow to Create a Confusion MatrixJoin 250,000 developers curating high quality datasets and deploying better models with Roboflow.
Get started