LLaVA-1.5

LLaVA is an open source multimodal language model that you can use for visual question answering and has limited support for object detection.

Explore This Model on Roboflow

No items found.

Join over 250,000 developers managing computer vision data on Roboflow.

VentureBeatTechCrunchInteresting EngineeringInternational Business TimesU.S. News & World ReportYahoo Finance