Kosmos-2

Kosmos-2 is a multimodal language model that can detect objects in images and ground text phrases to image regions.

