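GroundedSAM combines Grounding DINO with the Segment Anything Model (SAM) to identify and segment objects in an image given text captions. Grounding DINO detects the objects that match each caption and returns bounding boxes; SAM then uses those boxes as prompts to produce a segmentation mask for each object.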
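The two-stage flow (a text-conditioned detector proposes boxes, then a box-prompted segmenter turns each box into a mask) can be sketched as follows. This is a minimal illustration under stated assumptions, not the real Grounding DINO or SAM API: `grounded_segment`, `toy_detect`, and `toy_segment` are hypothetical names, and the toy models operate on a small grayscale grid rather than on a real image.

```python
from typing import Callable, Dict, List, Tuple

Box = Tuple[int, int, int, int]   # (x0, y0, x1, y1), upper bounds exclusive
Image = List[List[int]]           # toy grayscale image: rows of pixel values
Mask = List[List[bool]]           # same shape as the image


def grounded_segment(
    image: Image,
    captions: List[str],
    detect: Callable[[Image, str], List[Box]],
    segment: Callable[[Image, Box], Mask],
) -> List[Dict]:
    """Run a caption-driven detector, then a box-prompted segmenter."""
    results = []
    for caption in captions:
        # Stage 1: the detector returns boxes for regions matching the caption.
        for box in detect(image, caption):
            # Stage 2: the segmenter refines each box into a pixel mask.
            results.append(
                {"caption": caption, "box": box, "mask": segment(image, box)}
            )
    return results


def toy_detect(image: Image, caption: str) -> List[Box]:
    """Hypothetical stand-in for a detector: only understands 'bright'."""
    if caption != "bright":
        return []
    coords = [
        (x, y)
        for y, row in enumerate(image)
        for x, value in enumerate(row)
        if value > 0
    ]
    if not coords:
        return []
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    # One box enclosing every non-zero pixel.
    return [(min(xs), min(ys), max(xs) + 1, max(ys) + 1)]


def toy_segment(image: Image, box: Box) -> Mask:
    """Hypothetical stand-in for a segmenter: non-zero pixels inside the box."""
    x0, y0, x1, y1 = box
    return [
        [(x0 <= x < x1 and y0 <= y < y1 and value > 0) for x, value in enumerate(row)]
        for y, row in enumerate(image)
    ]


if __name__ == "__main__":
    image = [
        [0, 0, 0],
        [0, 5, 0],
        [0, 7, 0],
    ]
    for result in grounded_segment(image, ["bright"], toy_detect, toy_segment):
        print(result["caption"], result["box"])
```

Passing the detector and segmenter as callables keeps the two stages separate, so in practice each stand-in could be replaced by a wrapper around the corresponding real model without changing the pipeline function.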

