Grounding DINO Introduction
Introduction to Grounding DINO - Open vocabulary object detection with vision and language
Grounding DINO is a state-of-the-art, zero-shot object detection model developed by IDEA-Research. It fuses a Transformer-based object detector (DINO) with a language model (like BERT), allowing it to identify and draw bounding boxes around arbitrary objects using natural language text prompts without needing retraining
Introduction to Grounding DINO - Open vocabulary object detection with vision and language