Computer Vision Toolbox Model for Grounding DINO Object Detection
Grounding DINO is a zero-shot pre-trained Vision Language Model (VLM) that enables open vocabulary, text-prompted object detection.
21 Downloads
Updated
25 Nov 2025
Grounding DINO enables zero-shot object detection from textual inputs, without requiring dedicated class training on the input term. It can therefore detect objects outside of its training set. It combines a Transformer-based DINO object detector with grounded pre-training.
MATLAB Release Compatibility
Created with
R2026a
Compatible with R2026a
Platform Compatibility
Windows macOS (Apple Silicon) macOS (Intel) LinuxTags
Discover Live Editor
Create scripts with code, output, and formatted text in a single executable document.
