Object Detection

State of the art object detection models to localize subjects in images. From the latest YOLO models to DETR, we have the most popular models in easy to use formats.

Looking for a dataset? Explore object detection datasets.

Deploy select models (i.e. YOLOv8, CLIP) using the Roboflow Hosted API, or your own hardware using Roboflow Inference.
Roboflow

RF-DETR

RF-DETR is a SOTA, real-time object detection model architecture developed by Roboflow and released under the Apache 2.0 license.
Object Detection
Deploy with Roboflow

YOLOv12

YOLOv12 is a state-of-the-art computer vision model you can use for detection, segmentation, and more.
Object Detection
Deploy with Roboflow

YOLOv9

YOLOv9 is an object detection model architecture released on February 21st, 2024.
Object Detection
Deploy with Roboflow

YOLO-World

YOLO-World is a zero-shot object detection model.
Object Detection
Deploy with Roboflow

GroundingDINO

Grounding DINO is a zero-shot object detection model made by combining a Transformer-based DINO detector and grounded pre-training.
Object Detection
Deploy with Roboflow
Ultralytics

YOLOv8

YOLOv8 is a state-of-the-art object detection and image segmentation model created by Ultralytics, the developers of YOLOv5.
Object Detection
Deploy with Roboflow
Apple

4M

The 4M model is a versatile multimodal Transformer model developed by EPFL and Apple, capable of handling a handful of vision and language tasks.
Object Detection
Deploy with Roboflow

YOLOv10

YOLOv10 is a real-time object detection model introduced in the paper "YOLOv10: Real-Time End-to-End Object Detection".
Object Detection
Deploy with Roboflow
Ultralytics

YOLOv8 Oriented Bounding Boxes

You can retrieve bounding boxes whose edges match an angled object by training an oriented bounding boxes object detection model, such as YOLOv8's Oriented Bounding Boxes model.
Object Detection
Deploy with Roboflow
Meta

CoDet

CoDet is an open vocabulary zero-shot object detection model.
Object Detection
Deploy with Roboflow

LLaVA-1.5

LLaVA is an open source multimodal language model that you can use for visual question answering and has limited support for object detection.
Object Detection
Deploy with Roboflow
Deci AI

YOLO-NAS

YOLO-NAS is an object detection model developed by Deci that achieves SOTA performances compared to YOLOv5, v7, and v8.
Object Detection
Deploy with Roboflow
Google

MediaPipe

Object Detection
Deploy with Roboflow

Co-DETR

Co-Deformable-DETR (Co-DETR) is an object detection model architecture introduced in the paper "DETRs with Collaborative Hybrid Assignments Training".
Object Detection
Deploy with Roboflow

YOLOv7

YOLOv7 is a state of the art object detection model.
Object Detection
Deploy with Roboflow
Meituan

MT-YOLOv6

MT-YOLOv6 is a YOLO based model released in 2022.
Object Detection
Deploy with Roboflow

YOLOX

YOLOX is a high-performance object detection model.
Object Detection
Deploy with Roboflow

YOLOS

YOLOS looks at patches of an image to to form "patch tokens", which are used in place of the traditional wordpiece tokens in NLP.
Object Detection
Deploy with Roboflow

YOLOR

YOLOR (You Only Learn One Representation) is an object detection model that uses both implicit and explicit knowledge to make predictions.
Object Detection
Deploy with Roboflow

YOLOv4 Tiny

The tiny and fast version of YOLOv4 - good for training and deployment on limited compute resources, and getting a feel for your dataset
Object Detection
Deploy with Roboflow
Meta

DETR

Detection Transformer (DETR) is an end-to-end object detection model implemented using the Transformer architecture.
Object Detection
Deploy with Roboflow
Ultralytics

YOLOv5 Oriented Bounding Boxes

YOLOv5-OBB is a variant of YOLOv5 that supports oriented bounding boxes. This model is designed to yield predictions that better fit objects that are positioned at an angle.
Object Detection
Deploy with Roboflow
Ultralytics

YOLOv5

A very fast and easy to use PyTorch model that achieves state of the art (or near state of the art) results.
Object Detection
Deploy with Roboflow
Google

EfficientDet

EfficientDet achieves the best performance in the fewest training epochs among object detection model architectures, making it a highly scalable architecture especially when operating with limited compute.
Object Detection
Deploy with Roboflow

EfficientDet (D7) Tensorflow 2

A scalable, state of the art object detection model, implemented here within the TensorFlow 2 Object Detection API.
Object Detection
Deploy with Roboflow

YOLOv3 PyTorch

Though it is no longer the most accurate object detection algorithm, YOLO v3 is still a very good choice when you need real-time detection while maintaining excellent accuracy. PyTorch version.
Object Detection
Deploy with Roboflow

MobileNet SSD v2

This architecture provides good realtime results on limited compute. It's designed to run in realtime (30 frames per second) even on mobile devices.
Object Detection
Deploy with Roboflow

YOLOE

YOLOE is a new object detection and segmentation model developed by the creators of YOLOv10.
Object Detection
Deploy with Roboflow
Ultralytics

YOLO11

YOLO11 is a computer vision model that you can use for object detection, segmentation, and classification.
Object Detection
Deploy with Roboflow

RT-DETR

Object Detection
Deploy with Roboflow
Meta

VLPart

VLPart, developed by Meta Research, is an object detection and segmentation model that works with an open vocabulary
Object Detection
Deploy with Roboflow
OpenAI

GPT-4 with Vision

GPT-4 with Vision is a multimodal language model developed by OpenAI.
Object Detection
Deploy with Roboflow

Grounding DINO

Grounding DINO is a state-of-the-art zero-shot object detection model, developed by IDEA Research.
Object Detection
Deploy with Roboflow
Google

OWL ViT

OWL-ViT is a transformer-based object detection model developed by Google Research.
Object Detection
Deploy with Roboflow

OWLv2

OWLv2 is a transformer-based object detection model developed by Google Research. OWLv2 is the successor to OWL ViT.
Object Detection
Deploy with Roboflow

Kosmos-2

Kosmos-2 is a multimodal language model capable of object detection and grounding text in images.
Object Detection
Deploy with Roboflow

L2CS-Net

L2CS-Net is a gaze estimation model that enables you to calculate where someone is looking and in what direction someone is looking.
Object Detection
Deploy with Roboflow
Mindee

DocTR

DocTR is an Optical Character Recognition tool powered by deep learning.
Object Detection
Deploy with Roboflow

DINOv2

DINOv2 is a self-supervised method for training computer vision models developed by Meta Research and released in April 2023.
Object Detection
Deploy with Roboflow

RTMDet

RTMDet is an efficient real-time object detector, with self-reported metrics outperforming the YOLO series. It achieves 52.8% AP on COCO with 300+ FPS on an NVIDIA 3090 GPU, making it one of the fastest and most accurate object detectors available as of writing this post.
Object Detection
Deploy with Roboflow

ByteTrack

ByteTrack is a multi-object tracking computer vision algorithm.
Object Detection
Deploy with Roboflow

Scaled YOLOv4

Scaled YOLOv4 is an extension of the YOLOv4 research implemented in the YOLOv5 PyTorch framework.
Object Detection
Deploy with Roboflow

YOLOv4 Darknet

YOLOv4 has emerged as the best real time object detection model. YOLOv4 carries forward many of the research contributions of the YOLO family of models along with new modeling and data augmentation techniques. This implementation is in Darknet.
Object Detection
Deploy with Roboflow

Faster R-CNN

One of the most accurate object detection algorithms but requires a lot of power at inference time. A good choice if you can do processing asynchronously on a server.
Object Detection
Deploy with Roboflow

YOLOv3 Keras

Though it is no longer the most accurate object detection algorithm, YOLO v3 is still a very good choice when you need real-time detection while maintaining excellent accuracy. Keras implementation.
Object Detection
Deploy with Roboflow

YOLOv4 PyTorch

YOLOv4 has emerged as the best real time object detection model. YOLOv4 carries forward many of the research contributions of the YOLO family of models along with new modeling and data augmentation techniques. This implementation is in PyTorch.
Object Detection
Deploy with Roboflow
Meta

Detectron2

Detectron2 is model zoo of it's own for computer vision models written in PyTorch.
Object Detection
Deploy with Roboflow

Frequently Asked Questions

What models are used for object detection?

The YOLO family of models (i.e. YOLOv7, YOLOv7) are commonly used in object detection use cases. YOLO has been developed and refined over a years-long period and is still in active development. The latest model, YOLOv7, achieves state-of-the-art performance on object detection in the MS COCO dataset. Other models like Detectron2 and EfficientDet are also used in object detection.

What are the use cases for object detection?

Object detection has many use cases, including:

  • Identifying whether someone is wearing the requisite safety equipment in a controlled environment (i.e. a construction site or a medical area).
  • Finding objects on a road to help guide an autonomous car.
  • Identifying animals in a wildlife reserve.
  • Counting pills to be dispensed into bottles for pharmacies.

What is object detection?

Object detection is a computer vision solution that focuses on identifying the location of objects in an image or video. Each identified object is assigned a label that represents its contents. Using object detection, you can also count the number of times different objects appear in an image.