Top Real-Time Vision Models

Explore models that run in real time (or close to real time).

Deploy select models (e.g., YOLOv8, CLIP) using the Roboflow Hosted API, or on your own hardware using Roboflow Inference.
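As a rough illustration of what a hosted deployment call might look like, the sketch below builds (but does not send) an HTTP request using only the Python standard library. The URL shape, the base64 request body, and the names `my-project` and `YOUR_API_KEY` are assumptions made for illustration; check the Roboflow documentation for the exact endpoint and parameters for your model.

```python
import base64
import urllib.parse
import urllib.request


def build_hosted_request(model_id, version, api_key, image_bytes):
    """Build (but do not send) a POST request for a hosted detection endpoint."""
    # Assumed URL shape: https://detect.roboflow.com/<model_id>/<version>?api_key=<key>
    query = urllib.parse.urlencode({"api_key": api_key})
    url = f"https://detect.roboflow.com/{model_id}/{version}?{query}"
    # Assumption: the image is sent as a base64-encoded request body.
    body = base64.b64encode(image_bytes)
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


# Hypothetical project name, version, and key, used only to show the call.
request = build_hosted_request("my-project", 1, "YOUR_API_KEY", b"fake image bytes")
print(request.full_url)
print(request.get_method())
```

To actually run inference you would pass the request to `urllib.request.urlopen` with real image bytes and a valid key, then parse the JSON response.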

Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

GPT

GPT-4.1 is a multimodal model developed by OpenAI that comes in three sizes: GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

129M

Architecture:

Transformers

RF-DETR is a SOTA, real-time object detection model architecture developed by Roboflow and released under the Apache 2.0 license. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Claude 3.7 is a multimodal "hybrid reasoning" model developed by Anthropic. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Phi-4 Multimodal is a multimodal language model developed by Microsoft. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Transformers

Co-Deformable-DETR (Co-DETR) is an object detection model architecture introduced in the paper "DETRs with Collaborative Hybrid Assignments Training". Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Transformers

D-FINE is a real-time object detection model introduced in the paper "D-FINE: Redefine Regression Task of DETRs as Fine‑grained Distribution Refinement". Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

DEIM is a training framework for DETR models. The framework strives to enable "faster convergence and improved accuracy" in models. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

YOLO

YOLOE is a new object detection and segmentation model developed by the creators of YOLOv10. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Transformers

SmolVLM2 is a multimodal image and video understanding model developed by engineers on the Hugging Face TB (Textbook) Research team. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Moondream 2 is the latest model in the Moondream series of “tiny vision language models”. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Transformers

Gemma 3 is a multimodal language model developed by Google. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

GPT

OpenAI o3-mini is a multimodal reasoning model developed by OpenAI. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Transformers

Qwen2.5-VL is a multimodal vision-language model developed by the Qwen team at Alibaba Cloud. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

YOLO

YOLOv12 is a state-of-the-art computer vision model you can use for detection, segmentation, and more. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

PaliGemma-2 is a multimodal model developed by Google. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

YOLO11 is a computer vision model that you can use for object detection, segmentation, and classification. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Optical Character Recognition
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Florence-2 OCR is a subset of Florence-2 that can read characters in images. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

6.6B

Architecture:

Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Segment Anything

Segment Anything 2 (SAM 2) is a real-time image and video segmentation model. Learn more »
Keypoint Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

76M

Architecture:

DETR

Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

2800000000.0

MB

Parameters:

705M

Architecture:

The 4M model is a versatile multimodal Transformer model developed by EPFL and Apple, capable of handling a handful of vision and language tasks. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

770000000.0

MB

Parameters:

770M

Architecture:

Florence-2 is a lightweight vision-language model open-sourced by Microsoft under the MIT license. Learn more »
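The Parameters field on a card can be turned into a rough estimate of how much space the raw weights take. The sketch below is simple arithmetic under stated assumptions: every parameter is stored at a single precision, 1 MB is taken as 10**6 bytes, and file-format overhead is ignored. The function name and the precision table are illustrative, not taken from any library.

```python
# Bytes used per parameter at a few common storage precisions.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def approx_weights_mb(num_params, dtype="float32"):
    """Rough size of the raw weights in megabytes (1 MB = 10**6 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1_000_000


# 770 million parameters, the count listed on the card above.
for dtype in BYTES_PER_PARAM:
    print(dtype, approx_weights_mb(770_000_000, dtype), "MB")
```

Real checkpoints can be larger or smaller than this estimate depending on the file format, mixed precision, and any bundled tokenizer or configuration data.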
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

You can use the set of PaliGemma weights trained on the OCRVQA dataset for performing OCR on images. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

You can use the set of PaliGemma weights trained on the DocVQA dataset for asking questions about documents. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

You can use the set of PaliGemma weights trained on the VQAv2 dataset for asking questions about the contents of images. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

You can use the set of PaliGemma weights trained on the Screen2Words dataset for asking questions about website screenshots. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

You can use the set of PaliGemma weights trained on the COCO Captions dataset for zero-shot image captioning. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

29.5M

Architecture:

YOLO

YOLOv10 is a real-time object detection model introduced in the paper "YOLOv10: Real-Time End-to-End Object Detection". Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

GPT-4o is OpenAI’s third major iteration of GPT-4, expanding on the capabilities of GPT-4 with Vision. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

4000000000.0

MB

Parameters:

3B

Architecture:

PaliGemma is a vision language model (VLM) by Google that has multimodal capabilities. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

MMOCR is an Optical Character Recognition model zoo implemented with the MMDetection package. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

TrOCR is a Transformer-based OCR model developed by researchers from Microsoft Research. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Tesseract is a highly popular OCR engine, now developed primarily as an open-source project. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Surya is a Python package designed for OCR and document layout analysis. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Gemini is a family of Large Multimodal Models (LMMs) developed by Google DeepMind focused specifically on multimodality. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

25.6M

Architecture:

Residual Neural Networks

ResNet-50 is a popular image classification model architecture. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

50M

Architecture:

Object Detection
Deploy on Device with Roboflow✅

Model Size:

69500000.0

MB

Parameters:

69.5M

Architecture:

YOLO

You can retrieve bounding boxes whose edges match an angled object by training an oriented bounding boxes object detection model, such as YOLOv8's Oriented Bounding Boxes model. Learn more »
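To make the idea of an oriented bounding box concrete, here is a minimal sketch that converts a center, size, and rotation angle into four corner points. The `(cx, cy, width, height, angle_deg)` parameterization and the function name `obb_corners` are assumptions chosen for illustration; a given model may report oriented boxes in a different format.

```python
import math


def obb_corners(cx, cy, width, height, angle_deg):
    """Return the four corner points of a box rotated by angle_deg around its center."""
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    half_w, half_h = width / 2, height / 2
    corners = []
    # Corner offsets in the box's own (unrotated) frame.
    offsets = [(-half_w, -half_h), (half_w, -half_h), (half_w, half_h), (-half_w, half_h)]
    for dx, dy in offsets:
        # Rotate the offset, then translate it to the box center.
        x = cx + dx * cos_t - dy * sin_t
        y = cy + dx * sin_t + dy * cos_t
        # Round away floating-point noise for readability.
        corners.append((round(x, 6), round(y, 6)))
    return corners


print(obb_corners(50, 50, 40, 20, 0))   # axis-aligned box
print(obb_corners(50, 50, 40, 20, 90))  # same box rotated a quarter turn
```

With an angle of 0 the result is an ordinary axis-aligned box; any other angle yields corners that follow the edges of the tilted object.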
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

AltCLIP is a zero-shot image classification model. Learn more »
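CLIP-style zero-shot classification generally works by embedding an image and a set of text labels into the same vector space, then picking the label whose embedding is most similar to the image embedding. The sketch below shows only that comparison step. The three-dimensional vectors are made-up placeholders, not outputs of any real model, and the function names are illustrative.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def zero_shot_classify(image_embedding, label_embeddings):
    """Return the best-matching label and the similarity score for every label."""
    scores = {
        label: cosine_similarity(image_embedding, embedding)
        for label, embedding in label_embeddings.items()
    }
    return max(scores, key=scores.get), scores


# Placeholder embeddings; a real model would produce these from text prompts and an image.
label_embeddings = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.1, 0.9, 0.0],
    "car": [0.0, 0.1, 0.9],
}
image_embedding = [0.8, 0.3, 0.1]

best, scores = zero_shot_classify(image_embedding, label_embeddings)
print(best)
```

Because the labels are just text, you can change the label set without retraining, which is what makes the approach zero-shot.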
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

RemoteCLIP is a zero-shot classification model for remote sensing. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

BioCLIP is a vision foundation model for the Tree of Life. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

MobileCLIP is an image embedding model developed by Apple and introduced in the paper "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training". Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

878M

Architecture:

SigLIP is an image embedding model defined in the "Sigmoid Loss for Language Image Pre-Training" paper. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

57.3M

Architecture:

YOLO

YOLOv9 is an object detection model architecture released on February 21st, 2024. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

69M

Architecture:

YOLO

YOLO-World is a zero-shot object detection model. Learn more »
Keypoint Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

YOLO

YOLO-NAS Pose is a keypoint detection model developed by Deci AI. Learn more »
Keypoint Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

YOLO

The YOLOv8 pose estimation model allows you to detect keypoints in an image. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Grounded EdgeSAM is a combination of Grounding DINO, a zero-shot object detection model, and EdgeSAM, a fast zero-shot image segmentation model. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

13B

Architecture:

BakLLaVA is an LMM developed by LAION, Ontocord, and Skunkworks AI. BakLLaVA uses a Mistral 7B base augmented with the LLaVA 1.5 architecture. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

6.5B

Architecture:

CogVLM shows strong performance in Visual Question Answering (VQA) and other vision tasks. Learn more »
Multimodal Model
Deploy on Device with Roboflow✅

Model Size:

7000000000.0

MB

Parameters:

Architecture:

Qwen-VL is an LMM developed by Alibaba Cloud. Qwen-VL accepts images, text, and bounding boxes as inputs. The model can output text and bounding boxes. Qwen-VL naturally supports English, Chinese, and multilingual conversation. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

VLPart, developed by Meta Research, is an object detection and segmentation model that works with an open vocabulary. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

CoDet is an open vocabulary zero-shot object detection model. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Transformer

GPT-4 with Vision is a multimodal language model developed by OpenAI. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Grounding DINO is a state-of-the-art zero-shot object detection model, developed by IDEA Research. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Combination of Grounding DINO and Segment Anything

GroundedSAM combines Grounding DINO with the Segment Anything Model to identify and segment objects in an image given text captions. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Combination of Segment Anything and CLIP

Use Grounding DINO, Segment Anything, and CLIP to label objects in images. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

BLIPv2 is a multimodal model developed by Salesforce Research. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

OWL-ViT is a transformer-based object detection model developed by Google Research. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

FastViT is a fast image classification model developed by Apple. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

CLIP

MetaCLIP is a zero-shot classification and embedding model developed by Meta AI. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

OWLv2 is a transformer-based object detection model developed by Google Research. OWLv2 is the successor to OWL ViT. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

13B

Architecture:

LLaVA is an open-source multimodal language model that you can use for visual question answering and that has limited support for object detection. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Kosmos-2 is a multimodal language model capable of object detection and grounding text in images. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

L2CS-Net is a gaze estimation model that enables you to calculate where, and in what direction, someone is looking. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

DocTR is an Optical Character Recognition tool powered by deep learning. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

DINOv2 is a self-supervised method for training computer vision models developed by Meta Research and released in April 2023. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

RTMDet is an efficient real-time object detector, with self-reported metrics outperforming the YOLO series. It achieves 52.8% AP on COCO with 300+ FPS on an NVIDIA 3090 GPU, making it one of the fastest and most accurate object detectors available at the time of writing. Learn more »
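A throughput figure such as 300 FPS is easier to reason about as a per-frame time budget. The sketch below is plain arithmetic. The helper names are illustrative, and it assumes frames are processed one at a time with no batching, pre-processing, or transfer overhead.

```python
def frame_latency_ms(fps):
    """Average time spent per frame, in milliseconds, at a given throughput."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


def keeps_up_with(model_fps, stream_fps):
    """True if the model processes frames at least as fast as the stream produces them."""
    return frame_latency_ms(model_fps) <= frame_latency_ms(stream_fps)


# 300 FPS is the throughput quoted for RTMDet above.
print(f"{frame_latency_ms(300):.2f} ms per frame")
print(keeps_up_with(model_fps=300, stream_fps=30))
```

At 300 FPS each frame takes roughly 3.33 ms, comfortably inside the roughly 33 ms available per frame on a 30 FPS video stream. Measured speed on your own hardware may differ considerably from published benchmarks.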
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

A simple, fully convolutional model for real-time instance segmentation. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

ByteTrack is a multi-object tracking computer vision algorithm. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

636M

Architecture:

FastSAM is an image segmentation model trained using 2% of the data in the Segment Anything Model SA-1B dataset. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

25000000.0

MB

Parameters:

Architecture:

Detic is an open source segmentation model developed by Meta Research and released in 2022. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

67000000.0

MB

Parameters:

Architecture:

YOLO

YOLO-NAS is an object detection model developed by Deci that achieves state-of-the-art performance compared to YOLOv5, YOLOv7, and YOLOv8. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Segment Anything (SAM) is an image segmentation model developed by Meta Research, capable of doing zero-shot segmentation. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Grounding DINO is a zero-shot object detection model made by combining a Transformer-based DINO detector and grounded pre-training. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

60000000.0

MB

Parameters:

Architecture:

Transformers

Detection Transformer (DETR) is an end-to-end object detection model implemented using the Transformer architecture. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

68.2M

Architecture:

YOLO

An image classification model built using YOLOv8. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

68.2M

Architecture:

YOLO

The state-of-the-art YOLOv8 model comes with support for instance segmentation tasks. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

68200000.0

MB

Parameters:

Architecture:

YOLO, CNN

YOLOv8 is a state-of-the-art object detection and image segmentation model created by Ultralytics, the developers of YOLOv5. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

150M

Architecture:

YOLO

YOLOv7 Instance Segmentation lets you perform segmentation tasks with the YOLOv7 model. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

219M

Architecture:

Transformers

OneFormer is a state-of-the-art multi-task image segmentation framework that is implemented using transformers. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

460K

Architecture:

A fast, simple convolutional neural network that gets the job done for many tasks, including classification. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

68.7

MB

Parameters:

99.1M

Architecture:

CNN, YOLO

YOLOX is a high-performance object detection model. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

202.0

MB

Parameters:

12,786,711 (S2D)

Architecture:

CNN, YOLO

YOLOR (You Only Learn One Representation) is an object detection model that uses both implicit and explicit knowledge to make predictions. Learn more »
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

85M

Architecture:

Transformer, YOLO

YOLOS looks at patches of an image to form "patch tokens", which are used in place of the traditional wordpiece tokens in NLP. Learn more »
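The patch-token idea can be sketched in a few lines: cut the image into non-overlapping squares and flatten each square into one vector. This is a simplified illustration on a toy single-channel grid, with a hypothetical function name. Real Transformer-based vision models also pass each flattened patch through a learned projection and add position information, which is omitted here.

```python
def image_to_patch_tokens(image, patch_size):
    """Split a 2D grid of pixel values into flattened, non-overlapping square patches."""
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")
    tokens = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten one patch row by row into a single token vector.
            token = [
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ]
            tokens.append(token)
    return tokens


# A toy 4x4 single-channel "image" with pixel values 0..15.
image = [[row * 4 + col for col in range(4)] for row in range(4)]
tokens = image_to_patch_tokens(image, patch_size=2)
print(len(tokens))  # number of patch tokens
print(tokens[0])    # the top-left patch, flattened
```

The resulting sequence of tokens plays the role that a sequence of word pieces plays in a language model.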
Object Detection
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

YOLO

Scaled YOLOv4 is an extension of the YOLOv4 research implemented in the YOLOv5 PyTorch framework. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

The Vision Transformer leverages powerful natural language processing embeddings (BERT) and applies them to images. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Mask RCNN is a convolutional neural network for instance segmentation. Learn more »
Semantic Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

Transformers

SegFormer is a computer vision framework used in semantic segmentation tasks, implemented with transformers. Learn more »
Classification
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

YOLO

YOLOv5 Classification is a version of the YOLOv5 model used in single-label and multi-label image classification. Learn more »
Instance Segmentation
Deploy on Device with Roboflow✅

Model Size:

MB

Parameters:

Architecture:

CNN, YOLO

YOLOv5 Instance Segmentation is a version of YOLOv5 that can be used for instance segmentation tasks. Learn more »

Deploy a computer vision model today

Join 800,000+ developers curating high-quality datasets and deploying better models with Roboflow.

Get started