Detection Transformer (DETR) is one of the first end-to-end object detection models implemented using the Transformer architecture. DETR was developed by Facebook Research. The baseline DETR model has an AP score of 42.0% on COCO.
It uses a transformer encoder-decoder architecture on top of a convolutional backbone (e.g., ResNet). This allows DETR to handle variable numbers of objects and complex object interactions more effectively than traditional CNN-based detectors.
By leveraging the power of transformers and attention mechanisms, DETR demonstrates improved generalization capabilities, handling diverse and complex scenes more effectively than traditional object detectors.
DETR
is licensed under a
Apache-2.0
license.
You can use Roboflow Inference to deploy a
DETR
API on your hardware. You can deploy the model on CPU (i.e. Raspberry Pi, AI PCs) and GPU devices (i.e. NVIDIA Jetson, NVIDIA T4).
Below are instructions on how to deploy your own model API.