YOLOv9 vs. EfficientDet: A Detailed Comparison
Choosing the optimal object detection model is critical for computer vision tasks, balancing accuracy, speed, and computational resources. This page provides a detailed technical comparison between Ultralytics YOLOv9 and EfficientDet, two significant models in the object detection landscape. We will delve into their architectural designs, performance benchmarks, and suitable applications to assist you in making an informed decision for your projects.
YOLOv9: State-of-the-Art Accuracy and Efficiency
YOLOv9, introduced in 2024 by Chien-Yao Wang and Hong-Yuan Mark Liao from the Institute of Information Science, Academia Sinica, Taiwan, represents a significant advancement in the YOLO series. It is detailed in their paper "YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information" and implemented in their GitHub repository. YOLOv9 addresses the challenge of information loss in deep networks through innovative architectural elements like Programmable Gradient Information (PGI) and the Generalized Efficient Layer Aggregation Network (GELAN). These innovations ensure that the model learns effectively and maintains high accuracy with fewer parameters, showcasing a strong balance between performance and efficiency.
Technical Details:
- Authors: Chien-Yao Wang, Hong-Yuan Mark Liao
- Organization: Institute of Information Science, Academia Sinica, Taiwan
- Date: 2024-02-21
- Arxiv: https://arxiv.org/abs/2402.13616
- GitHub: https://github.com/WongKinYiu/yolov9
- Docs: https://docs.ultralytics.com/models/yolov9/
Strengths
- State-of-the-art Accuracy: YOLOv9 achieves superior accuracy in object detection, often outperforming competitors at similar parameter counts.
- Efficient Parameter Utilization: PGI and GELAN architectures enhance feature extraction and reduce information loss, leading to better performance with fewer parameters and FLOPs.
- Scalability: The YOLOv9 family includes various model sizes (YOLOv9t to YOLOv9e), offering flexibility for different computational capabilities.
- Ultralytics Ecosystem: While the original research is from Academia Sinica, integration within the Ultralytics framework provides immense benefits. These include ease of use through a simple Python API, extensive documentation, and efficient training processes with readily available pre-trained weights. The well-maintained ecosystem ensures active development, strong community support, and integration with tools like Ultralytics HUB for no-code training.
- Low Memory Footprint: YOLO models typically exhibit lower memory requirements during training compared to many other architectures, especially transformer-based models like RT-DETR.
Weaknesses
- Novelty: As a newer model, real-world deployment examples might be less numerous than for older, established models like EfficientDet, although adoption within the Ultralytics community is rapid.
- Task Specificity: The original YOLOv9 paper focuses primarily on object detection. However, its integration into the Ultralytics ecosystem hints at broader potential, aligning with the multi-task capabilities of models like Ultralytics YOLOv8.
Use Cases
YOLOv9 is particularly well-suited for applications where accuracy and efficiency are paramount, such as:
- High-resolution image analysis, like using computer vision to analyse satellite imagery.
- Complex scene understanding required in autonomous vehicles.
- Detailed object recognition for tasks like quality control in manufacturing.
EfficientDet: Scalable and Efficient Object Detection
EfficientDet was introduced in 2019 by a team at Google Research. It proposed a new family of scalable object detectors that prioritized efficiency without sacrificing accuracy. The model's architecture is based on the highly efficient EfficientNet backbone, a novel Bi-directional Feature Pyramid Network (BiFPN) for feature fusion, and a compound scaling method that uniformly scales the resolution, depth, and width for all parts of the model.
Technical Details:
- Authors: Mingxing Tan, Ruoming Pang, Quoc V. Le
- Organization: Google
- Date: 2019-11-20
- Arxiv: https://arxiv.org/abs/1911.09070
- GitHub: https://github.com/google/automl/tree/master/efficientdet
Strengths
- Compound Scaling: The key innovation of EfficientDet is its systematic approach to scaling, allowing it to create a family of models (D0-D7) that cater to different resource constraints.
- BiFPN: The Bi-directional Feature Pyramid Network allows for richer multi-scale feature fusion compared to traditional FPNs, improving detection accuracy.
- Historical Significance: At the time of its release, EfficientDet set a new standard for efficiency in object detection, influencing many subsequent architectures.
Weaknesses
- Outdated Performance: While groundbreaking for its time, EfficientDet has been surpassed in both accuracy and speed by newer models like YOLOv9. As shown in the performance table, YOLOv9 models consistently achieve higher mAP with fewer parameters and significantly faster inference speeds.
- Slower Inference: On modern hardware like the NVIDIA T4, even the smallest EfficientDet models are slower than comparable or more accurate YOLOv9 variants.
- Limited Ecosystem: EfficientDet is primarily a research repository. It lacks the comprehensive, user-friendly ecosystem provided by Ultralytics, which includes streamlined training, deployment, and community support.
- Task-Specific: EfficientDet is designed solely for object detection and does not offer the built-in versatility for other tasks like instance segmentation or pose estimation found in the Ultralytics framework.
Use Cases
EfficientDet can still be considered for legacy systems or as a baseline for academic comparison. Its applications include:
- General-purpose object detection where high-speed inference is not the primary constraint.
- Educational purposes for understanding feature pyramid networks and model scaling principles.
- Projects that have been standardized on the TensorFlow framework, where the original implementation resides.
Performance Analysis: YOLOv9 vs. EfficientDet
The performance comparison between YOLOv9 and EfficientDet clearly demonstrates the advancements made in object detection over the last few years. YOLOv9 consistently offers a superior trade-off between accuracy, speed, and model size.
Model | size (pixels) |
mAPval 50-95 |
Speed CPU ONNX (ms) |
Speed T4 TensorRT (ms) |
params (M) |
FLOPs (B) |
---|---|---|---|---|---|---|
YOLOv9t | 640 | 38.3 | - | 2.30 | 2.0 | 7.7 |
YOLOv9s | 640 | 46.8 | - | 3.54 | 7.1 | 26.4 |
YOLOv9m | 640 | 51.4 | - | 6.43 | 20.0 | 76.3 |
YOLOv9c | 640 | 53.0 | - | 7.16 | 25.3 | 102.1 |
YOLOv9e | 640 | 55.6 | - | 16.77 | 57.3 | 189.0 |
EfficientDet-d0 | 640 | 34.6 | 10.2 | 3.92 | 3.9 | 2.54 |
EfficientDet-d1 | 640 | 40.5 | 13.5 | 7.31 | 6.6 | 6.1 |
EfficientDet-d2 | 640 | 43.0 | 17.7 | 10.92 | 8.1 | 11.0 |
EfficientDet-d3 | 640 | 47.5 | 28.0 | 19.59 | 12.0 | 24.9 |
EfficientDet-d4 | 640 | 49.7 | 42.8 | 33.55 | 20.7 | 55.2 |
EfficientDet-d5 | 640 | 51.5 | 72.5 | 67.86 | 33.7 | 130.0 |
EfficientDet-d6 | 640 | 52.6 | 92.8 | 89.29 | 51.9 | 226.0 |
EfficientDet-d7 | 640 | 53.7 | 122.0 | 128.07 | 51.9 | 325.0 |
From the table, several key insights emerge:
- Accuracy vs. Efficiency: The YOLOv9c model achieves a 53.0 mAP with only 25.3M parameters and a blazing-fast inference time of 7.16 ms on a T4 GPU. In contrast, the similarly accurate EfficientDet-d6 (52.6 mAP) requires more than double the parameters (51.9M) and is over 12 times slower at 89.29 ms.
- Top-Tier Performance: The largest model, YOLOv9e, reaches an impressive 55.6 mAP, surpassing even the largest EfficientDet-d7 model (53.7 mAP) while being over 7 times faster and requiring significantly fewer FLOPs.
- Lightweight Models: At the smaller end, YOLOv9s (46.8 mAP) offers comparable accuracy to EfficientDet-d3 (47.5 mAP) but with nearly half the parameters and is over 5 times faster on a GPU.
Conclusion: Which Model Should You Choose?
For nearly all modern object detection applications, YOLOv9 is the clear winner. Its advanced architecture delivers state-of-the-art accuracy while maintaining exceptional inference speed and parameter efficiency. The integration into the Ultralytics ecosystem further enhances its value, providing a streamlined workflow from training to deployment, backed by robust documentation and an active community.
EfficientDet remains an important model from a historical and academic perspective, pioneering concepts in model scaling and feature fusion. However, for practical development and deployment, its performance has been eclipsed by newer, more efficient architectures like YOLOv9. If you are starting a new project or looking to upgrade an existing one, choosing YOLOv9 will provide superior performance, faster development cycles, and better support for future advancements.
Explore Other Models
If you are exploring other state-of-the-art models, consider looking at comparisons with YOLOv10, YOLOv8, and transformer-based architectures like RT-DETR. You can find more detailed analyses on our model comparison page.