Skip to content

YOLOv7 vs YOLO11: A Detailed Technical Comparison

Selecting the optimal object detection model requires a deep understanding of the specific capabilities and trade-offs of different architectures. This page provides a comprehensive technical comparison between YOLOv7 and Ultralytics YOLO11, two powerful models in the YOLO lineage. We will delve into their architectural differences, performance benchmarks, and ideal use cases to help you choose the best fit for your computer vision projects.

YOLOv7: Efficient and Accurate Object Detection

YOLOv7 was introduced as a significant advancement in real-time object detection, focusing on optimizing training efficiency and accuracy without increasing inference costs. It set a new state-of-the-art for real-time detectors upon its release.

Authors: Chien-Yao Wang, Alexey Bochkovskiy, and Hong-Yuan Mark Liao
Organization: Institute of Information Science, Academia Sinica, Taiwan
Date: 2022-07-06
Arxiv: https://arxiv.org/abs/2207.02696
GitHub: https://github.com/WongKinYiu/yolov7
Docs: https://docs.ultralytics.com/models/yolov7/

Architecture and Key Features

YOLOv7 builds upon previous YOLO architectures by introducing several key innovations. It employs techniques like Extended Efficient Layer Aggregation Networks (E-ELAN) in the backbone to improve feature extraction and learning. A major contribution is the concept of "trainable bag-of-freebies," which involves optimization strategies applied during training—like using an auxiliary detection head and coarse-to-fine guidance—to boost final model accuracy without adding computational overhead during inference. While primarily focused on object detection, the official repository shows community extensions for tasks like pose estimation and instance segmentation.

Performance and Use Cases

YOLOv7 demonstrated state-of-the-art performance upon release, offering a compelling balance between speed and accuracy. For instance, the YOLOv7x model achieves 53.1% mAPtest on the MS COCO dataset at a 640 image size. Its efficiency makes it suitable for real-time applications like advanced security systems and autonomous systems requiring rapid, accurate detection.

Strengths

  • High Accuracy and Speed Balance: Offers a strong combination of mAP and inference speed for real-time tasks on GPU.
  • Efficient Training: Utilizes advanced training techniques ("bag-of-freebies") to improve accuracy without increasing inference cost.
  • Established Performance: Proven results on standard benchmarks like MS COCO.

Weaknesses

  • Complexity: The architecture and training techniques can be complex to fully grasp and optimize.
  • Resource Intensive: Larger YOLOv7 models require significant GPU resources for training.
  • Limited Task Versatility: Primarily focused on object detection, requiring separate implementations for other tasks like segmentation or classification, unlike integrated models such as YOLO11.
  • Less Maintained: The framework is not as actively developed or maintained as the Ultralytics ecosystem, leading to fewer updates and less community support.

Learn more about YOLOv7

Ultralytics YOLO11: State-of-the-Art Efficiency and Versatility

Ultralytics YOLO11 represents the latest evolution in the YOLO series from Ultralytics, designed for superior accuracy, enhanced efficiency, and broader task versatility within a user-friendly framework. It builds on the successes of its predecessors like YOLOv8 to deliver a state-of-the-art experience.

Authors: Glenn Jocher and Jing Qiu
Organization: Ultralytics
Date: 2024-09-27
GitHub: https://github.com/ultralytics/ultralytics
Docs: https://docs.ultralytics.com/models/yolo11/

Architecture and Key Features

YOLO11's architecture incorporates advanced feature extraction techniques and a streamlined network design, resulting in higher accuracy often with a reduced parameter count compared to predecessors. This optimization leads to faster inference speeds and lower computational demands, which is crucial for deployment across diverse platforms, from edge devices to cloud infrastructure.

A key advantage of YOLO11 is its versatility. It natively supports multiple computer vision tasks, including object detection, instance segmentation, image classification, pose estimation, and oriented bounding boxes (OBB). It integrates seamlessly into the Ultralytics ecosystem, offering a streamlined user experience via simple Python and CLI interfaces, extensive documentation, and readily available pre-trained weights for efficient training.

Performance and Use Cases

YOLO11 demonstrates impressive mean Average Precision (mAP) scores across different model sizes, achieving a favorable trade-off between speed and accuracy. For instance, YOLO11m achieves a mAPval of 51.5 at a 640 image size with significantly fewer parameters than YOLOv7l. Smaller variants like YOLO11n offer exceptionally fast inference, while larger models like YOLO11x maximize accuracy. Notably, YOLO11 models often exhibit lower memory usage during training and inference compared to other architectures.

The enhanced precision and efficiency of YOLO11 make it ideal for applications requiring accurate, real-time processing:

Strengths

  • State-of-the-Art Performance: High mAP scores with an optimized, anchor-free architecture.
  • Efficient Inference: Excellent speed, especially on CPU, suitable for real-time needs.
  • Versatile Task Support: Natively handles detection, segmentation, classification, pose, and OBB in a single framework.
  • Ease of Use: Simple API, extensive documentation, and integrated Ultralytics HUB support for no-code training and deployment.
  • Well-Maintained Ecosystem: Active development, strong community, frequent updates, and efficient training processes.
  • Scalability: Performs effectively across hardware, from edge to cloud, with lower memory requirements.

Weaknesses

  • As a newer model, some specific third-party tool integrations might still be evolving compared to older, more established models.
  • Larger models can demand significant computational resources for training, though they remain highly efficient for their performance class.

Learn more about YOLO11

Performance Comparison: YOLOv7 vs. YOLO11

The following table provides a detailed performance comparison between YOLOv7 and YOLO11 models on the COCO dataset. YOLO11 models demonstrate a superior balance of accuracy, speed, and efficiency. For example, YOLO11l achieves a higher mAP than YOLOv7x with less than half the parameters and FLOPs, and it is significantly faster on GPU. Similarly, YOLO11m matches the accuracy of YOLOv7l with about half the parameters and computational cost. The smallest model, YOLO11n, provides remarkable speed on both CPU and GPU with minimal resource usage, making it ideal for edge applications.

Model size
(pixels)
mAPval
50-95
Speed
CPU ONNX
(ms)
Speed
T4 TensorRT10
(ms)
params
(M)
FLOPs
(B)
YOLOv7l 640 51.4 - 6.84 36.9 104.7
YOLOv7x 640 53.1 - 11.57 71.3 189.9
YOLO11n 640 39.5 56.1 1.5 2.6 6.5
YOLO11s 640 47.0 90.0 2.5 9.4 21.5
YOLO11m 640 51.5 183.2 4.7 20.1 68.0
YOLO11l 640 53.4 238.6 6.2 25.3 86.9
YOLO11x 640 54.7 462.8 11.3 56.9 194.9

Conclusion: Which Model Should You Choose?

While YOLOv7 was a powerful model for its time and still offers strong performance for real-time object detection, Ultralytics YOLO11 represents a significant leap forward. YOLO11 not only surpasses YOLOv7 in key performance metrics but also offers a far more versatile, user-friendly, and well-supported framework.

For developers and researchers seeking a modern, all-in-one solution, YOLO11 is the clear choice. Its advantages include:

  • Superior Performance Balance: YOLO11 provides a better trade-off between accuracy, speed, and computational cost.
  • Multi-Task Versatility: The native support for detection, segmentation, classification, pose, and OBB eliminates the need for multiple models and simplifies development workflows.
  • Ease of Use: The streamlined API, comprehensive documentation, and simple training procedures make it accessible to both beginners and experts.
  • Active Development: As part of the Ultralytics ecosystem, YOLO11 benefits from continuous updates, a strong open-source community, and integration with tools like Ultralytics HUB for seamless MLOps.

In summary, if your priority is leveraging the latest advancements in AI for a wide range of applications with a focus on ease of deployment and future-proofing, Ultralytics YOLO11 is the recommended model.

Explore Other Models

For further exploration, consider these comparisons involving YOLOv7, YOLO11, and other relevant models in the Ultralytics documentation:



📅 Created 1 year ago ✏️ Updated 1 month ago

Comments