Link to this sectionModel Training#

Q: How long does training take?

Training time depends on: Dataset size (number of images); Model size (n, s, m, l, x); Number of epochs; GPU type selected. A typical training run with 1000 images, YOLO26n, 100 epochs on RTX PRO 6000 takes about 5-10 minutes. Smaller runs (500 images, 50 epochs on RTX 4090) complete in under an hour. See cost examples for detailed estimates.

Q: How do I choose the right GPU?

| Scenario | Recommended GPU | | ----------------------------- | ---------------- | | Most training jobs | RTX PRO 6000 | | Large datasets or batch sizes | H100 SXM or H200 | | Budget-conscious | RTX 4090 |

Ultralytics Platform provides comprehensive tools for training YOLO models, from organizing experiments to running cloud training jobs with real-time metrics streaming.

Watch: Get Started with Ultralytics Platform - Train

Link to this sectionOverview#

The Training section helps you:

Organize models into projects for easier management
Train on cloud GPUs with a single click
Monitor real-time metrics during training
Compare model performance across experiments
Export to 19+ deployment formats (see supported formats)

Ultralytics Platform Train Overview

Link to this sectionWorkflow#

graph LR
    A[📁 Project]:::start --> B[⚙️ Configure]:::proc
    B --> C[🚀 Train]:::proc
    C --> D[📈 Monitor]:::proc
    D --> E[📦 Export]:::out

    classDef start fill:#4CAF50,color:#fff
    classDef proc fill:#2196F3,color:#fff
    classDef out fill:#9C27B0,color:#fff

Stage	Description
Project	Create a workspace to organize related models
Configure	Select dataset, base model, and training parameters
Train	Run on cloud GPUs or your local hardware
Monitor	View real-time loss curves and metrics
Export	Convert to 19+ deployment formats (details)

Link to this sectionTraining Options#

Ultralytics Platform supports multiple training approaches:

Method	Description	Best For
Cloud Training	Train on Ultralytics Cloud GPUs	No local GPU, scalability
Local Training	Train locally, stream metrics to the platform	Existing hardware, privacy
Colab Training	Use Google Colab with platform integration	Free GPU access

Link to this sectionGPU Options#

Available GPUs for cloud training on Ultralytics Cloud:

GPU	Generation	VRAM	Cost/Hour	Best For
RTX 2000 Ada	Ada	16 GB	$0.24	Small datasets, testing
RTX A4500	Ampere	20 GB	$0.25	Small-medium datasets
RTX 4000 Ada	Ada	20 GB	$0.26	Medium datasets
RTX A5000	Ampere	24 GB	$0.27	Medium datasets
L4	Ada	24 GB	$0.39	Inference optimized
A40	Ampere	48 GB	$0.44	Larger batch sizes
RTX 3090	Ampere	24 GB	$0.46	General training
RTX A6000	Ampere	48 GB	$0.49	Large models
RTX PRO 4000	Blackwell	24 GB	$0.57	Budget Blackwell
RTX PRO 4500	Blackwell	32 GB	$0.64	Great price/performance
RTX 4090	Ada	24 GB	$0.69	Best price/performance
RTX 6000 Ada	Ada	48 GB	$0.77	Large batch training
L40S	Ada	48 GB	$0.86	Large batch training
RTX PRO 5000	Blackwell	48 GB	$0.96	Large batch training
RTX 5090	Blackwell	32 GB	$0.99	Latest consumer generation
L40	Ada	48 GB	$0.99	Large models
A100 PCIe	Ampere	80 GB	$1.39	Production training
A100 SXM	Ampere	80 GB	$1.49	Production training
RTX PRO 6000	Blackwell	96 GB	$2.09	Recommended default
H100 PCIe	Hopper	80 GB	$2.89	High-performance training
H100 NVL	Hopper	94 GB	$3.19	Maximum performance
H100 SXM	Hopper	80 GB	$3.29	Fastest training
H200 NVL	Hopper	143 GB	$3.39	Maximum memory
H200 SXM	Hopper	141 GB	$4.39	Maximum performance
B200	Blackwell	180 GB	$5.89	Large models (Pro+)
B300	Blackwell	288 GB	$7.39	Largest models (Pro+)

GPU Tier Access

B200 and B300 GPUs require a Pro or Enterprise plan. All other GPUs are available on all plans including Free.

Signup Credits

New accounts receive signup credits for training. Check Billing for details.

Link to this sectionReal-Time Metrics#

During training, view live metrics across three subtabs:

graph LR
    A[Charts]:::start --> B[Loss Curves]:::out
    A --> C[Performance Metrics]:::out
    D[Console]:::start --> E[Live Logs]:::out
    D --> F[Error Detection]:::out
    G[System]:::start --> H[GPU Utilization]:::out
    G --> I[Memory & Temp]:::out

    classDef start fill:#4CAF50,color:#fff
    classDef out fill:#9C27B0,color:#fff

Subtab	Metrics
Charts	Box/class/DFL loss, mAP50, mAP50-95, precision, recall
Console	Live training logs with ANSI color and error detection
System	GPU utilization, memory, temperature, CPU, disk

Automatic Checkpoints

For cloud training, the best model (best.pt, the highest-mAP checkpoint) is saved automatically and made available for download, export, and deployment after training completes.

Link to this sectionQuick Start#

Get started with cloud training in under a minute:

Create a project in the sidebar
Click New Model
Select a model, dataset, and GPU
Click Start Training

Link to this sectionQuick Links#

Projects: Organize your models and experiments
Models: Manage trained checkpoints
Cloud Training: Train on cloud GPUs

Link to this sectionFAQ#

Link to this sectionHow long does training take?#

Training time depends on:

Dataset size (number of images)
Model size (n, s, m, l, x)
Number of epochs
GPU type selected

A typical training run with 1000 images, YOLO26n, 100 epochs on RTX PRO 6000 takes about 5-10 minutes. Smaller runs (500 images, 50 epochs on RTX 4090) complete in under an hour. See cost examples for detailed estimates.

Link to this sectionCan I train multiple models simultaneously?#

Yes. Concurrent cloud training limits depend on your plan: Free allows 3, Pro allows 10, and Enterprise is unlimited. For additional parallel training, use remote training from multiple machines.

Link to this sectionWhat happens if training fails?#

If training fails:

The model is marked failed and the compute instance is terminated
You can start a new training run from the base model
Credits are only charged for completed compute time

Link to this sectionHow do I choose the right GPU?#

Scenario	Recommended GPU
Most training jobs	RTX PRO 6000
Large datasets or batch sizes	H100 SXM or H200
Budget-conscious	RTX 4090

Contributors

GLglenn-jocher¹⁶ LAlaodouya¹ RIRizwanMunawar¹ SEsergiuwaxmann¹

Created Jan 14, 2026Updated 2 days ago