Link to this sectionSemantic Segmentation#

Semantic segmentation assigns a class label to every pixel in an image, producing a dense class map that covers the entire scene. Unlike instance segmentation, which separates individual objects, semantic segmentation groups all pixels of the same class together regardless of how many distinct objects are present.

Watch: Semantic Segmentation with Ultralytics YOLO26 | Quickstart Tutorial

The output of a semantic segmentation model is a single height-by-width class map where each pixel value corresponds to a predicted class ID. This makes semantic segmentation ideal for scene parsing tasks such as autonomous driving, medical imaging, and land-cover mapping.

Tip

Use task=semantic or the yolo semantic CLI task for semantic segmentation. YOLO26 semantic segmentation model files use the -sem suffix, such as yolo26n-sem.pt.

Link to this section Models #

YOLO26 semantic segmentation models pretrained on the Cityscapes dataset are shown below.

Models download automatically from the latest Ultralytics release on first use.

Model	size ^(pixels)	mIoU^val	Speed ^{RTX3090 PyTorch (ms)}	params ^(M)	FLOPs ^(B)
YOLO26n-sem	1024 × 2048	78.3	4.4 ± 0.0	1.6	22.7
YOLO26s-sem	1024 × 2048	80.8	8.4 ± 0.0	6.5	88.8
YOLO26m-sem	1024 × 2048	82.0	19.9 ± 0.1	14.3	304.5
YOLO26l-sem	1024 × 2048	82.9	26.5 ± 0.1	17.9	384.7
YOLO26x-sem	1024 × 2048	83.6	48.9 ± 0.2	40.2	861.7

mIoU^val values are for single-model single-scale on the Cityscapes validation set.
Reproduce with yolo semantic val data=cityscapes.yaml device=0 imgsz=2048
Speed metrics are averaged over Cityscapes validation images using an RTX3090 instance.
Reproduce with yolo semantic val data=cityscapes.yaml batch=1 device=0|cpu imgsz=2048
Params and FLOPs values are for the fused model after model.fuse(), which merges Conv and BatchNorm layers. Pretrained checkpoints retain the full training architecture and may show higher counts.

YOLO26 semantic segmentation models pretrained on the ADE20K dataset are shown below.

Models download automatically from the latest Ultralytics release on first use.

Model	size ^(pixels)	mIoU^val	Speed ^{RTX3090 PyTorch (ms)}	params ^(M)	FLOPs ^(B)
YOLO26n-sem-ade20k	640	38.8	3.9 ± 0.2	1.6	4.4
YOLO26s-sem-ade20k	640	45.6	4.2 ± 0.3	6.5	17.4
YOLO26m-sem-ade20k	640	47.4	4.7 ± 0.3	14.3	59.5
YOLO26l-sem-ade20k	640	49.7	8.3 ± 0.2	17.9	75.0
YOLO26x-sem-ade20k	640	51.5	9.9 ± 0.3	40.2	168.1

mIoU^val values are for single-model single-scale on the ADE20K validation set.
Reproduce with yolo semantic val model=yolo26n-sem-ade20k.pt data=ade20k.yaml device=0 imgsz=640, replacing yolo26n-sem-ade20k.pt with the desired yolo26*-sem-ade20k.pt checkpoint.
Speed metrics are averaged over ADE20K validation images using an RTX3090 instance.
Reproduce with yolo semantic val model=yolo26n-sem-ade20k.pt data=ade20k.yaml batch=1 device=0|cpu imgsz=640, replacing yolo26n-sem-ade20k.pt with the desired yolo26*-sem-ade20k.pt checkpoint.
Params and FLOPs values are for the fused model after model.fuse(), which merges Conv and BatchNorm layers. Pretrained checkpoints retain the full training architecture and may show higher counts.

Link to this sectionTrain#

Train YOLO26n-sem on the Cityscapes8 dataset for 100 epochs at image size 1024. For a full list of available arguments see the Configuration page.

Example

from ultralytics import YOLO

# Load a model
model = YOLO("yolo26n-sem.yaml")  # build a new model from YAML
model = YOLO("yolo26n-sem.pt")  # load a pretrained model (recommended for training)
model = YOLO("yolo26n-sem.yaml").load("yolo26n-sem.pt")  # build from YAML and transfer weights

# Train the model
results = model.train(data="cityscapes8.yaml", epochs=100, imgsz=1024)

See full train mode details in the Train page. Semantic segmentation models can also be trained with Ultralytics Platform cloud training.

Link to this sectionDataset format#

Semantic segmentation datasets use single-channel mask images, typically PNG, where each pixel value represents a class ID. Pixels with value 255 are treated as "ignore" and excluded from loss computation. The dataset YAML should specify paths to images and their corresponding mask directories. See the Semantic Segmentation Dataset Guide for format details. Supported datasets include Cityscapes and ADE20K. You can manage and label semantic datasets with Ultralytics Platform annotation.

Link to this sectionVal#

Validate trained YOLO26n-sem model accuracy on a semantic segmentation dataset. Pass data explicitly so validation uses the intended dataset YAML.

Example

from ultralytics import YOLO

# Load a model
model = YOLO("yolo26n-sem.pt")  # load an official model
model = YOLO("path/to/best.pt")  # load a custom model

# Validate the model
metrics = model.val(data="cityscapes.yaml")
metrics.miou  # mean Intersection over Union
metrics.pixel_accuracy  # overall pixel accuracy

Link to this sectionPredict#

Use a trained YOLO26n-sem model to run predictions on images.

Example

from ultralytics import YOLO

# Load a model
model = YOLO("yolo26n-sem.pt")  # load an official model
model = YOLO("path/to/best.pt")  # load a custom model

# Predict with the model
results = model("https://ultralytics.com/images/bus.jpg")  # predict on an image

# Access the results
for result in results:
    semantic_mask = result.semantic_mask.data  # class map, shape (H,W), integer dtype selected by class count

See full predict mode details in the Predict page.

Link to this sectionResults Output#

YOLO semantic segmentation returns one Results object per image. Each result stores one dense class map for the full image instead of a list of object masks. Pixels with the same predicted class share the same class ID, even when they belong to separate objects.

Attribute	Type	Shape	Description
`result.semantic_mask`	`SemanticMask`	`(H,W)`	Dense class map.
`result.semantic_mask.data`	`torch.uint8` `torch.int16` `torch.int32`	`(H,W)`	Class IDs; dtype selected by class count.
`result.masks`	-	-	No instance masks.
`result.boxes`	-	-	No instance boxes/confidences.
`result.masks.xy`	-	-	No default polygons.

For task-specific Results fields across every task, see the Predict Results by Task section.

Mask boundary quality

Semantic segmentation predicts a dense class map, then resizes that map back to the image shape for visualization and downstream use. Very thin structures, such as lane markings, court lines, poles, or wires, can therefore look stair-stepped when inference runs at a much lower imgsz than the original image resolution. If boundaries appear jagged, first retest the native PyTorch .pt model with a larger imgsz, such as 1024, 1280, or the closest practical value to the source image size. Use exported models only after confirming the .pt output is acceptable, since lower-resolution inputs cannot recover fine detail that was not present in the predicted class map.

Link to this sectionInstance vs Semantic Segmentation#

Aspect	Instance Segmentation (`task="segment"`)	Semantic Segmentation (`task="semantic"`)
Prediction goal	Segment each detected object separately	Assign one class ID to every pixel
Output field	`result.masks`	`result.semantic_mask`
Main data	`result.masks.data`	`result.semantic_mask.data`
Shape	`(N,H,W)`	`(H,W)`
Pixel values	Binary mask values: `0` or `1`	Class IDs: `0`, `1`, `2`, ...
Dtype	`torch.uint8`	`torch.uint8` `torch.int16` `torch.int32`
Same-class objects	Kept as separate instances	Merged into the same class region
Polygons	Yes, through `result.masks.xy` and `result.masks.xyn`	No polygon output by default
Boxes and confidence	Yes, through `result.boxes`	No per-instance boxes or confidence scores
Typical use	Counting, tracking, cropping, object-level measurement	Dense scene labeling, drivable area, land cover, medical regions

Link to this sectionExport#

Export a YOLO26n-sem model to a different format like ONNX, CoreML, etc.

Example

from ultralytics import YOLO

# Load a model
model = YOLO("yolo26n-sem.pt")  # load an official model
model = YOLO("path/to/best.pt")  # load a custom model

# Export the model
model.export(format="onnx")

Available YOLO26 semantic segmentation export formats are in the table below. You can export to any format using the format argument, i.e., format='onnx' or format='engine'. You can predict or validate directly on exported models, i.e., yolo predict model=yolo26n-sem.onnx. Usage examples are shown for your model after export completes.

Format	`format` Argument	Model	Metadata	Arguments
PyTorch	-	`yolo26n-sem.pt`	✅	-
TorchScript	`torchscript`	`yolo26n-sem.torchscript`	✅	`imgsz`, `quantize`, `dynamic`, `nms`, `batch`, `device`
ONNX	`onnx`	`yolo26n-sem.onnx`	✅	`imgsz`, `quantize`, `dynamic`, `simplify`, `opset`, `nms`, `batch`, `data`, `fraction`, `device`
OpenVINO	`openvino`	`yolo26n-sem_openvino_model/`	✅	`imgsz`, `quantize`, `dynamic`, `nms`, `batch`, `data`, `fraction`, `device`
TensorRT	`engine`	`yolo26n-sem.engine`	✅	`imgsz`, `quantize`, `dynamic`, `simplify`, `workspace`, `nms`, `batch`, `data`, `fraction`, `device`
CoreML	`coreml`	`yolo26n-sem.mlpackage`	✅	`imgsz`, `dynamic`, `quantize`, `nms`, `batch`, `device`
TF SavedModel	`saved_model`	`yolo26n-sem_saved_model/`	✅	`imgsz`, `keras`, `quantize`, `nms`, `batch`, `data`, `fraction`, `device`
TF GraphDef	`pb`	`yolo26n-sem.pb`	❌	`imgsz`, `batch`, `device`
TF Edge TPU	`edgetpu`	`yolo26n-sem_edgetpu.tflite`	✅	`imgsz`, `quantize`, `data`, `fraction`, `device`
PaddlePaddle	`paddle`	`yolo26n-sem_paddle_model/`	✅	`imgsz`, `batch`, `device`
MNN	`mnn`	`yolo26n-sem.mnn`	✅	`imgsz`, `batch`, `dynamic`, `quantize`, `nms`, `device`
NCNN	`ncnn`	`yolo26n-sem_ncnn_model/`	✅	`imgsz`, `quantize`, `batch`, `device`
IMX500	`imx`	`yolo26n-sem_imx_model/`	✅	`imgsz`, `quantize`, `data`, `fraction`, `nms`, `device`
RKNN	`rknn`	`yolo26n-sem_rknn_model/`	✅	`imgsz`, `batch`, `name`, `quantize`, `data`, `fraction`, `device`
ExecuTorch	`executorch`	`yolo26n-sem_executorch_model/`	✅	`imgsz`, `batch`, `device`
Axelera	`axelera`	`yolo26n-sem_axelera_model/`	✅	`imgsz`, `batch`, `quantize`, `data`, `fraction`, `device`
DEEPX	`deepx`	`yolo26n-sem_deepx_model/`	✅	`imgsz`, `quantize`, `data`, `optimize`, `device`
Qualcomm QNN	`qnn`	`yolo26n-sem_qnn.onnx`	✅	`imgsz`, `batch`, `name`, `quantize`, `data`, `fraction`, `device`
LiteRT	`litert`	`yolo26n-sem.tflite`	✅	`imgsz`, `quantize`, `batch`, `data`, `fraction`, `device`
Hailo	`hailo`	`yolo26n-sem_hailo_model/`	✅	`imgsz`, `name`, `quantize`, `data`, `fraction`, `opset`, `simplify`, `conf`, `iou`

See full export details in the Export page.

Link to this sectionFAQ#

Link to this sectionHow do I train a YOLO26 semantic segmentation model on a custom dataset?#

To train a YOLO26 semantic segmentation model on a custom dataset, you need to prepare PNG mask images where each pixel value represents a class ID (0, 1, 2, ...) and pixels with value 255 are ignored during training. Create a dataset YAML file pointing to your images and masks directories, then train the model:

Example

from ultralytics import YOLO

# Load a pretrained YOLO26 semantic segmentation model
model = YOLO("yolo26n-sem.pt")

# Train the model
results = model.train(data="path/to/your_dataset.yaml", epochs=100, imgsz=512)

Check the Configuration page for more available arguments.

Link to this sectionWhat is the difference between instance segmentation and semantic segmentation?#

Instance segmentation and semantic segmentation are both pixel-level tasks but differ in a key way:

Semantic segmentation assigns a class label to every pixel but does not distinguish between individual objects of the same class. For example, all cars in a scene share the same class label.
Instance segmentation identifies each individual object separately, producing distinct masks for each object even if they belong to the same class.

Semantic segmentation is best suited for scene understanding tasks like autonomous driving and land-cover mapping, while instance segmentation is preferred when counting or tracking individual objects matters.

Link to this sectionCan I use instance segmentation data to train semantic segmentation?#

Yes. If your dataset uses Ultralytics YOLO polygon labels (one .txt per image), omit masks_dir from the dataset YAML, and make sure no masks/ folder exists next to your images at the dataset root (its presence alone triggers PNG-mask mode even without masks_dir set). The loader then converts polygons to per-image semantic masks on the fly. For multi-class datasets (N > 1) an extra background class is appended to names automatically. For single-class datasets (N == 1) training stays at 1 class — your declared class becomes 1 in the mask and uncovered pixels become 0. See the Semantic Segmentation Dataset Guide for details.

Link to this sectionWhat datasets are supported for semantic segmentation?#

Ultralytics YOLO26 provides built-in configurations for several semantic segmentation datasets:

Cityscapes: Urban street scenes with 19 classes, widely used for autonomous driving research.
ADE20K: A large-scale scene parsing dataset with 150 classes.

You can also use any custom dataset that provides PNG mask annotations where pixel values correspond to class IDs.

Link to this sectionHow do I validate a pretrained YOLO26 semantic segmentation model?#

Validate a pretrained YOLO26 semantic segmentation model with the dataset YAML used for evaluation:

Example

from ultralytics import YOLO

# Load a pretrained model
model = YOLO("yolo26n-sem.pt")

# Validate the model
metrics = model.val(data="cityscapes.yaml")
print("Mean IoU:", metrics.miou)
print("Pixel Accuracy:", metrics.pixel_accuracy)

These steps will provide you with validation metrics like mean Intersection over Union (mIoU) and pixel accuracy, which are standard measures for assessing semantic segmentation performance.

Link to this sectionHow can I export a YOLO26 semantic segmentation model to ONNX format?#

Export a YOLO26 semantic segmentation model to ONNX format with Python or CLI commands:

Example

from ultralytics import YOLO

# Load a pretrained model
model = YOLO("yolo26n-sem.pt")

# Export the model to ONNX format
model.export(format="onnx")

For more details on exporting to various formats, refer to the Export page.

Contributors

GLglenn-jocher² RAraimbekovm² LALaughing-q² OAoaslananka¹ LMlmycross¹ RIRizwanMunawar¹

Created 2 months agoUpdated 2 hours ago

Link to this sectionSemantic Segmentation#

Link to this sectionModels#

Link to this sectionTrain#

Link to this sectionDataset format#

Link to this sectionVal#

Link to this sectionPredict#

Link to this sectionResults Output#

Link to this sectionInstance vs Semantic Segmentation#

Link to this sectionExport#

Link to this sectionFAQ#

Link to this sectionHow do I train a YOLO26 semantic segmentation model on a custom dataset?#

Link to this sectionWhat is the difference between instance segmentation and semantic segmentation?#

Link to this sectionCan I use instance segmentation data to train semantic segmentation?#

Link to this sectionWhat datasets are supported for semantic segmentation?#

Link to this sectionHow do I validate a pretrained YOLO26 semantic segmentation model?#

Link to this sectionHow can I export a YOLO26 semantic segmentation model to ONNX format?#

Comments

Link to this section Models #