Link to this sectionYOLO26 Model Export to TorchScript for Quick Deployment#

PyTorch is retiring TorchScript

PyTorch has deprecated TorchScript and is gradually removing its features. For new mobile and edge deployments, use the supported ExecuTorch integration. Ultralytics retains regular TorchScript export for legacy C++ compatibility.

Deploying computer vision models in C++ environments without Python requires a portable serialized representation. TorchScript provides that compatibility for legacy LibTorch applications.

Export to TorchScript to serialize your Ultralytics YOLO26 models for cross-platform compatibility and streamlined deployment. In this guide, we'll show you how to export your YOLO26 models to the TorchScript format, making it easier for you to use them across a wider range of applications.

Link to this sectionWhy should you export to TorchScript?#

TorchScript model serialization and deployment workflow overview

Developed by the creators of PyTorch, TorchScript is a powerful tool for optimizing and deploying PyTorch models across a variety of platforms. Exporting YOLO26 models to TorchScript is crucial for moving from research to real-world applications. TorchScript, part of the PyTorch framework, helps make this transition smoother by allowing PyTorch models to be used in environments that don't support Python.

The process involves two techniques: tracing and scripting. Tracing records operations during model execution, while scripting allows for the definition of models using a subset of Python. These techniques ensure that models like YOLO26 can still work their magic even outside their usual Python environment.

TorchScript scripting vs tracing comparison

TorchScript models can also be optimized through techniques such as operator fusion and refinements in memory usage, ensuring efficient execution. Another advantage of exporting to TorchScript is its potential to accelerate model execution across various hardware platforms. It creates a standalone, production-ready representation of your PyTorch model that can be integrated into C++ environments.

Link to this sectionKey Features of TorchScript Models#

TorchScript, a key part of the PyTorch ecosystem, provides powerful features for optimizing and deploying deep learning models.

TorchScript key features overview

Here are the key features that make TorchScript a valuable tool for developers:

Static Graph Execution: TorchScript uses a static graph representation of the model's computation, which is different from PyTorch's dynamic graph execution. In static graph execution, the computational graph is defined and compiled once before the actual execution, resulting in improved performance during inference.
Model Serialization: TorchScript allows you to serialize PyTorch models into a platform-independent format. Serialized models can be loaded without requiring the original Python code, enabling deployment in different runtime environments.
JIT Compilation: TorchScript uses Just-In-Time (JIT) compilation to convert PyTorch models into an optimized intermediate representation. JIT compiles the model's computational graph, enabling efficient execution on target devices.
Gradual Conversion: TorchScript provides a gradual conversion approach, allowing you to incrementally convert parts of your PyTorch model into TorchScript. This flexibility is particularly useful when dealing with complex models or when you want to optimize specific portions of the code.

Link to this sectionDeployment Options in TorchScript#

Before we look at the code for exporting YOLO26 models to the TorchScript format, let's understand where TorchScript models are normally used.

TorchScript offers various deployment options for machine learning models, such as:

C++ API: The most common use case for TorchScript is its LibTorch C++ API, which allows you to load and execute optimized TorchScript models directly within C++ applications. This is ideal for production environments where Python may not be suitable or available. The C++ API offers low-overhead and efficient execution of TorchScript models, maximizing performance potential.
Mobile Deployment: For low-latency, offline inference and data privacy on mobile devices, use ExecuTorch, PyTorch's replacement for TorchScript Mobile.
Cloud Deployment: TorchScript models can be deployed to cloud-based servers using solutions like TorchServe. It provides features like model versioning, batching, and metrics monitoring for scalable deployment in production environments. Cloud deployment with TorchScript can make your models accessible via APIs or other web services.

Link to this sectionExport to TorchScript: Converting Your YOLO26 Model#

Exporting YOLO26 models to TorchScript makes it easier to use them in different places and helps them run faster and more efficiently. This is great for anyone looking to use deep learning models more effectively in real-world applications.

Link to this sectionInstallation#

To install the required package, run:

Installation

# Install the required package for YOLO26
pip install ultralytics

For detailed instructions and best practices related to the installation process, check our Ultralytics Installation guide. While installing the required packages for YOLO26, if you encounter any difficulties, consult our Common Issues guide for solutions and tips.

Link to this sectionUsage#

All Ultralytics YOLO26 models are designed to support export out of the box, making it easy to integrate them into your preferred deployment workflow. You can view the full list of supported export formats and configuration options to choose the best setup for your application.

The TorchScript format supports the Export, Predict, and Validate modes. Export your model, then load the exported model to run inference or validate its accuracy.

Export

from ultralytics import YOLO

# Load a YOLO26 model
model = YOLO("yolo26n.pt")

# Export the model to TorchScript format
model.export(format="torchscript")  # creates 'yolo26n.torchscript'

Predict

from ultralytics import YOLO

# Load the exported TorchScript model
model = YOLO("yolo26n.torchscript")

# Run inference
results = model("https://ultralytics.com/images/bus.jpg")

Validate

from ultralytics import YOLO

# Load the exported TorchScript model
model = YOLO("yolo26n.torchscript")

# Validate accuracy on the COCO8 dataset
metrics = model.val(data="coco8.yaml")

Link to this sectionExport Arguments#

Argument	Type	Default	Description
`format`	`str`	`'torchscript'`	Target format for the exported model, defining compatibility with various deployment environments.
`imgsz`	`int` or `tuple`	`640`	Desired image size for the model input. Can be an integer for square images or a tuple `(height, width)` for specific dimensions.
`dynamic`	`bool`	`False`	Allows dynamic input sizes, enhancing flexibility in handling varying image dimensions.
`quantize`	`int` or `str`	`None`	Quantization precision: `16` (FP16) requires GPU export with `device=0`; `32`/unset is FP32. Replaces the deprecated `half` flag.
`nms`	`bool`	`False`	Adds Non-Maximum Suppression (NMS), essential for accurate and efficient detection post-processing.
`batch`	`int`	`1`	Specifies export model batch inference size or the max number of images the exported model will process concurrently in `predict` mode.
`device`	`str`	`None`	Specifies the device for exporting: GPU (`device=0`), CPU (`device=cpu`), MPS for Apple silicon (`device=mps`).

For more details about the export process, visit the Ultralytics documentation page on exporting.

Link to this sectionDeploying Exported YOLO26 TorchScript Models#

After successfully exporting your Ultralytics YOLO26 models to TorchScript format, you can now deploy them. The primary and recommended first step for running a TorchScript model is to use the YOLO("model.torchscript") method, as outlined in the previous usage code snippet. For in-depth instructions on deploying your TorchScript models in other settings, take a look at the following resources:

Explore Mobile Deployment: Use ExecuTorch's separate torch.export() → .pte pipeline for current PyTorch mobile deployment.
Master Server-Side Deployment: Learn how to deploy models server-side with TorchServe, offering a step-by-step tutorial for scalable, efficient model serving.
Implement C++ Deployment: Dive into the Tutorial on Loading a TorchScript Model in C++, facilitating the integration of your TorchScript models into C++ applications for enhanced performance and versatility.

Link to this sectionSummary#

In this guide, we explored the process of exporting Ultralytics YOLO26 models to the TorchScript format. By following the provided instructions, you can optimize YOLO26 models for performance and gain the flexibility to deploy them across various platforms and environments.

For further details on usage, visit TorchScript's official documentation.

Also, if you'd like to know more about other Ultralytics YOLO26 integrations, visit our integration guide page. You'll find plenty of useful resources and insights there.

Link to this sectionFAQ#

Link to this sectionWhat is Ultralytics YOLO26 model export to TorchScript?#

Exporting an Ultralytics YOLO26 model to TorchScript allows for flexible, cross-platform deployment. TorchScript, a part of the PyTorch ecosystem, facilitates the serialization of models, which can then be executed in environments that lack Python support. This makes it useful for deploying models in C++ environments.

Link to this sectionHow can I export my YOLO26 model to TorchScript using Ultralytics?#

To export a YOLO26 model to TorchScript, you can use the following example code:

Usage

from ultralytics import YOLO

# Load a YOLO26 model
model = YOLO("yolo26n.pt")

# Export the model to TorchScript format
model.export(format="torchscript")  # creates 'yolo26n.torchscript'

# Load the exported TorchScript model
torchscript_model = YOLO("yolo26n.torchscript")

# Run inference
results = torchscript_model("https://ultralytics.com/images/bus.jpg")

For more details about the export process, refer to the Ultralytics documentation on exporting.

Link to this sectionWhy should I use TorchScript for deploying YOLO26 models?#

Using TorchScript for deploying YOLO26 models offers several advantages:

Portability: Exported models can run in C++ applications without Python.
Optimization: TorchScript supports static graph execution and Just-In-Time (JIT) compilation, which can optimize model performance.
Cross-Language Integration: TorchScript models can be integrated into other programming languages, enhancing flexibility and expandability.
Serialization: Models can be serialized, allowing for platform-independent loading and inference.

For more insights into deployment, visit the TorchServe Documentation and the C++ Deployment Guide. For on-device mobile deployment, PyTorch now recommends ExecuTorch, which uses its own separate torch.export() → .pte pipeline rather than TorchScript.

Link to this sectionWhat are the installation steps for exporting YOLO26 models to TorchScript?#

To install the required package for exporting YOLO26 models, use the following command:

Installation

# Install the required package for YOLO26
pip install ultralytics

For detailed instructions, visit the Ultralytics Installation guide. If any issues arise during installation, consult the Common Issues guide.

Link to this sectionHow do I deploy my exported TorchScript YOLO26 models?#

After exporting YOLO26 models to the TorchScript format, you can deploy them across a variety of platforms:

C++ API: Use LibTorch for low-overhead, highly efficient production environments.
Mobile Deployment: Use ExecuTorch, PyTorch's supported replacement with a separate .pte export pipeline.
Cloud Deployment: Utilize services like TorchServe for scalable server-side deployment.

Explore comprehensive guidelines for deploying models in these settings to take full advantage of TorchScript's capabilities.

Contributors

GLglenn-jocher¹⁹ LAlakshanthad⁴ ABabirami-vina² ONonuralpszr¹ RAraimbekovm¹ PDpderrenger¹ MAMatthewNoyce¹ RIRizwanMunawar¹

Created Feb 29, 2024Updated 1 week ago