No license

Link to this sectionMNIST Dataset#

Name: MNIST Image Classification Dataset
Creator: Yann LeCun
Keywords: MNIST, dataset, handwritten digits, image classification, deep learning, machine learning, training set, testing set, NIST

The MNIST (Modified National Institute of Standards and Technology) dataset is an image classification benchmark of 70,000 28x28 grayscale images of handwritten digits spanning 10 classes — the digits 0 through 9. It ships with a predefined split of 60,000 training and 10,000 test images and has long served as the standard benchmark for evaluating machine learning and computer vision algorithms. For the harder clothing-image equivalent, see the related Fashion-MNIST dataset; for color images, see CIFAR-10.

Link to this sectionKey Features#

MNIST contains 60,000 training images and 10,000 test images of handwritten digits, for 70,000 in total.
Every image is a 28x28 grayscale picture of a single digit, normalized and anti-aliased into a fixed 28x28 bounding box.
The 10 classes span the digits 0–9, with a roughly balanced number of images per class.
It ships with a predefined train/test split, so no manual or automatic splitting is required.
MNIST is a standard benchmark for image classification and deep learning research.

Link to this sectionDataset Structure#

MNIST ships with an official, predefined split, so no automatic or manual partitioning is needed:

Classes: 10 (handwritten digits 0–9)
Total images: 70,000 (28x28 grayscale)
Training set: 60,000 images
Test set: 10,000 images

Validation split

MNIST has no separate validation folder, so Ultralytics uses the 10,000-image test set as the validation split during training by default.

Each image is labeled with its corresponding digit (0–9), making MNIST a supervised dataset ideal for classification tasks.

Link to this sectionApplications#

MNIST is widely used to train and evaluate image classification models, from classic Convolutional Neural Networks (CNNs) and Support Vector Machines (SVMs) to modern deep architectures. Its small grayscale images and 10 digit classes make it a fast, reproducible benchmark for algorithm comparison and computer vision experimentation.

Some common applications include:

Benchmarking new classification algorithms
Educational purposes for teaching machine learning concepts
Prototyping image recognition systems
Testing model optimization techniques

Link to this sectionUsage#

Train a YOLO classification model on MNIST for 100 epochs at an image size of 28. The dataset downloads and caches automatically on first use; if you prefer full control over preprocessing, the original gzip archives are also available from the MNIST database. For the full list of available arguments, see the Training page and the image classification task guide.

Train Example

from ultralytics import YOLO

# Load a model
model = YOLO("yolo26n-cls.pt")  # load a pretrained model (recommended for training)

# Train the model
results = model.train(data="mnist", epochs=100, imgsz=28)

Quick tests with MNIST160

Ultralytics also exposes data="mnist160", a 160-image slice containing the first eight images of each digit (0–9) from both the train and test splits. It mirrors the MNIST directory structure, so you can swap datasets without changing any other arguments — ideal for CI pipelines or sanity checks before committing to the full 70,000-image dataset.

yolo classify train data=mnist160 model=yolo26n-cls.pt epochs=5 imgsz=28

Link to this sectionSample Images and Annotations#

Sample images from the MNIST dataset:

MNIST handwritten digit classification dataset samples

The samples show the range of handwriting styles the dataset captures across the 10 digit classes.

Link to this sectionCitations and Acknowledgments#

If you use the MNIST dataset in your research or development work, please cite the following paper:

Quote

@article{lecun2010mnist,
         title={MNIST handwritten digit database},
         author={LeCun, Yann and Cortes, Corinna and Burges, CJ},
         journal={ATT Labs [Online]},
         volume={2},
         year={2010}
}

We would like to acknowledge Yann LeCun, Corinna Cortes, and Christopher J.C. Burges for creating and maintaining the MNIST dataset as a valuable resource for the machine learning and computer vision research community. For more information about the MNIST dataset and its creators, visit the MNIST dataset website.

Link to this sectionFAQ#

Link to this sectionWhat is the MNIST dataset, and why is it important in machine learning?#

The MNIST dataset is a benchmark of 70,000 28x28 grayscale images of handwritten digits, split into 60,000 training and 10,000 test images across the 10 classes 0–9. It is the standard reference for evaluating image classification algorithms — its small, uniform format lets researchers and engineers compare methods and track progress with minimal setup, which is why it remains a common first benchmark in machine learning.

Link to this sectionHow many classes and images does the MNIST dataset have?#

MNIST has 10 classes — the handwritten digits 0 through 9 — and 70,000 grayscale images in total, each 28x28 pixels. It ships with a predefined split of 60,000 training and 10,000 test images, with a roughly even number of examples per digit.

Link to this sectionHow can I use Ultralytics YOLO to train a model on the MNIST dataset?#

To train an Ultralytics YOLO model on MNIST, use the code snippets below. The dataset downloads automatically on first use. For a detailed list of available training arguments, refer to the Training page.

Train Example

from ultralytics import YOLO

# Load a model
model = YOLO("yolo26n-cls.pt")  # load a pretrained model (recommended for training)

# Train the model
results = model.train(data="mnist", epochs=100, imgsz=28)

Link to this sectionHow is the MNIST dataset split into training and test sets?#

MNIST ships with a predefined split of 60,000 training images and 10,000 test images. Unlike folder-based classification datasets that Ultralytics splits automatically, MNIST's official partition is used as-is, and the test set serves as the validation split during training by default.

Link to this sectionWhat is the difference between the MNIST and EMNIST datasets?#

The MNIST dataset contains only handwritten digits, whereas the Extended MNIST (EMNIST) dataset includes both digits and uppercase and lowercase letters. EMNIST was developed as a successor to MNIST and uses the same 28x28 pixel format, making it compatible with tools and models designed for the original MNIST dataset. This broader range of characters makes EMNIST useful for a wider variety of machine learning applications.

Link to this sectionCan I use Ultralytics Platform to train models on datasets like MNIST?#

Yes. Ultralytics Platform lets you upload datasets, train image classification models, and deploy them without extensive coding. It is a convenient way to run MNIST experiments in the cloud — see the classification datasets overview for related options.

Link to this sectionHow does MNIST compare to other image classification datasets?#

MNIST is simpler than many modern datasets like CIFAR-10 or ImageNet, making it ideal for beginners and quick experimentation. While more complex datasets offer greater challenges with color images and diverse object categories, MNIST remains valuable for its simplicity, small file size, and historical significance in the development of machine learning algorithms. For a harder drop-in replacement with the same structure, see Fashion-MNIST, which features clothing items instead of digits.

Contributors

GLglenn-jocher¹⁸ RAraimbekovm² PDpderrenger¹ MAMatthewNoyce¹ JKjk4e¹

Created Nov 12, 2023Updated 3 days ago