0% found this document useful (0 votes)

64 views

Image Datasets For Practicing Machine Learning in OpenCV

This document discusses two image datasets that can be used for machine learning in OpenCV: 1) The digits dataset provided by OpenCV contains 5,000 handwritten digit images that can be easily extracted from a single image file. 2) The CIFAR-10 dataset contains 60,000 complex images across 10 classes that must be downloaded separately. Functions are provided to load the datasets. 3) The datasets can be partitioned into training and test sets and loaded using functions defined in separate Python scripts for use in OpenCV machine learning tutorials.

Uploaded by

prediatech

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

64 views

Image Datasets For Practicing Machine Learning in OpenCV

Uploaded by

prediatech

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

 Navigation

Click Here to Take the FREE Machine Learning with OpenCV Crash-Course

Search... 

Image Datasets for Practicing Machine Learning in

OpenCV
by Stefania Cristina on January 30, 2024 in OpenCV 0

Share Tweet Share

At the very start of your machine learning journey, publicly available datasets alleviate the worry of creating the
datasets yourself and let you focus on learning to use the machine learning algorithms. It also helps if the datasets
are moderately sized and do not require too much pre-processing to get you to practice using the algorithms
quicker before moving on to more challenging problems.

Two datasets we will be looking at are the simpler digits dataset provided with OpenCV and the more challenging
but widely used CIFAR-10 dataset. We will use any of these two datasets during our journey through OpenCV’s
machine learning algorithms.

In this tutorial, you will learn how to download and extract the OpenCV digits and CIFAR-10 datasets for practicing
machine learning in OpenCV.

After completing this tutorial, you will know:

How to download and extract the OpenCV digits dataset.

How to download and extract the CIFAR-10 dataset without necessarily relying on other Python packages
(such as TensorFlow).

Kick-start your project with my book Machine Learning in OpenCV. It provides self-study tutorials with working
code.

Let’s get started.

Image Datasets for Practicing Machine Learning in OpenCV
Photo by OC Gonzalez, some rights reserved.

Tutorial Overview
This tutorial is divided into three parts; they are:

The Digits Dataset

The CIFAR-10 Dataset
Loading the Datasets

The Digits Dataset

OpenCV provides the image, digits.png, composed of a ‘collage’ of 20× 20 pixel sub-images, where each sub-
image features a digit from 0 to 9 and may be split up to create a dataset. In total, the digits image contains 5,000
handwritten digits.

The digits dataset provided by OpenCV does not necessarily represent the real-life challenges that come with more
complex datasets, primarily because its image content features very limited variation. However, its simplicity and
ease of use will allow us to quickly test several machine learning algorithms at a low pre-processing and
computational cost.

To be able to extract the dataset from the full digits image, our first step is to split it into the many sub-images that
make it up. For this purpose, let’s create the following split_images function:

1 from cv2 import imread, IMREAD_GRAYSCALE

2 from numpy import hsplit, vsplit, array
3
4 def split_images(img_name, img_size):
5
6 # Load the full image from the specified file
7 img = imread(img_name, IMREAD_GRAYSCALE)
8
9 # Find the number of sub-images on each row and column according to their size
10 num_rows = img.shape[0] / img_size
11 num_cols = img.shape[1] / img_size
12
13 # Split the full image horizontally and vertically into sub-images
14 sub_imgs = [hsplit(row, num_cols) for row in vsplit(img, num_rows)]
15
16 return img, array(sub_imgs)

The split_images function takes as input the path to the full image, together with the pixel size of the sub-
images. Since we are working with square sub-images, we shall be denoting their size by a single dimension,
which is equal to 20.

The function subsequently applies the OpenCV imread method to load a grayscale version of the image into a
NumPy array. The hsplit and vsplit methods are then used to split the NumPy array horizontally and vertically,
respectively.

The array of sub-images the split_images function returns is of size (50, 100, 20, 20).

Once we have extracted the array of sub-images, we shall partition it into training and testing sets. We will also
need to create the ground truth labels for both splits of data to be used during the training process and to evaluate
the test results.

The following split_data function serves these purposes:

1 from numpy import float32, arange, repeat, newaxis

2
3 def split_data(img_size, sub_imgs, ratio):
4
5 # Compute the partition between the training and testing data
6 partition = int(sub_imgs.shape[1] * ratio)
7
8 # Split dataset into training and testing sets
9 train = sub_imgs[:, :partition, :, :]
10 test = sub_imgs[:, partition:sub_imgs.shape[1], :, :]
11
12 # Flatten each image into a one-dimensional vector
13 train_imgs = train.reshape(-1, img_size ** 2)
14 test_imgs = test.reshape(-1, img_size ** 2)
15
16 # Create the ground truth labels
17 labels = arange(10)
18 train_labels = repeat(labels, train_imgs.shape[0] / labels.shape[0])[:, newaxis]
19 test_labels = repeat(labels, test_imgs.shape[0] / labels.shape[0])[:, newaxis]
20
21 return train_imgs, train_labels, test_imgs, test_labels

The split_data function takes the array of sub-images as input and the split ratio for the training portion of the
dataset. The function then proceeds to compute the partition value that divides the array of sub-images along
its columns into training and testing sets. This partition value is then used to allocate the first set of columns to
the training data and the remaining set of columns to the testing data.

To visualize this partitioning on the digits.png image, this would appear as follows:
Partitioning the sub-images into a training dataset and a testing dataset

You may also note that we are flattening out every 20× 20 sub-image into a one-dimensional vector of length 400
pixels such that, in the arrays containing the training and testing images, every row now stores a flattened out
version of a 20/ times 20 pixel image.

The final part of the split_data function creates ground truth labels with values between 0 and 9 and repeats
these values according to how many training and testing images we have available.

The CIFAR-10 Dataset

The CIFAR-10 dataset is not provided with OpenCV, but we shall consider it because it represents real-world
challenges better than OpenCV’s digits dataset.

The CIFAR-10 dataset consists of a total of 60,000, 32× 32 RGB images. It features a variety of images belonging
to 10 different classes, such as airplanes, cats, and ships. The dataset files are readily split into 5 pickle files
containing 1,000 training images and labels, plus an additional one with 1,000 testing images and labels.

Let’s go ahead and download the CIFAR-10 dataset for Python from this link (note: the reason for not using
TensorFlow/Keras to do so is to show how we can work without relying on additional Python packages if need be).
Take note of the path on your hard disk to which you have saved and extracted the dataset.

The following code loads the dataset files and returns the training and testing, images, and labels:

1 from pickle import load

2 from numpy import array, newaxis
3
4
5 def load_images(path):
6
7 # Create empty lists to store the images and labels
8 imgs = []
9 labels = []
10
11 # Iterate over the dataset's files
12 for batch in range(5):
13
14 # Specify the path to the training data
15 train_path_batch = path + 'data_batch_' + str(batch + 1)
16
17 # Extract the training images and labels from the dataset files
18 train_imgs_batch, train_labels_batch = extract_data(train_path_batch)
19
20 # Store the training images
21 imgs.append(train_imgs_batch)
22 train_imgs = array(imgs).reshape(-1, 3072)
23
24 # Store the training labels
25 labels.append(train_labels_batch)
26 train_labels = array(labels).reshape(-1, 1)
27
28 # Specify the path to the testing data
29 test_path_batch = path + 'test_batch'
30
31 # Extract the testing images and labels from the dataset files
32 test_imgs, test_labels = extract_data(test_path_batch)
33 test_labels = array(test_labels)[:, newaxis]
34
35 return train_imgs, train_labels, test_imgs, test_labels
36
37
38 def extract_data(path):
39
40 # Open pickle file and return a dictionary
41 with open(path, 'rb') as fo:
42 dict = load(fo, encoding='bytes')
43
44 # Extract the dictionary values
45 dict_values = list(dict.values())
46
47 # Extract the images and labels
48 imgs = dict_values[2]
49 labels = dict_values[1]
50
51 return imgs, labels

It is important to remember that the compromise of testing out different models using a larger and more varied
dataset, such as the CIFAR-10, over a simpler one, such as the digits dataset, is that training on the former might
be more time-consuming.

Loading the Datasets

Let’s try calling the functions that we have created above.

I have separated the code belonging to the digits dataset from the code belonging to the CIFAR-10 dataset into two
different Python scripts that I named digits_dataset.py and cifar_dataset.py, respectively:

1 from digits_dataset import split_images, split_data

2 from cifar_dataset import load_images
3
4 # Load the digits image
5 img, sub_imgs = split_images('Images/digits.png', 20)
6
7 # Obtain training and testing datasets from the digits image
8 digits_train_imgs, digits_train_labels, digits_test_imgs, digits_test_labels = split_data(20, sub_imgs
9
10 # Obtain training and testing datasets from the CIFAR-10 dataset
11 cifar_train_imgs, cifar_train_labels, cifar_test_imgs, cifar_test_labels = load_images('Images/cifar-1

Note: Do not forget to change the paths in the code above to where you have saved your data files.

In the subsequent tutorials, we shall see how to use these datasets with different machine learning techniques, first
seeing how to convert the dataset images into feature vectors as one of the pre-processing steps before using
them for machine learning.
Want to Get Started With Machine Learning with OpenCV?
Take my free email crash course now (with sample code).

Click to sign-up and also get a free PDF Ebook version of the course.

Download Your FREE Mini-Course

Further Reading
This section provides more resources on the topic if you want to go deeper.

Books
Mastering OpenCV 4 with Python, 2019.

Websites
OpenCV, https://opencv.org/

Summary
In this tutorial, you learned how to download and extract the OpenCV digits and CIFAR-10 datasets for practicing
machine learning in OpenCV.

Specifically, you learned:

How to download and extract the OpenCV digits dataset.

How to download and extract the CIFAR-10 dataset without necessarily relying on other Python packages
(such as TensorFlow).

Do you have any questions?

Ask your questions in the comments below, and I will do my best to answer.

Get Started on Machine Learning in OpenCV!

Learn how to use machine learning techniques in image processing projects
...using OpenCV in advanced ways and work beyond pixels

Discover how in my new Ebook:

Machine Learing in OpenCV

It provides self-study tutorials with all working code in Python to turn you from a novice to expert. It equips you with
logistic regression, random forest, SVM, k-means clustering, neural networks, and much more...all using the machine learning
module in OpenCV

Kick-start your deep learning journey with hands-on exercises

SEE WHAT'S INSIDE

Share Tweet Share

About Stefania Cristina

Stefania Cristina, PhD is a Lecturer with the Department of Systems and Control Engineering, at the University
of Malta.
View all posts by Stefania Cristina →

 CIFAR-10, datasets, digits, machine learning, opencv

 Extracting Histogram of Gradients with OpenCV How to Train a Object Detection Engine with HOG in OpenCV 

No comments yet.

Email (will not be published) (required)

SUBMIT COMMENT

Welcome!
I'm Jason Brownlee PhD
and I help developers get results with machine learning.
Read more

Never miss a tutorial:

Picked for you:

Running a Neural Network Model in OpenCV

K-Means Clustering in OpenCV and Application for Color Quantization

How to Transform Images and Create Video with OpenCV

A Gentle Introduction to OpenCV: An Open Source Library for Computer Vision and Machine Learning

How to Read, Write, Display Images in OpenCV and Converting Color Spaces

Loving the Tutorials?

The Machine Learning in Open CV EBook

is where you'll find the Really Good stuff.

>> SEE WHAT'S INSIDE

LinkedIn | Twitter | Facebook | Newsletter | RSS

Privacy | Disclaimer | Terms | Contact | Sitemap | Search

Image Processing
No ratings yet
Image Processing
5 pages
POC Guide
No ratings yet
POC Guide
8 pages
BS en 50171-2001
100% (5)
BS en 50171-2001
24 pages
MeterMax Ultra Owner's Manual (900M14 - 05)
No ratings yet
MeterMax Ultra Owner's Manual (900M14 - 05)
67 pages
Mnist Handwritten Digit Classification
No ratings yet
Mnist Handwritten Digit Classification
26 pages
Muqaddas Zulfiqar 30101assignment 7
No ratings yet
Muqaddas Zulfiqar 30101assignment 7
12 pages
DLT Record Final
No ratings yet
DLT Record Final
120 pages
MLP - Week 5 - MNIST - Perceptron - Ipynb - Colaboratory
No ratings yet
MLP - Week 5 - MNIST - Perceptron - Ipynb - Colaboratory
31 pages
MNIST Dataset
No ratings yet
MNIST Dataset
12 pages
Deep Learning Project for Computer Vision with Python 2022
No ratings yet
Deep Learning Project for Computer Vision with Python 2022
297 pages
Three Ways of Storing and Accessing Lots of Images in Python
No ratings yet
Three Ways of Storing and Accessing Lots of Images in Python
27 pages
0
No ratings yet
0
343 pages
Lab Record
No ratings yet
Lab Record
30 pages
PCa $ Image processing
No ratings yet
PCa $ Image processing
8 pages
MNIST
No ratings yet
MNIST
54 pages
DL Lab-III-II
No ratings yet
DL Lab-III-II
98 pages
Deep Learning Manual (1)
No ratings yet
Deep Learning Manual (1)
44 pages
Recognizing Handwritten Digits With Scikit-Learn: Punam Seal
No ratings yet
Recognizing Handwritten Digits With Scikit-Learn: Punam Seal
21 pages
CV Assignment 2 Group02
No ratings yet
CV Assignment 2 Group02
12 pages
CIS 6213 Applied Machine Learning Coursework
No ratings yet
CIS 6213 Applied Machine Learning Coursework
5 pages
Def Load - Data (Data - Directory)
No ratings yet
Def Load - Data (Data - Directory)
3 pages
DL Lab-final
No ratings yet
DL Lab-final
22 pages
Assignment 02# - Machine Learning 2023
No ratings yet
Assignment 02# - Machine Learning 2023
8 pages
Project Documentation
No ratings yet
Project Documentation
24 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
42 pages
Intro Ai Group3
No ratings yet
Intro Ai Group3
35 pages
Unit 4 (10 marks
No ratings yet
Unit 4 (10 marks
16 pages
Lab09 Assignment
No ratings yet
Lab09 Assignment
29 pages
Explore the Implementation of CNNs in Python
No ratings yet
Explore the Implementation of CNNs in Python
10 pages
CNN 1721592934
No ratings yet
CNN 1721592934
53 pages
Image Classification
No ratings yet
Image Classification
18 pages
How to Develop a CNN for MNIST Handwritten Digit Classification
No ratings yet
How to Develop a CNN for MNIST Handwritten Digit Classification
43 pages
Dlv Lab Manual Print
No ratings yet
Dlv Lab Manual Print
29 pages
IP_LAB[1]
No ratings yet
IP_LAB[1]
8 pages
DRASHTI_CVML
No ratings yet
DRASHTI_CVML
83 pages
stanfordKNNassignment
No ratings yet
stanfordKNNassignment
78 pages
DIP Mini Project
100% (1)
DIP Mini Project
12 pages
Deep Learning lab manual
No ratings yet
Deep Learning lab manual
69 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
CV Practical
No ratings yet
CV Practical
3 pages
ML Ex6
No ratings yet
ML Ex6
8 pages
Pytorch Waste Classification Using Densenet Jupyter Notebook
No ratings yet
Pytorch Waste Classification Using Densenet Jupyter Notebook
14 pages
Computer Vision With Python (Answer)
No ratings yet
Computer Vision With Python (Answer)
11 pages
Performance Testing
No ratings yet
Performance Testing
15 pages
Digital Image Processing Lab Manual# 2
No ratings yet
Digital Image Processing Lab Manual# 2
6 pages
Assignment3 AL
No ratings yet
Assignment3 AL
23 pages
Revision Python For Computer Vision
No ratings yet
Revision Python For Computer Vision
50 pages
Computer vision activity
No ratings yet
Computer vision activity
6 pages
LP V GRPB 2b
No ratings yet
LP V GRPB 2b
8 pages
Practical Image-1
No ratings yet
Practical Image-1
22 pages
Lab05 ML Naqash
No ratings yet
Lab05 ML Naqash
10 pages
Computer Vision LAB 8 SEM
No ratings yet
Computer Vision LAB 8 SEM
92 pages
EX_2
No ratings yet
EX_2
2 pages
Report Digit Recognition
No ratings yet
Report Digit Recognition
11 pages
Dinushasan Courseproject04: Sign in
No ratings yet
Dinushasan Courseproject04: Sign in
19 pages
Project
No ratings yet
Project
15 pages
Project Guidelines_ AIML
No ratings yet
Project Guidelines_ AIML
30 pages
Image Processing: Mentor: Saqib Azim
No ratings yet
Image Processing: Mentor: Saqib Azim
24 pages
Ip Lab Programs
No ratings yet
Ip Lab Programs
34 pages
業務処理定義書セマンティックセグメンテーション En
No ratings yet
業務処理定義書セマンティックセグメンテーション En
9 pages
Input Image
No ratings yet
Input Image
8 pages
Digital Image Processing Lab Manual
No ratings yet
Digital Image Processing Lab Manual
26 pages
Machine Learning for iOS Developers
From Everand
Machine Learning for iOS Developers
Abhishek Mishra
No ratings yet
Angular Generative AI: Building an intelligent CV enhancer with Google Gemini
From Everand
Angular Generative AI: Building an intelligent CV enhancer with Google Gemini
Abdelfattah Ragab
No ratings yet
8 Tactics To Combat Imbalanced Classes in Your Machine Learning Dataset
No ratings yet
8 Tactics To Combat Imbalanced Classes in Your Machine Learning Dataset
62 pages
How To Prepare Data For Machine Learning
No ratings yet
How To Prepare Data For Machine Learning
34 pages
Build A Machine Learning Portfolio
No ratings yet
Build A Machine Learning Portfolio
18 pages
How To Choose The Right Test Options When Evaluating Machine Learning Algorithms
No ratings yet
How To Choose The Right Test Options When Evaluating Machine Learning Algorithms
16 pages
Pilot
No ratings yet
Pilot
78 pages
ALPHA Script - Presentation
No ratings yet
ALPHA Script - Presentation
13 pages
Lecture - 11 - Regression Testing
No ratings yet
Lecture - 11 - Regression Testing
32 pages
Assignment (1) Ict
No ratings yet
Assignment (1) Ict
8 pages
Product Datasheet
No ratings yet
Product Datasheet
1 page
What Is The Function of LOR's?
100% (1)
What Is The Function of LOR's?
8 pages
Cisco Email Encryption Compatibility Matrix: Revised: July 01, 2020
No ratings yet
Cisco Email Encryption Compatibility Matrix: Revised: July 01, 2020
13 pages
UCLA Computer Science B.S.
No ratings yet
UCLA Computer Science B.S.
3 pages
Layout and Construction: Grouted Riprap
No ratings yet
Layout and Construction: Grouted Riprap
4 pages
SYMMETRICAL FACE GUIDE
No ratings yet
SYMMETRICAL FACE GUIDE
7 pages
Bin Liu
No ratings yet
Bin Liu
19 pages
HKICO_Scratch_ver3
No ratings yet
HKICO_Scratch_ver3
26 pages
TP Phisics 1 L2
No ratings yet
TP Phisics 1 L2
5 pages
Pressure Switch MFD
No ratings yet
Pressure Switch MFD
2 pages
CLP Word Program
No ratings yet
CLP Word Program
6 pages
Concentration Terms Practice Questions 1728747202
No ratings yet
Concentration Terms Practice Questions 1728747202
3 pages
Database 04 Relational Algebra Part1
No ratings yet
Database 04 Relational Algebra Part1
17 pages
Nuclear Fission and Fusion Lesson 16
No ratings yet
Nuclear Fission and Fusion Lesson 16
6 pages
TIA PRO1 04 DevicesAndNetworks en
No ratings yet
TIA PRO1 04 DevicesAndNetworks en
48 pages
Assessing Mechanical Properties and Microstructure of Fire-Damaged Engineered Cementitious Composites
No ratings yet
Assessing Mechanical Properties and Microstructure of Fire-Damaged Engineered Cementitious Composites
8 pages
Assignment 3
No ratings yet
Assignment 3
2 pages
03-exec
No ratings yet
03-exec
11 pages
RT-duroid 6006-6010LM Laminate Data Sheet PDF
No ratings yet
RT-duroid 6006-6010LM Laminate Data Sheet PDF
2 pages
Ottocento Lesson #3 Confidence Interval & Margin of Error Worksheet
No ratings yet
Ottocento Lesson #3 Confidence Interval & Margin of Error Worksheet
5 pages
The Micro-Architecture Level: Ms - Chit Su Mon
No ratings yet
The Micro-Architecture Level: Ms - Chit Su Mon
70 pages
Almanaratain Blocks Catalogue PDF
No ratings yet
Almanaratain Blocks Catalogue PDF
14 pages