Laboratory Lecture for the "Intelligent Systems for Pattern Recognition"
course of the Master's Degree in Computer Science, 2024-2025
Riccardo Massidda
[email protected]
Huge thanks to Valerio De Caro and Antonio Carta for previous versions of this material.
Why PyTorch?
Tensor Manipulation.
Tensor operations with a MATLAB/NumPy-like API.
Accelerator Support.
Seamless execution on CPU, GPU, and TPU devices.
Automatic Differentiation.
Only need to define forward computation → chain rule! ⛓
High-Level API.
Readily available neural network layers, losses, optimizers, …
Getting Started
For this lecture:
1. Clone the repository di-unipi/ispr-lab from GitHub,
2. Install PyTorch, either using an environment manager (conda, pipenv,
poetry, etc.) or using Docker/Podman 🐳.
In a hurry? Just open the repository in Google Colab!
Up-to-date instructions to install PyTorch here: Start Locally | PyTorch
Basics of Tensor Operations and Manipulation
Tensors
Tensors are the main data structure and represent multidimensional arrays.
As with NumPy arrays, they support advanced indexing and broadcasting.
Attributes:
● dtype: determines the type of the tensor elements (float{16, 32, 64}, int{8, 16, 32, 64}, uint8, …). It can be specified at initialization.
● device: memory location, as in CPU or GPU
● layout: dense tensors (strided) or sparse (sparse_coo)
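A quick sketch of inspecting these attributes (shapes and values are arbitrary):

import torch

x = torch.zeros(2, 3, dtype=torch.float32)
print(x.dtype)    # torch.float32
print(x.device)   # cpu
print(x.layout)   # torch.strided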
Tensor Initialization
● Existing Array: torch.tensor(list)
● Constants: torch.zeros(*dims), torch.ones(*dims)
● Random: torch.randn(*dims), torch.rand(*dims)
● Range: torch.linspace(start, end, steps=100)
● NumPy: torch.from_numpy(arr)
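A short sketch of each initializer (shapes and values are arbitrary):

import numpy as np
import torch

a = torch.tensor([[1., 2.], [3., 4.]])  # from an existing (nested) list
z = torch.zeros(2, 3)                   # constant tensors
o = torch.ones(2, 3)
n = torch.randn(2, 3)                   # samples from a standard normal
u = torch.rand(2, 3)                    # samples uniform in [0, 1)
r = torch.linspace(0, 1, steps=5)       # evenly spaced values in [0, 1]
f = torch.from_numpy(np.arange(6))      # shares memory with the NumPy array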
Tensor Operations
Some operators are overloaded:
● +, - for addition and subtraction (support broadcasting)
● * is the elementwise multiplication (not the matrix product; it supports broadcasting)
● @ for matrix multiplication (torch.matmul)
In-place operations are defined with a trailing underscore:
● add_, sub_, mul_ are the in-place equivalents of the elementwise operators above; the other operand can be broadcast, but the shape of the modified tensor cannot change.
● There is no in-place matmul_, since matrix multiplication generally produces a result with a different shape.
Check the documentation: http://pytorch.org/docs/stable/tensors.html
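A brief sketch of the overloaded operators and an in-place variant:

import torch

a = torch.randn(3, 4)
b = torch.randn(3, 4)
w = torch.randn(4, 2)

s = a + b      # elementwise sum
p = a * b      # elementwise (Hadamard) product
m = a @ w      # matrix product, shape (3, 2), same as torch.matmul(a, w)

a.add_(b)      # in-place: modifies a directly, no new tensor is allocated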
Broadcasting Rules
PyTorch's broadcasting semantics follows NumPy's.
Two tensors are "broadcastable" if the following rules hold:
1. Each tensor has at least one dimension.
2. When iterating over the dimension sizes, starting at the trailing
dimension, the dimension sizes must either be equal, one of them is 1, or
one of them does not exist.
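For example, a quick sketch of the rules at work:

import torch

x = torch.randn(5, 1, 4)   # shape (5, 1, 4)
y = torch.randn(3, 1)      # shape    (3, 1)

# trailing dims: 4 vs 1 -> ok, 1 vs 3 -> ok, 5 vs missing -> ok
z = x + y
print(z.shape)             # torch.Size([5, 3, 4])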
Broadcasting Rules
Worked examples (figures) are available in the official documentation:
https://pytorch.org/docs/stable/notes/broadcasting.html
https://numpy.org/doc/stable/user/basics.broadcasting.html
Tensors in GPU
The submodule torch.cuda provides the API for GPU management.
Check availability of the GPU
torch.cuda.is_available()
Create or move to GPU
torch.tensor([2., -1.], device="cuda")
tensor.to("cuda")
In every operation, all the tensors involved must reside on the same device, and the result is allocated on that same device.
You can move tensors back to the CPU with the tensor.cpu() method.
Tensors in GPU
On a server, you typically have access to multiple shared GPUs and must select one:
1. Manually, with the device argument ("cuda:0", "cuda:1", …), or
2. Using the context manager torch.cuda.device, or
3. Setting the shell environment variable CUDA_VISIBLE_DEVICES to limit the visible GPUs:
export CUDA_VISIBLE_DEVICES=0
Note that the visible GPU indices always start from 0.
⚠ Remember to de-allocate tensors from the GPU when you are no longer using them!
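A common device-agnostic sketch (assuming at most one GPU is used):

import torch

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

x = torch.randn(3, 3, device=device)   # allocate directly on the device
y = torch.randn(3, 3).to(device)       # or move an existing tensor
z = (x @ y).cpu()                      # bring the result back to the CPU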
Tensor Indexing
Basic tensor indexing is similar to list indexing, but with multiple dimensions.
Boolean arrays can be used to filter elements that satisfy some condition.
If the indices are fewer than the number of dimensions, the missing indices are considered complete slices.

# first k elements
x = arr[:k]
# all but the first k
x = arr[k:]
# negative indexing
x = arr[-k:]
# mixed indexing
arr[:t_max, b:b+k, :]

# indexing with a Boolean condition
def relu(x):
    x[x < 0] = 0
    return x
Tensor Reshaping
Reshaping is fundamental to combine tensors.
● tensor.squeeze() removes all singleton dimensions
● tensor.unsqueeze(dim) adds a singleton dimension at the provided position
● tensor.transpose(dim1, dim2) swaps the two given dimensions of the tensor
● tensor.permute(*dims) re-arranges the dimensions as in *dims

x = torch.randn(5, 1, 5)
# squeeze
x.squeeze() → [5, 5]
# unsqueeze
x.unsqueeze(3) → [5, 1, 5, 1]
# transpose
x.transpose(1, 2) → [5, 5, 1]
# permute
x.permute(1, 0, 2) → [1, 5, 5]
Tensor Reduce
Reduction operations collapse the tensor dimensionality.
● tensor.sum(dim)
● tensor.mean(dim)
● tensor.prod(dim)
● tensor.amin(dim)
● tensor.amax(dim)
The keepdim parameter keeps the reduced dimension in place as a singleton dimension.

x = torch.randn(5, 1, 5)
x.sum(0) → [1, 5]
x.mean(1) → [5, 5]
x.amin(2) → [5, 1]
Your Turn!
The Kaiming uniform initialization scheme
provides a standard baseline to train Neural
Networks with rectified activation functions.
Write the following functions:
● relu_kaiming_init_(weights: torch.Tensor), which modifies the provided tensor in place,
● relu_kaiming_init(in_size, out_size), which returns a new tensor of shape (out_size × in_size).
Autograd
Automatic Differentiation in PyTorch
Autograd
The submodule torch.autograd is responsible for automatic
differentiation.
Each operation creates a Function node in a dynamic computational graph,
connected to its Tensor arguments.
The gradient is computed on each tensor by calling the backward() method.
Autograd
(Figure: see the PyTorch blog post "Overview of PyTorch Autograd Engine".)
Autograd
The main Tensor attributes related to the graph structure are:
● data: Tensor containing the data itself
● grad: Tensor containing the gradient (initially set to None)
● grad_fn: a reference to the Function that created the tensor, used during the backward pass
Each Function implements two methods:
● forward: function application
● backward: gradient computation
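A minimal sketch of these pieces in action:

import torch

w = torch.randn(3, requires_grad=True)   # leaf tensor that will receive a gradient
x = torch.ones(3)                        # plain data, no gradient tracking

loss = (w * x).sum() ** 2                # each operation adds a Function node
print(loss.grad_fn)                      # <PowBackward0 object at ...>

loss.backward()                          # traverses the graph with the chain rule
print(w.grad)                            # equals 2 * w.sum() * x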
Autograd
The requires_grad attribute specifies whether gradient computation should propagate into the Tensor; setting it to False stops backpropagation at that point.
For optimizable model parameters ⇒ requires_grad=True
For input data or constant values ⇒ requires_grad=False
The detach() method returns a tensor removed from the graph, truncating the gradient flow.
In-place modification of tensors needed for the backward pass is not allowed, as it breaks automatic differentiation.
At inference time, the context manager torch.no_grad() speeds up computation and saves memory.
Autograd documentation: http://pytorch.org/docs/stable/autograd.html
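A small sketch of the three mechanisms (values are illustrative):

import torch

w = torch.randn(3, requires_grad=True)   # optimizable parameter
x = torch.ones(3)                        # constant input, requires_grad=False by default

frozen = w.detach()                      # shares data with w, but is cut off from the graph

with torch.no_grad():                    # nothing inside this block is recorded
    y = (w * x).sum()                    # y.requires_grad is False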
Building Models and Pipelines
Model Interface
torch.nn contains the basic components to define your neural networks: layers, loss functions, regularization techniques, … (Optimizers live in the companion submodule torch.optim.)
Module and Parameters
Module is the base class for all the neural network components: Linear,
Convolutional, Recurrent Layers…
Each Module contains a set of Parameter objects, i.e., a "tensor with a name and requires_grad=True".
The parameters() method returns an iterator over model parameters.
If you want to add a list of parameters or sub-modules, you can use the
ParameterList and ModuleList objects.
⚠ If you use a regular list, the parameters will not be registered!
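A small sketch of the difference (the Stack class and its sizes are illustrative):

import torch.nn as nn

class Stack(nn.Module):
    def __init__(self, sizes):
        super().__init__()
        # ModuleList registers the sub-modules: their parameters show up
        # in self.parameters() and are seen by the optimizer
        self.layers = nn.ModuleList(
            [nn.Linear(a, b) for a, b in zip(sizes, sizes[1:])]
        )
        # a plain Python list would silently hide them instead:
        # self.layers = [nn.Linear(a, b) for a, b in zip(sizes, sizes[1:])]

    def forward(self, x):
        for layer in self.layers:
            x = layer(x)
        return x

stack = Stack([784, 128, 10])
print(sum(p.numel() for p in stack.parameters()))  # > 0 only thanks to ModuleList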
Forward and Backward
The logic of the module is defined in the forward() method; call the module as net(in_tensor) rather than net.forward(in_tensor), so that registered hooks also run.
The backward() step is automatically defined by Autograd, but it can be customized by writing a custom torch.autograd.Function!
It is possible to define forward and backward hooks to debug your model!
Modules can operate in train or eval mode: net.train() or net.eval()
This is useful for layers that define a different behavior during train and test,
e.g. Dropout, BatchNormalization…
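A minimal sketch of a custom module (the MLP name and sizes are illustrative):

import torch
import torch.nn as nn

class MLP(nn.Module):
    def __init__(self, in_size, hidden_size, out_size):
        super().__init__()
        self.fc1 = nn.Linear(in_size, hidden_size)
        self.drop = nn.Dropout(p=0.5)
        self.fc2 = nn.Linear(hidden_size, out_size)

    def forward(self, x):
        h = torch.relu(self.fc1(x))
        return self.fc2(self.drop(h))

net = MLP(784, 128, 10)
logits = net(torch.randn(32, 784))   # call the module, not net.forward(...)
net.eval()                           # switches Dropout to its test-time behavior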
Existing Modules
There's no need to
reinvent the wheel!
(in most cases, but sometimes you really do: good luck)
PyTorch provides lots
of common modules
that can be easily glued
together.
https://pytorch.org/docs/stable/nn.html
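For instance, a sketch of gluing existing modules together without writing a new class (sizes are illustrative and mimic MNIST):

import torch.nn as nn

net = nn.Sequential(
    nn.Flatten(),            # (N, 1, 28, 28) -> (N, 784)
    nn.Linear(784, 128),
    nn.ReLU(),
    nn.Dropout(p=0.5),
    nn.Linear(128, 10),
)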
Datasets and Data Loaders
The module torch.utils.data defines classes to represent datasets and to load data from them.
DataLoader automates mini-batching, shuffling, sampling strategies, and any pre-processing, and allows parallel data loading.
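A sketch with a toy in-memory dataset (shapes chosen to mimic MNIST, purely for illustration):

import torch
from torch.utils.data import DataLoader, TensorDataset

# 100 random "images" with integer labels
dataset = TensorDataset(torch.randn(100, 1, 28, 28), torch.randint(0, 10, (100,)))

loader = DataLoader(dataset, batch_size=32, shuffle=True, num_workers=2)

for images, labels in loader:
    pass  # each iteration yields a shuffled mini-batch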
Training Loop
To define a training loop, we need a loss function and an optimizer.
Always check the documentation for the correct shapes and input arguments (does the loss need logits or probabilities? Which dimension should be the last? Is the average taken over elements or over samples?).
⚠ Remember to reset gradients using the zero_grad() method!
(less talk, more code)
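A minimal sketch of the full loop, reusing the hypothetical net and loader from the previous snippets:

import torch
import torch.nn as nn

criterion = nn.CrossEntropyLoss()                      # expects raw logits and class indices
optimizer = torch.optim.SGD(net.parameters(), lr=0.1)

net.train()
for epoch in range(10):
    for images, labels in loader:
        optimizer.zero_grad()          # reset the gradients of the previous step
        logits = net(images)
        loss = criterion(logits, labels)
        loss.backward()                # compute gradients via autograd
        optimizer.step()               # update the parameters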
Logging
Several metrics can help to understand your model.
Logging them is always a good idea!
TensorBoard works great for PyTorch as well.
Otherwise, there are cloud-based commercial products (Weights & Biases, neptune.ai, …).
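A small sketch with TensorBoard (the log directory name is arbitrary; requires the tensorboard package):

from torch.utils.tensorboard import SummaryWriter

writer = SummaryWriter("runs/mnist-cnn")     # hypothetical experiment name

# e.g., once per epoch inside the training loop:
writer.add_scalar("loss/train", loss.item(), epoch)

writer.close()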
Model Serialization
Last, but not least, how do I store my model?
The state dictionary stores the value of all model parameters.
torch.save(the_model.state_dict(), PATH)
Then, instantiate the object and reload the state dictionary.
net = MyModelClass(*args, **kwargs)
net.load_state_dict(torch.load(PATH))
PyTorch Ecosystem
Knowing how things work under the hood is worth the effort.
… but in practice, most "routine" operations can be abstracted away.
Both PyTorch Lightning and Transformers by HuggingFace 🤗 provide APIs
for common practices such as training, logging, evaluating and performing
inference on Machine Learning models.
Also, tons of libraries in the PyTorch Ecosystem: graph neural networks,
interpretability, continual learning, federated learning, quantum ML…
Your Turn!
Implement and train a Convolutional Neural
Network to perform image classification on
the MNIST dataset.
📜 Side Quests:
1. Monitor the performance with a logger,
2. Play around with dropout,
batch_norm, etc.
(remember train vs eval!)
3. Try a PyTorch Lightning implementation