ISSCC, San Francisco, 2019-02-18
Deep Learning Hardware:
Past, Present, & Future
Yann LeCun
Facebook AI Research
New York University
[Link]
Y. LeCun
AI today is mostly supervised learning
Training a machine by showing examples instead of programming it
When the output is wrong, tweak the parameters of the machine
Works well for:
Speech→words
Image→categories
Portrait→name
Photo→caption
Text→topic
(Figure: example images recognized as CAR and PLANE)
Y. LeCun
The History of Neural Nets is Inextricable from Hardware
The McCulloch-Pitts binary neuron: y = sign( Σ_{i=1}^{N} W_i X_i + b )
Perceptron: weights are motorized potentiometers
Adaline: weights are electrochemical “memistors”
[Link]
Y. LeCun
The Standard Paradigm of Pattern Recognition
...and “traditional” Machine Learning
Feature Extractor (hand-engineered) → Trainable Classifier
Y. LeCun
1969→1985: Neural Net Winter
No learning for multilayer nets, why?
People used the wrong “neuron”: the McCulloch & Pitts binary neuron
Binary neurons are easier to implement: No multiplication necessary!
Binary neurons prevented people from thinking about gradient-based
methods for multi-layer nets
Early 1980s: The second wave of neural nets
1982: Hopfield nets: fully-connected recurrent binary networks
1983: Boltzmann Machines: binary stochastic networks with hidden units
1985/86: Backprop! Q: Why only then? A: sigmoid neurons!
Sigmoid neurons were enabled by “fast” floating point (Sun Workstations)
Y. LeCun
Multilayer Neural Nets and Deep Learning
Traditional Machine Learning: Feature Extractor (hand-engineered) → Trainable Classifier
Deep Learning: Low-Level Features → Mid-Level Features → High-Level Features → Trainable Classifier (every stage is trainable)
Y. LeCun
Multi-Layer Neural Nets
Multiple layers of simple units, e.g. ReLU(x) = max(x, 0)
Each unit computes a weighted sum of its inputs
The weighted sum is passed through a non-linear function
The learning algorithm changes the weights
(Figure: input image → weight matrix → hidden layer → … → output “This is a car”)
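To make these bullet points concrete, here is a minimal NumPy sketch of such a layer; the layer sizes and random weights are illustrative assumptions, not values from the talk.

```python
import numpy as np

def relu(x):
    # ReLU(x) = max(x, 0), applied elementwise
    return np.maximum(x, 0.0)

def layer(x, W, b):
    # Each unit computes a weighted sum of its inputs,
    # then passes it through the non-linearity.
    return relu(W @ x + b)

rng = np.random.default_rng(0)
x = rng.normal(size=16)                            # input (size is an assumption)
W1, b1 = rng.normal(size=(8, 16)), np.zeros(8)     # first weight matrix
W2, b2 = rng.normal(size=(4, 8)), np.zeros(4)      # second weight matrix
hidden = layer(x, W1, b1)                          # hidden layer
output = W2 @ hidden + b2                          # output scores
```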
Y. LeCun
Supervised Machine Learning = Function Optimization
(Diagram: input → function with adjustable parameters → objective function → error; example desired output “traffic light: -1”)
It's like walking in the mountains in a fog and following the direction of steepest descent to reach the village in the valley.
But each sample gives us a noisy estimate of the direction, so our path is a bit random.
Stochastic Gradient Descent (SGD): W_i ← W_i − η ∂L(W, X)/∂W_i
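A minimal sketch of the SGD update above on a toy least-squares problem; the quadratic loss, data, and learning rate are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))                     # toy inputs
w_true = np.array([1.0, -2.0, 0.5, 3.0, 0.0])
y = X @ w_true + 0.1 * rng.normal(size=1000)       # noisy targets

w = np.zeros(5)                                    # adjustable parameters W
eta = 0.01                                         # learning rate
for step in range(5000):
    i = rng.integers(len(X))                       # one sample -> noisy gradient estimate
    err = X[i] @ w - y[i]
    grad = err * X[i]                              # dL/dW for L = 0.5 * err**2
    w -= eta * grad                                # W_i <- W_i - eta * dL/dW_i
```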
Y. LeCun
Computing Gradients by Back-Propagation
A practical application of the chain rule.
Setup: a stack of modules X_i = F_i(X_{i−1}, W_i), with input X = X_0, desired output Y, and cost C(X, Y, Θ).
Backprop for the state gradients:
∂C/∂X_{i−1} = ∂C/∂X_i · ∂X_i/∂X_{i−1} = ∂C/∂X_i · ∂F_i(X_{i−1}, W_i)/∂X_{i−1}
Backprop for the weight gradients:
∂C/∂W_i = ∂C/∂X_i · ∂X_i/∂W_i = ∂C/∂X_i · ∂F_i(X_{i−1}, W_i)/∂W_i
(Diagram: a chain of modules F_1(X_0, W_1) … F_n(X_{n−1}, W_n) feeding the cost)
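A minimal NumPy sketch of these two recurrences for a chain of linear-plus-ReLU modules F_i(X_{i−1}, W_i) with a squared-error cost; the layer sizes and data are illustrative assumptions.

```python
import numpy as np

def forward(x0, Ws):
    xs, pre = [x0], []
    for W in Ws:
        s = W @ xs[-1]                     # weighted sum inside F_i
        pre.append(s)
        xs.append(np.maximum(s, 0.0))      # ReLU non-linearity
    return xs, pre

def backward(xs, pre, Ws, y):
    # Cost C = 0.5 * ||X_n - Y||^2
    dC_dX = xs[-1] - y                     # dC/dX_n
    dC_dWs = []
    for i in reversed(range(len(Ws))):
        dC_dS = dC_dX * (pre[i] > 0)               # back through the ReLU
        dC_dWs.append(np.outer(dC_dS, xs[i]))      # dC/dW_i = dC/dX_i . dF_i/dW_i
        dC_dX = Ws[i].T @ dC_dS                    # dC/dX_{i-1} = dC/dX_i . dF_i/dX_{i-1}
    return dC_dWs[::-1]

rng = np.random.default_rng(0)
Ws = [rng.normal(size=(8, 4)), rng.normal(size=(3, 8))]
xs, pre = forward(rng.normal(size=4), Ws)
grads = backward(xs, pre, Ws, rng.normal(size=3))  # one gradient per weight matrix
```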
Y. LeCun
1986-1996 Neural Net Hardware at Bell Labs, Holmdel
1986: 12x12 resistor array
Fixed resistor values
E-beam lithography: 6x6 microns
1988: 54x54 neural net
Programmable ternary weights 6 microns
On-chip amplifiers and I/O
1991: Net32k: 256x128 net
Programmable ternary weights
320GOPS, 1-bit convolver.
1992: ANNA: 64x64 net
ConvNet accelerator: 4GOPS
6-bit weights, 3-bit activations
Y. LeCun
Convolutional Network Architecture [LeCun et al. NIPS 1989]
Filter bank + non-linearity
Pooling
Filter bank + non-linearity
Pooling
Filter bank + non-linearity
Inspired by [Hubel & Wiesel 1962] &
[Fukushima 1982] (Neocognitron):
simple cells detect local features
complex cells “pool” the outputs of simple
cells within a retinotopic neighborhood.
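A minimal PyTorch sketch of this repeated filter-bank → non-linearity → pooling structure; the channel counts, kernel sizes, and input resolution are illustrative assumptions rather than the exact LeNet configuration.

```python
import torch
import torch.nn as nn

convnet = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=5), nn.Tanh(), nn.MaxPool2d(2),   # filter bank + non-linearity + pooling
    nn.Conv2d(8, 16, kernel_size=5), nn.Tanh(), nn.MaxPool2d(2),  # filter bank + non-linearity + pooling
    nn.Conv2d(16, 32, kernel_size=3), nn.Tanh(),                  # filter bank + non-linearity
)

x = torch.randn(1, 1, 32, 32)      # one grayscale image (resolution is an assumption)
features = convnet(x)              # feature maps after the three stages
```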
Y. LeCun
LeNet character recognition demo 1992
Running on an AT&T DSP32C (floating-point DSP, 20 MFLOPS)
Y. LeCun
Convolutional Network (LeNet5, vintage 1990)
Filters-tanh → pooling → filters-tanh → pooling → filters-tanh
Y. LeCun
ConvNets can recognize multiple objects
All layers are convolutional
The network performs simultaneous segmentation and recognition
Y. LeCun
Check Reader (AT&T 1995)
Check amount reader
ConvNet+Language Model
trained at the sequence level.
50% correct, 49% reject,
1% error (detectable later in the
process).
Fielded in 1996, used in many
banks in the US and Europe.
Processed an estimated 10% to
20% of all the checks written in
the US in the early 2000s.
[LeCun, Bottou, Bengio ICASSP1997]
[LeCun, Bottou, Bengio, Haffner 1998]
Y. LeCun
1996→2006: 2nd NN Winter! Few teams could train large NNs
Hardware was slow for floating point computation
Training a character recognizer took 2 weeks on a Sun or SGI workstation
A very small ConvNet by today’s standard (500,000 connections)
Data was scarce and NN were data hungry
No large datasets besides character and speech recognition
Interactive software tools had to be built from scratch
We wrote a NN simulator with a custom Lisp interpreter/compiler
SN [Bottou & LeCun 1988] → SN2 [1992] → Lush (open sourced in 2002).
Open sourcing wasn’t common in the pre-Internet days
The “black art” of NN training could not be communicated easily
SN/SN2/Lush gave us superpowers: tools shape research directions
Y. LeCun
Lessons learned #1
1.1: It’s hard to succeed with exotic hardware
Hardwired analog → programmable hybrid → digital
1.2: Hardware limitations influence research directions
It constrains what algorithm designers will let themselves imagine
1.3: Good software tools shape research and give superpowers
But require a significant investment
Common tools for Research and Development facilitate productization
1.4: Hardware performance matters
Fast turn-around is important for R&D
But high-end production models always take 2-3 weeks to train
1.5: When hardware is too slow, software is not readily available, or
experiments are not easily reproducible, good ideas can be abandoned.
The 2nd Neural Net
Winter (1995-2005)
& Spring (2006-2012)
The Lunatic Fringe and
the Deep Learning Conspiracy
Y. LeCun
Semantic Segmentation with ConvNet for off-Road Driving
DARPA LAGR program 2005-2009
[Hadsell et al., J. of Field Robotics 2009]
[Sermanet et al., J. of Field Robotics 2009]
(Figure panels, two examples: input image, stereo labels, classifier output)
Y. LeCun
LAGR Video
Y. LeCun
Semantic Segmentation with ConvNets (33 categories)
Y. LeCun
FPGA ConvNet Accelerator: NeuFlow [Farabet 2011]
NeuFlow: Reconfigurable Dataflow architecture
Implemented on Xilinx Virtex6 FPGA
20 configurable tiles. 150GOPS, 10 Watts
Semantic Segmentation: 20 frames/sec at 320x240
Exploits the structure of convolutions
NeuFlow ASIC [Pham 2012]
150GOPS, 0.5 Watts (simulated)
Y. LeCun
Driving Cars with Convolutional Nets
MobilEye
NVIDIA
The Deep Learning Revolution
State of the Art
Y. LeCun
Deep ConvNets for Object Recognition (on GPU)
AlexNet [Krizhevsky et al. NIPS 2012], OverFeat [Sermanet et al. 2013]
1 to 10 billion connections, 10 million to 1 billion parameters, 8 to 20 layers.
Y. LeCun
Error Rate on ImageNet
Depth inflation
(Figure: Anirudh Koul)
Y. LeCun
Deep ConvNets (depth inflation)
VGG
[Simonyan 2013]
GoogLeNet
[Szegedy 2014]
ResNet
[He et al. 2015]
DenseNet
[Huang et al 2017]
Y. LeCun
GOPS vs Accuracy on ImageNet vs #Parameters
[Canziani 2016]
ResNet50 and
ResNet100 are used
routinely in
production.
Each of the few
billion photos
uploaded on
Facebook every day
goes through a
handful of ConvNets
within 2 seconds.
Y. LeCun
Progress in Computer Vision
[He 2017]
Y. LeCun
Mask R-CNN: instance segmentation
[He, Gkioxari, Dollar, Girshick
arXiv:1703.06870]
ConvNet produces an object
mask for each region of
interest
Combined ventral and dorsal
pathways
Y. LeCun
RetinaNet, feature pyramid network
One-pass object detection
[Lin et al. ArXiv:1708.02002]
Y. LeCun
Mask-RCNN Results on COCO dataset
Individual
objects are
segmented.
Y. LeCun
Mask R-CNN Results on COCO test set
Y. LeCun
Real-Time Pose Estimation on Mobile Devices
Mask R-CNN
running on
Caffe2Go
Y. LeCun
Detectron: open source vision in PyTorch
[Link]
Y. LeCun
3D ConvNet for Medical Image Analysis
Segmentation of the femur from MR images
[Deniz et al. Nature 2018]
Y. LeCun
3D ConvNet for Medical Image Analysis
Y. LeCun
Applications of Deep Learning
Medical image analysis [Geras 2017] [Esteva 2017]
Self-driving cars [MobilEye]
Accessibility
Face recognition
Language translation
Virtual assistants*
Content understanding for: filtering, selection/ranking, search
Games [Mnih 2015]
Security, anomaly detection
Diagnosis, prediction
Science!
Y. LeCun
Lessons learned #2
2.1: Good results are not enough
Making them easily reproducible also makes them credible.
2.2: Hardware progress enables new breakthroughs
General-Purpose GPUs should have come 10 years earlier!
But can we please have hardware that doesn’t require batching?
2.3: Open-source software platforms disseminate ideas
But making platforms that are good for research and production is hard.
2.4: Convolutional Nets will soon be everywhere
Hardware should exploit the properties of convolutions better
There is a need for low-cost, low-power ConvNet accelerators
Cars, cameras, vacuum cleaners, lawn mowers, toys, maintenance robots...
New DL Architectures
With different hardware/software requirements:
Memory-Augmented Networks
Dynamic Networks
Graph Convolutional Nets
Networks with Sparse Activations
Y. LeCun
Augmenting Neural Nets with a Memory Module
Recurrent networks cannot remember things for very long
The cortex only remembers things for 20 seconds
We need a “hippocampus” (a separate memory module)
LSTM [Hochreiter 1997], registers
Memory networks [Weston et al. 2014] (FAIR), associative memory
Stacked-Augmented Recurrent Neural Net [Joulin & Mikolov 2014] (FAIR)
Neural Turing Machine [Graves 2014],
Differentiable Neural Computer [Graves 2016]
(Figure: a recurrent net coupled to a memory module)
Y. LeCun
Differentiable Associative Memory
Used very widely in NLP
MemNN, Transformer networks, ELMo, GPT, BERT, GPT-2, GLoMo
Essentially a “soft” RAM or hash table: keys K_i, values V_i, input (address) X
C_i = exp(K_i^T X) / Σ_j exp(K_j^T X)   (softmax over the dot products)
Y = Σ_i C_i V_i   (weighted sum of the values)
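A minimal PyTorch sketch of this soft key-value lookup; the key and value dimensions are illustrative assumptions.

```python
import torch

def associative_memory(X, K, V):
    # X: query/address (d,), K: keys (m, d), V: values (m, k)
    C = torch.softmax(K @ X, dim=0)   # C_i = exp(K_i^T X) / sum_j exp(K_j^T X)
    return C @ V                      # Y = sum_i C_i V_i

K = torch.randn(10, 16)   # 10 stored keys (sizes are assumptions)
V = torch.randn(10, 32)   # 10 stored values
X = torch.randn(16)       # input (address)
Y = associative_memory(X, K, V)
```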
Y. LeCun
Learning to synthesize neural programs for visual reasoning
[Link]
Y. LeCun
PyTorch: differentiable programming
Software 2.0:
The operations in a program are only partially specified
They are trainable parameterized modules.
The precise operations are learned from data, only the general structure
of the program is designed.
Dynamic computational graph
Automatic differentiation by recording a “tape” of operations and rolling it
backwards with the Jacobian of each operator.
Implemented in PyTorch 1.0, Chainer…
Easy if the front-end language is dynamic and interpreted (e.g. Python)
Not so easy if we want to run without a Python runtime...
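A minimal PyTorch sketch of differentiable programming: the control flow is ordinary Python, the graph is recorded at run time, and autograd rolls the tape backwards. The particular function is an arbitrary illustration.

```python
import torch

w = torch.randn(3, requires_grad=True)        # trainable parameter

def program(x, w):
    # Ordinary Python control flow: the recorded graph can change on every call
    y = x * w
    if y.sum() > 0:
        y = torch.tanh(y)
    for _ in range(int(x.abs().sum().item()) % 3):
        y = 0.5 * y
    return y.sum()

loss = program(torch.randn(3), w)
loss.backward()                               # the recorded tape is rolled backwards
print(w.grad)                                 # gradient w.r.t. the trainable parameter
```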
Y. LeCun
ConvNets on Graphs (fixed and data-dependent)
Graphs can represent: Natural
language, social networks, chemistry,
physics, communication networks...
Review paper: “Geometric deep learning: going
beyond euclidean data”, MM Bronstein, J Bruna, Y
LeCun, A Szlam, P Vandergheynst, IEEE Signal
Processing Magazine 34 (4), 18-42, 2017
[ArXiv:1611.08097]
Y. LeCun
Spectral ConvNets / Graph ConvNets
Regular grid graph → standard ConvNet
Fixed irregular graph → spectral ConvNet
Dynamic irregular graph → graph ConvNet
IPAM workshop:
[Link]
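A minimal sketch of one graph-convolution layer using the common normalized-adjacency propagation rule H' = ReLU(D^{-1/2}(A+I)D^{-1/2} H W); this particular rule (in the style of Kipf & Welling) is an illustrative choice, not necessarily the method used in the references above.

```python
import numpy as np

def graph_conv(H, A, W):
    # H: node features (n, d), A: adjacency matrix (n, n), W: weights (d, d_out)
    A_hat = A + np.eye(len(A))                    # add self-loops
    d_inv_sqrt = np.diag(1.0 / np.sqrt(A_hat.sum(axis=1)))
    A_norm = d_inv_sqrt @ A_hat @ d_inv_sqrt      # symmetrically normalized adjacency
    return np.maximum(A_norm @ H @ W, 0.0)        # propagate along edges, mix features, ReLU

A = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)   # toy 3-node path graph
H = np.random.default_rng(0).normal(size=(3, 4))
W = np.random.default_rng(1).normal(size=(4, 2))
H_next = graph_conv(H, A, W)                      # new features for the 3 nodes
```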
Y. LeCun
Sparse ConvNets: for sparse voxel-based 3D data
ShapeNet competition results [ArXiv:1710.06104]
Winner: Submanifold Sparse ConvNet
[Graham & van der Maaten arXiv 1706.01307]
PyTorch: [Link]
Y. LeCun
Lessons learned #3
3.1: Dynamic networks are gaining in popularity (e.g. for NLP)
Dynamicity breaks many assumptions of current hardware
Can’t optimize the compute graph distribution at compile time.
Can’t do batching easily!
3.2: Large-Scale Memory-Augmented Networks...
...Will require efficient associative memory/nearest-neighbor search
3.3: Graph ConvNets are very promising for many applications
Say goodbye to matrix multiplications?
Say goodbye to tensors?
3.4: Large Neural Nets may have sparse activity
How to exploit sparsity in hardware?
What About (Deep)
Reinforcement Learning?
It works great …
…for games and virtual environments
Y. LeCun
Reinforcement Learning works fine for games
RL works well for games
Playing Atari games [Mnih 2013], Go
[Silver 2016, Tian 2018], Doom [Tian
2017], StarCraft...
RL requires too many trials.
100 hours to reach the performance that
a human can reach in 15 minutes on
Atari games [Hessel ArXiv:1710.02298]
RL often doesn’t really work in the real
world
FAIR open-source Go player: OpenGo
[Link]
Y. LeCun
Pure RL is hard to use in the real world
Pure RL requires too many
trials to learn anything
it’s OK in a game
it’s not OK in the real world
RL works in simple virtual worlds that you can run faster than real time on many machines in parallel.
Anything you do in the real world can kill you
You can’t run the real world faster than real time
Y. LeCun
What are we missing to get to “real” AI?
What we can have:
Safer cars, autonomous cars
Better medical image analysis
Personalized medicine
Adequate language translation
Useful but stupid chatbots
Information search, retrieval, filtering
Numerous applications in energy, finance, manufacturing, environmental protection, commerce, law, artistic creation, games, …
What we cannot have (yet):
Machines with common sense
Intelligent personal assistants
“Smart” chatbots
Household robots
Agile and dexterous robots
Artificial General Intelligence (AGI)
How do Humans
and Animals Learn?
So quickly
Y. LeCun
Babies learn how the world works by observation
Largely by observation, with remarkably little interaction.
Photos courtesy of
Emmanuel Dupoux
Y. LeCun
Early Conceptual Acquisition in Infants [from Emmanuel Dupoux]
(Chart: age of acquisition over the first 14 months)
Perception: face tracking, biological motion
Actions: rational, goal-directed actions
Production: emotional contagion, proto-imitation, crawling, walking
Physics: gravity, inertia; stability, support; conservation of momentum
Objects: object permanence; solidity, rigidity; shape constancy; natural kind categories
Social-communicative: pointing, helping vs. hindering, false beliefs
Y. LeCun
Prediction is the essence of Intelligence
We learn models of the world by predicting
The Future:
Self-Supervised Learning
With massive amounts of data
and very large networks
Y. LeCun
Self-Supervised Learning
Predict any part of the input from any other part.
Predict the future from the past.
Predict the future from the recent past.
Predict the past from the present.
Predict the top from the bottom.
Predict the occluded from the visible.
Pretend there is a part of the input you don't know and predict that.
(Figure: a signal over time, with past, present, and future segments)
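A toy sketch of the last recipe: hide part of each input and train a network to predict the hidden part from the visible part. The architecture and the synthetic data are illustrative assumptions.

```python
import torch
import torch.nn as nn

net = nn.Sequential(nn.Linear(8, 64), nn.ReLU(), nn.Linear(64, 8))
opt = torch.optim.SGD(net.parameters(), lr=0.01)

for step in range(1000):
    first = torch.randn(32, 8)
    x = torch.cat([first, first.roll(1, dims=1)], dim=1)   # second half is a function of the first
    visible, hidden = x[:, :8], x[:, 8:]                    # pretend the second half is unknown
    loss = ((net(visible) - hidden) ** 2).mean()            # predict it from the visible part
    opt.zero_grad()
    loss.backward()
    opt.step()
```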
Y. LeCun
How Much Information is the Machine Given during Learning?
“Pure” Reinforcement Learning (cherry)
The machine predicts a scalar reward given once in a
while.
A few bits for some samples
Supervised Learning (icing)
The machine predicts a category or a few numbers
for each input
Predicting human-supplied data
10→10,000 bits per sample
Self-Supervised Learning (cake génoise)
The machine predicts any part of its input for any
observed part.
Predicts future frames in videos
Millions of bits per sample
Y. LeCun
Self-Supervised Learning: Filling in the Blanks
Y. LeCun
Self-Supervised Learning works well for text
Word2vec
[Mikolov 2013]
FastText
[Joulin 2016]
BERT
Bidirectional Encoder
Representations from
Transformers
[Devlin 2018]
Figure credit: Jay Alammar [Link]
Y. LeCun
But it doesn’t really work for high-dim continuous signals
Video prediction:
Multiple futures are possible.
Training a system to make a single prediction yields “blurry” results:
the average of all the possible futures.
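A tiny worked example of why a single squared-error prediction of an uncertain future is “blurry”: if the future is +1 or −1 with equal probability, the best single prediction under MSE is their average, 0, rather than either actual outcome. The numbers below are illustrative.

```python
import numpy as np

futures = np.random.default_rng(0).choice([-1.0, 1.0], size=10_000)   # two equally likely futures
candidates = np.linspace(-1.5, 1.5, 301)
avg_mse = [np.mean((c - futures) ** 2) for c in candidates]
print(candidates[np.argmin(avg_mse)])   # ~0.0: the average of the futures, not either of them
```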
Y. LeCun
The Next AI Revolution
THE REVOLUTION
WILL NOT BE SUPERVISED
(nor purely reinforced)
With thanks
To
Alyosha Efros
Learning Predictive Models
of the World
Learning to predict, reason, and plan,
Learning Common Sense.
Y. LeCun
Planning Requires Prediction
To plan ahead, we simulate the world
(Diagram: the world sends percepts to the agent and receives actions/outputs. Inside the agent: a world simulator producing predicted percepts and an inferred world state, an actor producing action proposals from the agent state, and a critic relating the predicted cost to the objective cost.)
Y. LeCun
Training the Actor with Optimized Action Sequences
1. Find action sequence through optimization
2. Use sequence as target to train the actor
Over time we get a compact policy that requires no run-time optimization
(Diagram: perception feeds the agent; the world simulator, actor, and critic are unrolled over several time steps.)
Y. LeCun
The Hard Part: Prediction Under Uncertainty
Invariant prediction: The training samples are merely representatives of a
whole set of possible outputs (e.g. a manifold of outputs).
(Figure: percepts and the hidden state of the world)
Y. LeCun
Faces “invented” by a GAN (Generative Adversarial Network)
Random vector → Generator Network → output image [Goodfellow NIPS 2014]
[Karras et al. ICLR 2018] (from NVIDIA)
Y. LeCun
Generative Adversarial Networks for Creation
[Sbai 2017]
Y. LeCun
Self-supervised Adversarial Learning for Video Prediction
Our brains are “prediction machines”
Can we train machines to predict the future?
Some success with “adversarial training”
[Mathieu, Couprie, LeCun arXiv:1511.05440]
But we are far from a complete solution.
Y. LeCun
Predicting Instance Segmentation Maps
[Luc, Couprie, LeCun, Verbeek ECCV 2018]
Mask R-CNN Feature Pyramid Network backbone
Trained for instance segmentation on COCO
Separate predictors for each feature level
Y. LeCun
Predictions
Y. LeCun
Long-term predictions (10 frames, 1.8 seconds)
Y. LeCun
Using Forward Models to Plan (and to learn to drive)
Overhead camera on
highway.
Vehicles are tracked
A “state” is a pixel
representation of a
rectangular window
centered around each
car.
Forward model is
trained to predict how
every car moves relative
to the central car.
steering and acceleration
are computed
Y. LeCun
Forward Model Architecture
Architecture: encoder, decoder (expander), and latent-variable predictor
Y. LeCun
Predictions
Y. LeCun
Learning to Drive by Simulating it in your Head
Feed initial state
Sample latent variable
sequences of length 20
Run the forward model
with these sequences
Backpropagate gradient of
cost to train a policy
network.
Iterate
No need for planning at
run time.
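A schematic sketch of this loop; forward_model, policy, and cost below are hypothetical stand-ins for the networks and the lane/proximity cost in the actual system, and all sizes are assumptions.

```python
import torch
import torch.nn as nn

STATE, ACTION, LATENT, HORIZON = 32, 2, 8, 20   # sizes are assumptions

# Hypothetical stand-ins for the trained forward model and the policy network
forward_model = nn.Sequential(nn.Linear(STATE + ACTION + LATENT, 64), nn.ReLU(), nn.Linear(64, STATE))
policy = nn.Sequential(nn.Linear(STATE, 64), nn.ReLU(), nn.Linear(64, ACTION))
opt = torch.optim.Adam(policy.parameters(), lr=1e-3)   # only the policy is updated

def cost(state):
    # Placeholder for the lane & proximity cost
    return (state ** 2).mean()

for it in range(100):
    s = torch.randn(STATE)                        # feed an initial state
    z = torch.randn(HORIZON, LATENT)              # sample a latent-variable sequence of length 20
    total = 0.0
    for t in range(HORIZON):                      # run the forward model "in your head"
        a = policy(s)
        s = forward_model(torch.cat([s, a, z[t]]))
        total = total + cost(s)
    opt.zero_grad()
    total.backward()                              # backprop the cost through the unrolled model
    opt.step()                                    # ...but only the policy network is updated
```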
Y. LeCun
Adding an Uncertainty Cost (doesn’t work without it)
Estimates epistemic
uncertainty
Samples multiple dropouts
in forward model
Computes variance of
predictions
(differentiably)
Train the policy network
to minimize the
lane&proximity cost plus
the uncertainty cost.
Avoids unpredictable
outcomes
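A sketch of the uncertainty penalty: run a dropout-equipped forward model several times, take the (differentiable) variance of the predictions, and add it to the task cost. The model below is a hypothetical stand-in and the sizes are assumptions.

```python
import torch
import torch.nn as nn

# Hypothetical dropout-equipped forward model (state + action -> next state)
forward_model = nn.Sequential(nn.Linear(34, 64), nn.ReLU(), nn.Dropout(p=0.2), nn.Linear(64, 32))
forward_model.train()                    # keep dropout active when sampling predictions

def uncertainty_cost(state_action, n_samples=8):
    preds = torch.stack([forward_model(state_action) for _ in range(n_samples)])
    return preds.var(dim=0).mean()       # high variance = epistemic uncertainty; stays differentiable

x = torch.randn(34)                      # concatenated state and action (sizes are assumptions)
penalty = uncertainty_cost(x)
# The policy is trained to minimize: lane & proximity cost + lambda * penalty
```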
Y. LeCun
Driving an Invisible Car in “Real” Traffic
Y. LeCun
Lessons learned #4
4.1: Self-Supervised learning is the future
Networks will be much larger than today, perhaps sparse
4.2: Reasoning/inference through minimization
4.3: DL hardware use cases
A. DL R&D: 32-bit FP, high parallelism, fast inter-node communication,
flexible hardware and software.
B. Routine training: 16-bit FP, some parallelism, moderate cost.
C. Inference in data centers: 8 or 16-bit FP, low latency, low power
consumption, standard interface.
D. Inference on embedded devices: low cost, low power, exotic number
systems?
AR/VR, consumer items, household robots, toys, manufacturing, monitoring,...
Y. LeCun
Speculations
Spiking Neural Nets, and neuromorphic architectures?
I’m skeptical…..
No spike-based NN comes close to state of the art on practical tasks
Why build chips for algorithms that don’t work?
Exotic technologies?
Resistor/Memristor matrices, and other analog implementations?
Conversion to and from digital kills us.
No possibility of hardware multiplexing
Spintronics?
Optical implementations?
Thank you