Deep Learning
Christian S. Perone
Jun 2016
Convolutional
Neural Networks
Architectural Zoo
Who
Christian S. Perone
Machine Learning Engineer
Software Engineer
Blog
http://blog.christianperone.com
Open-source
http://github.com/perone
Twitter @tarantulae
Agenda
– Traditional Architectures
– Siamese Networks
– Dense Predictions
– Video
– Music Recommendation
– Localization, Detection, Alignment
– Q & A
Convolutional
Neural
Networks
Traditional
Architectures
1
Traditional Building Blocks
Three main building blocks:
– Convolutional
– Pooling
– Fully Connected (Dense)
Image by Liu et al, 2016
Architecture Overview
Traditional Building Blocks
The input data
Color Image
(RGB)
28 pixels (width)
28 pixels (height)
3 channels
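As a rough illustration of the input described above, the image can be held as a height x width x channels array. This is a minimal sketch in plain Python; the helper `shape` and the all-zero pixel values are assumptions made for the example, not part of the slides.

```python
# A 28x28 RGB image stored as nested lists: image[row][col][channel].
HEIGHT, WIDTH, CHANNELS = 28, 28, 3
image = [[[0.0 for _ in range(CHANNELS)] for _ in range(WIDTH)]
         for _ in range(HEIGHT)]

def shape(tensor):
    """Return the dimensions of a regular nested list (hypothetical helper)."""
    dims = []
    while isinstance(tensor, list):
        dims.append(len(tensor))
        tensor = tensor[0]
    return tuple(dims)

# Total number of scalar values the first layer receives.
num_values = HEIGHT * WIDTH * CHANNELS
print(shape(image), num_values)
```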
(RGB)
Traditional Building Blocks
Convolutions
Animation by Vincent Dumoulin et al, 2016
Image by Apple
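The sliding-window operation shown in the animation can be sketched as follows. This assumes a single channel, no padding and stride 1, and (as is usual in CNN layers) computes a cross-correlation without flipping the kernel. The function name `conv2d` and the sample values are illustrative only.

```python
def conv2d(image, kernel):
    """Valid (no padding), stride-1 2D cross-correlation on nested lists."""
    ih, iw = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for i in range(ih - kh + 1):          # top-left row of the window
        row = []
        for j in range(iw - kw + 1):      # top-left column of the window
            acc = 0
            for di in range(kh):
                for dj in range(kw):
                    acc += image[i + di][j + dj] * kernel[di][dj]
            row.append(acc)
        out.append(row)
    return out

image = [[1, 2, 3, 0],
         [4, 5, 6, 0],
         [7, 8, 9, 0],
         [0, 0, 0, 0]]
kernel = [[1, 0],
          [0, -1]]
feature_map = conv2d(image, kernel)  # 4x4 input, 2x2 kernel -> 3x3 output
print(feature_map)
```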
Traditional Building Blocks
Pooling
Images by Karpathy, CS231n, Stanford, 2016
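Max pooling, the most common pooling variant, can be sketched as below. The 2x2 window with stride 2 is an assumption chosen for the example; other window sizes and average pooling follow the same pattern.

```python
def max_pool2d(fmap, size=2, stride=2):
    """Keep the maximum of each size x size window, moving by `stride`."""
    out = []
    for i in range(0, len(fmap) - size + 1, stride):
        row = []
        for j in range(0, len(fmap[0]) - size + 1, stride):
            window = [fmap[i + di][j + dj]
                      for di in range(size) for dj in range(size)]
            row.append(max(window))
        out.append(row)
    return out

fmap = [[1, 3, 2, 4],
        [5, 6, 7, 8],
        [3, 2, 1, 0],
        [1, 2, 3, 4]]
pooled = max_pool2d(fmap)  # 4x4 -> 2x2
print(pooled)
```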
Traditional Building Blocks
Fully Connected
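A fully connected layer flattens its input and computes one weighted sum per output unit. The sketch below assumes a tiny 2x2 feature map and made-up weights; `flatten` and `dense` are hypothetical helper names, and the non-linearity that normally follows is left out.

```python
def flatten(fmap):
    """Turn a 2D feature map into a flat vector."""
    return [value for row in fmap for value in row]

def dense(x, weights, bias):
    """y = W x + b, with one row of `weights` per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

x = flatten([[6, 8],
             [3, 4]])
weights = [[1.0, 0.0, 0.0, 0.0],   # illustrative values only
           [0.5, 0.5, 0.5, 0.5]]
bias = [0.0, 1.0]
y = dense(x, weights, bias)
print(y)
```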
Traditional Architecture
The canonical CNNs
Images by Rob Hess et al, 2016
Convolutional
Neural
Networks
Siamese
Networks
2
Architectural Zoo
The Siamese Architecture
Learning Hierarchies of Invariant Features. Yann LeCun.
Architectural Zoo
The Siamese Architecture
Learning visual similarity for product design with convolutional neural networks, Sean Bell et al
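The core idea of a Siamese network is that both inputs pass through the same branch (shared weights) and the resulting embeddings are compared by distance. The sketch below uses a linear map as a stand-in for the CNN branch and a contrastive loss as one common training objective. The weights, the margin and the function names are assumptions for illustration, not the exact setup of the cited papers.

```python
import math

def embed(x, weights):
    """Shared branch: the same weights are used for both inputs."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

def distance(a, b):
    """Euclidean distance between two embeddings."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def contrastive_loss(d, same, margin=1.0):
    """Pull matching pairs together; push non-matching pairs past the margin."""
    if same:
        return d ** 2
    return max(0.0, margin - d) ** 2

weights = [[1.0, 0.0],
           [0.0, 1.0]]            # hypothetical stand-in for a CNN
a = embed([0.0, 0.0], weights)
b = embed([3.0, 4.0], weights)
d = distance(a, b)
print(d, contrastive_loss(d, same=True), contrastive_loss(d, same=False))
```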
Architectural Zoo
The Siamese Architecture
Learning Deep Representations for Ground-to-Aerial Geolocalization, Tsung-Yi Lin et al. 2016.
Convolutional
Neural
Networks
Dense
Prediction
3
Architectural Zoo
Dense Prediction
Fully Convolutional Networks for Semantic Segmentation. Jonathan Long et al. 2015
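In a fully convolutional network the dense classifier is replaced by convolutions, so the same classifier is applied at every spatial position and the output is a label map rather than a single label. A minimal sketch, assuming a 2x2 feature map with 2 channels and 2 classes; the upsampling back to input resolution is omitted, and all names and values are illustrative.

```python
def conv1x1(features, weights):
    """Apply the same linear classifier to the channel vector of every pixel."""
    return [[[sum(w * f for w, f in zip(row, pixel)) for row in weights]
             for pixel in line]
            for line in features]

def argmax_map(scores):
    """Pick the highest-scoring class index at each position."""
    return [[max(range(len(pixel)), key=pixel.__getitem__) for pixel in line]
            for line in scores]

features = [[[1.0, 0.0], [0.0, 1.0]],
            [[0.2, 0.9], [0.8, 0.1]]]   # features[row][col][channel]
weights = [[1.0, 0.0],
           [0.0, 1.0]]                  # one row per class (made-up values)
labels = argmax_map(conv1x1(features, weights))
print(labels)
```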
Architectural Zoo
Dense Prediction
Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic Image Colorization with Simultaneous Classification. Satoshi Iizuka et al. 2016
Architectural Zoo
Dense Prediction
Learning to Simplify: Fully Convolutional Networks for Rough Sketch Cleanup. Edgar Simo-Serra et al. 2016.
Convolutional
Neural
Networks
Video
4
Architectural Zoo
Video
Large-scale Video Classification with Convolutional Neural Networks. Andrej Karpathy et al. 2014
Architectural Zoo
Video
Learning Spatiotemporal Features with 3D Convolutional Networks. Du Tran et al. 2015
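A 3D convolution slides its kernel over time as well as height and width, so the output keeps a temporal dimension. The sketch below only computes output shapes using the standard convolution size formula. The clip size (16 frames of 112x112) and the 3x3x3 kernel are illustrative assumptions.

```python
def conv_output_size(n, kernel, stride=1, padding=0):
    """Standard output-size formula along one dimension."""
    return (n + 2 * padding - kernel) // stride + 1

def conv3d_output_shape(clip_shape, kernel_shape, stride=1, padding=0):
    """Apply the formula to (frames, height, width)."""
    return tuple(conv_output_size(n, k, stride, padding)
                 for n, k in zip(clip_shape, kernel_shape))

clip = (16, 112, 112)                       # frames, height, width
out_shape = conv3d_output_shape(clip, (3, 3, 3))
same_shape = conv3d_output_shape(clip, (3, 3, 3), padding=1)
print(out_shape, same_shape)
```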
Convolutional
Neural
Networks
Music
Recommendation
5
Architectural Zoo
Music Recommendation
Recommending music on Spotify with Deep Learning. Sander Dieleman. 2014.
Convolutional
Neural
Networks
Localization
Detection
Alignment
6
Architectural Zoo
Localization, Detection and Alignment
Rich feature hierarchies for accurate object detection and semantic segmentation. Girshick et al. 2014.
Architectural Zoo
Localization, Detection and Alignment
Selective Search for Object Recognition. Uijlings et al. IJCV 2013
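Region proposals are typically compared with ground-truth boxes using intersection over union (IoU). A minimal sketch, assuming boxes are given as (x1, y1, x2, y2) corner coordinates; the function name `iou` is chosen for the example.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)   # zero if boxes are disjoint
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

print(iou((0, 0, 2, 2), (1, 1, 3, 3)))   # partial overlap
print(iou((0, 0, 1, 1), (2, 2, 3, 3)))   # no overlap
```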
Architectural Zoo
Localization, Detection and Alignment
Fast R-CNN. Ross Girshick. 2015.
Architectural Zoo
Localization, Detection and Alignment
Image by Fei-Fei Li & Andrej Karpathy & Justin Johnson. CS231n, 2016.
Fast R-CNN. Ross Girshick. 2015.
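Fast R-CNN pools each region of interest (RoI) from the shared feature map into a fixed-size grid, so regions of any size can feed the same fully connected layers. The sketch below is a simplified version that assumes integer RoI coordinates in feature-map cells and a single channel. The binning rule and names are assumptions for illustration.

```python
import math

def roi_max_pool(fmap, roi, out_size=2):
    """Max-pool the RoI (row0, col0, row1, col1), end-exclusive, to out_size x out_size."""
    r0, c0, r1, c1 = roi
    h, w = r1 - r0, c1 - c0
    out = []
    for i in range(out_size):
        rs = r0 + (i * h) // out_size                 # bin start (floor)
        re = r0 + math.ceil((i + 1) * h / out_size)   # bin end (ceil)
        row = []
        for j in range(out_size):
            cs = c0 + (j * w) // out_size
            ce = c0 + math.ceil((j + 1) * w / out_size)
            row.append(max(fmap[r][c]
                           for r in range(rs, re) for c in range(cs, ce)))
        out.append(row)
    return out

fmap = [[1, 2, 3, 4],
        [5, 6, 7, 8],
        [9, 10, 11, 12],
        [13, 14, 15, 16]]
print(roi_max_pool(fmap, (0, 0, 3, 3)))   # 3x3 region -> 2x2
print(roi_max_pool(fmap, (0, 0, 4, 4)))   # 4x4 region -> 2x2
```

Both regions end up as 2x2 outputs even though their sizes differ, which is the point of the operation.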
Architectural Zoo
Localization, Detection and Alignment
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Ren et al. 2015.
Image by Ross Girshick / Slide from CS231n, Fei-Fei Li & Andrej Karpathy & Justin Johnson. 2016.
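The Region Proposal Network scores a fixed set of reference boxes (anchors) at each feature-map position. A minimal sketch of anchor generation, assuming 3 scales and 3 aspect ratios (9 anchors per position). The specific scale and ratio values and the function name are illustrative.

```python
import math

def make_anchors(cx, cy, scales=(128, 256, 512), ratios=(0.5, 1.0, 2.0)):
    """Boxes (x1, y1, x2, y2) centred at (cx, cy); ratio = height / width."""
    anchors = []
    for s in scales:
        for r in ratios:
            w = s / math.sqrt(r)      # area stays s * s for every ratio
            h = s * math.sqrt(r)
            anchors.append((cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2))
    return anchors

anchors = make_anchors(300.0, 300.0)
print(len(anchors), anchors[1])
```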
Architectural Zoo
Localization, Detection and Alignment
Right Whale Recognition Challenge, winner report. Robert Bogucki, Deepsense.io. 2016.
Architectural Zoo
Localization, Detection and Alignment
Spatial Transformer Networks. Max Jaderberg et al. 2015.
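A spatial transformer predicts the parameters of a transformation (here a 2x3 affine matrix) and uses them to build a sampling grid over the input. The sketch below covers only the grid generator, with coordinates normalized to [-1, 1]. The localization network and the bilinear sampler are omitted, and the names and matrices are assumptions for the example.

```python
def affine_grid(theta, height, width):
    """For each output pixel, return the source (x, y) it samples from."""
    def norm(i, n):
        # Map index 0..n-1 to the range [-1, 1].
        return 0.0 if n == 1 else -1.0 + 2.0 * i / (n - 1)

    grid = []
    for r in range(height):
        row = []
        for c in range(width):
            x_t, y_t = norm(c, width), norm(r, height)
            x_s = theta[0][0] * x_t + theta[0][1] * y_t + theta[0][2]
            y_s = theta[1][0] * x_t + theta[1][1] * y_t + theta[1][2]
            row.append((x_s, y_s))
        grid.append(row)
    return grid

identity = [[1.0, 0.0, 0.0],
            [0.0, 1.0, 0.0]]
zoom = [[0.5, 0.0, 0.0],
        [0.0, 0.5, 0.0]]        # samples only the central half of the input
id_grid = affine_grid(identity, 3, 3)
zoom_grid = affine_grid(zoom, 3, 3)
print(id_grid[0][0], zoom_grid[0][0])
```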
