CS7.505: Computer Vision: Spring 2022

Uploaded by

Aryan Jain

CS7.505: Computer Vision
Spring 2022: Introduction
[Figure: fields related to computer vision: Graphics, Artificial Intelligence, Physics, Machine Learning, Mathematics, Neurobiology, Computing, Imaging]


Anoop M. Namboodiri
Biometrics and Secure ID Lab, CVIT,
IIIT Hyderabad
Course Outline, Topics
Computer Vision

Geometry
• Pinhole Camera Model
• Proj. Geometry, Camera Matrix
• Camera Calibration
• 2-View Geometry, Homography
• Fundamental Matrix
• Stereo Corr., Depth Estimation
• SFM and Bundle Adjustment
• Image Rectification
• Computational Imaging

Image Grouping
• Segmentation as Labelling
• Graphcut, Binary Segmentation
• MRF for Segmentation
• Multi-label MRFs
• Image-to-Image Networks, Segm
• Monocular Depth Estimation

Recognition
• Feature Detection, Descriptors
• Face Detection, Recognition
• Pedestrian Detection (HoG, SVM)
• Bag of Words, SURF, Others
• Indexing and Retrieval
• CNNs for Recognition
• CNN Training, Transfer Learning
• CNNs for Detection
What about Deep Learning?
• DL has become the primary driving force behind most recent
success in CV. However, this is the first course on Computer Vision.
So we will limit the amount of DL in this course.
• Computer vision has a strong mathematical and conceptual basis
developed over 4 decades
• Geometry
• Optimization
• Visual object representations
• Optics, Lighting, Appearance models
• You need to know the basics to build on it
Pre-Requisites for the Course
• Linear algebra and a good mathematical outlook
• Vectors, matrices, eigenvalues, singular values
• 2D/3D geometry
• See course page for a more detailed list of topics
• Image/Signal processing
• Filtering, edge detection, segmentation
• Transforms, analysis
• Pattern Analysis, Algorithms, Programming
• Features, classifiers
• Training, testing, validation
• Python/C++, OpenCV
Brush up on these topics if you are unsure of them. A reading list of
online material will be prepared for the preliminaries.
Reference Books
No single textbook

• Forsyth & Ponce (Indian Edition)
• Hartley & Zisserman (Indian Edition)
• Rick Szeliski (PDF Online)
• Kevin P. Murphy (PDF Online)

… and several papers and resources.

Administrivia
• Grade Distribution
• Quizzes: Q1 + Q2 (~16%)
• Exams: MidTerm + Final (~34%)
• Homeworks/Assignments: (~25%)
• Project: In groups of 3 (~25%)
• This is an advanced elective that you opted for
• We expect you to work hard to learn well.
• Class participation lifts the level of the class
• We don’t want credit-seekers or resume-padders here
• Mode of Classes
• The classes will be conducted in person as long as the pandemic allows us
Class Etiquette
• Be in the class before 2pm
• Keep your cellphones switched off. Those messages can
wait.
• Reduce noise in the class (online and offline)
• Switch off your cameras, microphones
• Put your hand up if you have a question
• If online, you may also type your questions in the chat
• If you have a doubt, ask. Others are also likely to have the
same doubt.
• A significant amount of learning comes from questions
asked by participants. So please listen to the lecture and
to other participants’ questions.
What is Computer Vision?
• Understanding of visual inputs (images/videos) by computers.
• Making sense out of them. Describing them.
• Does computer vision mimic human vision?
• Certainly in many of its goals
• Why? Human vision is among the best!
• Sophisticated and efficient but not understood well
• Should computers process visual inputs like humans?
Not necessarily!
• Human visual system need not limit computer vision
• We draw inspiration from it as often as is convenient
Human perception is not perfect…
Copyright A.Kitaoka 2003
Three “Urges” on seeing a Picture*
• Given an image, you want to do:
• Segmentation: Group proximate and similar parts into meaningful regions
• Recognition: Recollect previously seen objects from memory
• Reconstruction: Measure quantitative aspects: Number, Size, Distance, etc.
*Jitendra Malik; Mysore Park, Dec. 2011
The Three Rs of Computer Vision
• Reorganization (Segm.): Group semantically similar pixels
• Recognition: Connecting what we see to our memory
• Reconstruction: Measure/recreate a 3D model of what we see in the world

Why is it Difficult?

90 126 180 120 102 131 126 91

82 140 143 182 180 142 138 81
81 141 148 195 188 147 140 80
75 144 150 210 198 149 141 73
71 144 151 241 214 150 143 70
88 142 147 236 205 146 141 85
106 139 142 225 197 141 138 101
128 135 139 184 180 138 132 121
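
To a program, the picture is nothing more than this grid of numbers. The sketch below is only an illustration: it assumes the values are 8-bit grayscale intensities (the slide does not say so), and `patch` and `brightest_pixel` are hypothetical names, not part of the course material.

```python
# The 8x8 grid of values from the slide, stored as a list of rows.
# Assumption: each number is a grayscale intensity in the range 0..255.
patch = [
    [90, 126, 180, 120, 102, 131, 126, 91],
    [82, 140, 143, 182, 180, 142, 138, 81],
    [81, 141, 148, 195, 188, 147, 140, 80],
    [75, 144, 150, 210, 198, 149, 141, 73],
    [71, 144, 151, 241, 214, 150, 143, 70],
    [88, 142, 147, 236, 205, 146, 141, 85],
    [106, 139, 142, 225, 197, 141, 138, 101],
    [128, 135, 139, 184, 180, 138, 132, 121],
]


def brightest_pixel(image):
    """Return (row, col, value) of the largest value in the grid."""
    best = (0, 0, image[0][0])
    for r, row in enumerate(image):
        for c, value in enumerate(row):
            if value > best[2]:
                best = (r, c, value)
    return best


print(brightest_pixel(patch))  # (4, 3, 241)
```

Finding the brightest sample is easy; deciding what object those samples belong to is the hard part the slide title alludes to.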
Scene Interpretation
Segmentation and Labeling
1. Hand-carved Shesham wooden screen
2. Wooden flowers
3. Wicker basket
4. Pair of hand-carved Thai candlesticks
5. Indonesian rattan screen
6. Dhurry covered armchair
7. Hand-painted chest
8. Striped wooden Indian candlestick
9. Stone terracotta Thai
10. Moroccan ceramic candlestick
11. Blue Egyptian glass decanter
12. Bronze goblet-shaped candlesticks
13. Painted wooden Indian elephant
14. Blue Egyptian glass goblets
15. Indian brass filigree box
16. Painted Indian oil bottle
17. Large African water pot
18. Philippino twig basket
19. Philippino bamboo covered urn
20. African cooking pot
21. Decoy bird
22. Painted candlestick
23. Thai wooden swan
24. Carved wooden duck
25. Embroidered mirror cushion covers
26. Green hexagonal Indian box
27. Painted Indian oil bottle
28. Joint wooden snake
29. Black embroidered cushion
30. Moroccan ceramic jar
31. Painted wooden candlestick
32. Thai pot with lid
33. Octagonal Indian box
34. Shallow twig baskets
35. Mexican paper mache fake fruit and vegs
36. Nakshe Kantha Bengali wall-hanging
37. Wooden shell bowl
38. Wooden servers
Computer Vision
• Goal: Extract all possible information about a visual scene by
computer processing
What? When? Where? Who? How? Why? How many?
• Over 50% of the brain is devoted to vision for humans.
– Must be important to us!
• Why is it difficult?
Chairs and Chairs
• Which are chairs?
• Large intra-class variations
• How do we describe a chair?
• Basic property: Sittability!
• We infer a lot from pictures.
Can we instruct a computer
to do the same?
• Do we understand how we
infer?
Applications: Medical

CT Scan

Computer Assisted Surgery

Segmentation
Applications: Space Imaging

Ikonos

Rio Negro (black) meets Amazon (blue)

Applications: Automated Inspection

Manual PCB Inspection
Automated PCB Inspection

Applications: Biometrics

Travel
Computer Access
Disneyland

Applications: Broadcasting

Chroma Keying: Replacing Backgrounds

Field Understanding: Virtual Line

Ball Tracking: Hawk Eye
Player Tracking: CVIT, IIITH

3D Shape and Motion Recovery
• Structured-light scanner, laser
range finder
• Multi-camera stereo, structure
recovery
• Reverse Engineering
• Virtualized/Augmented reality
Applications: Others
• Surveillance
• Automated Assembly
• Mail Sorting
• Face detection (photography)
• Robot Navigation
• Content-Based Image Retrieval
• Entertainment
• And many more… with your help…
Why Automated Vision?
1. High reliability
2. High repeatability
3. More objective evaluation
4. Lower cost
5. Higher speed
6. Ability to operate in hazardous environments

General-purpose machine vision systems do not exist.

Recent: Structure from Motion

• Approximate 3D structure from an unstructured collection of images!

[PhotoTourism, SIGGRAPH2006]
• PhotoSynth
• Autodesk 123D: Your pictures to model
• And many more to follow soon
Recent: Natural Gaming

Microsoft Kinect

• You are the controller. Interact naturally with the game.

• Fastest Selling Electronic Device Ever: 80 lakh (8 million) units in 60 days!
• Finding great use in Computer Vision, Robotics, etc.
Recent: Automotive Safety

Can help avoid accidents greatly!

The Real Problem

Develop something similar for Indian roads!

What More is Possible?
• Much much more .....
• The journey has just begun for computer vision.
• Large amounts of data, high computing power, and machine
learning algorithms continue to transform computer vision.
• Big things are yet to come.
Questions?
M1 Geometry: Imaging and Camera Model
The Pinhole Camera

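A scene point at height $Y$ and depth $Z$ is imaged at height $y$ on the image plane, where $f$ is the distance from the pinhole to the image plane:

$y = f \, \dfrac{Y}{Z}$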
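
The pinhole relation can be tried out numerically. This is a minimal sketch, not the course's implementation: the slide gives only the vertical coordinate, so the horizontal one (x = f X / Z) is added here by analogy, the image inversion (sign flip) of a physical pinhole is ignored, and `project_pinhole` is a hypothetical name.

```python
def project_pinhole(X, Y, Z, f):
    """Project a 3D point given in camera coordinates onto the image plane.

    Applies y = f * Y / Z from the slide, and the analogous x = f * X / Z
    (an assumption: the slide shows only the y coordinate).
    """
    if Z <= 0:
        raise ValueError("the point must lie in front of the camera (Z > 0)")
    return (f * X / Z, f * Y / Z)


# A point 1 m above the optical axis and 2 m away, with f = 0.05 m (50 mm).
x, y = project_pinhole(0.0, 1.0, 2.0, 0.05)
print(x, y)  # 0.0 0.025
```

Doubling the depth Z halves the image height y, which is why distant objects appear smaller.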
Camera with Lens
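Thin lens equation ($d_o$: object distance, $d_i$: image distance, $f$: focal length):

$\dfrac{1}{f} = \dfrac{1}{d_o} + \dfrac{1}{d_i} \quad\Rightarrow\quad d_i = \dfrac{f \, d_o}{d_o - f}$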
Focus and DOF
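
Focusing amounts to placing the sensor at the image distance d_i that the thin lens equation gives for the object distance d_o. The sketch below solves the equation for d_i; `image_distance` is a hypothetical helper name and the numbers are illustrative only.

```python
def image_distance(d_o, f):
    """Solve the thin lens equation 1/f = 1/d_o + 1/d_i for d_i."""
    if d_o == f:
        raise ValueError("object at the focal distance: image forms at infinity")
    return f * d_o / (d_o - f)


# A 50 mm lens focused on an object 1000 mm away.
d_i = image_distance(1000.0, 50.0)
print(round(d_i, 2))  # 52.63

# An object at twice the focal length is imaged at twice the focal length.
print(image_distance(100.0, 50.0))  # 100.0
```

As d_o grows, d_i approaches f, so distant scenes come into focus with the sensor roughly one focal length behind the lens.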
Aperture
do di

Focal Ratio (f-number) = f / d, where f is the focal length and d is the aperture diameter
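
As a small worked example of the focal ratio, the sketch below computes it from the focal length and aperture diameter, and inverts it. The function names are hypothetical and the lens values are made up for illustration.

```python
def focal_ratio(f, d):
    """Focal ratio (f-number) = focal length / aperture diameter."""
    if d <= 0:
        raise ValueError("aperture diameter must be positive")
    return f / d


def aperture_diameter(f, n):
    """Aperture diameter that gives focal ratio n for focal length f."""
    return f / n


print(focal_ratio(50.0, 25.0))       # 2.0, usually written f/2
print(aperture_diameter(50.0, 8.0))  # 6.25 (mm, for a 50 mm lens at f/8)
```

A larger focal ratio means a smaller opening for the same focal length.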
Aperture vs. DOF

Object Distance (do)

Aperture (d)
Geometric Distortions
[Figure panels: original, pincushion, barrel]
Lens Flare
Chromatic Aberration
Ordinary lenses refract different wavelengths by different amounts
Sampling an Image: Resolution
Resolution
• The number of samples in an image (number of sensor elements) is referred to
as its resolution
• The resolution is typically represented as the product of number of samples in
the horizontal and vertical directions in the image. e.g.: 32x32, 256x256,
640x480

Common Resolutions:
• NTSC: 648 x 486
• Typical Webcam: 1280 x 720
• High-end SLR: 11,648 x 8,736 *
• Hubble Space Telescope: 1,600 x 1,600
* Fujifilm GFX100
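
Since resolution is quoted as horizontal times vertical samples, the total sample count follows by multiplication. The sketch below does this for the resolutions listed on the slide; the dictionary and function names are illustrative.

```python
# Resolutions as (horizontal samples, vertical samples), taken from the slide.
resolutions = {
    "NTSC": (648, 486),
    "Typical webcam": (1280, 720),
    "High-end SLR (Fujifilm GFX100)": (11648, 8736),
    "Hubble telescope": (1600, 1600),
}


def pixel_count(width, height):
    """Total number of samples in a width x height image."""
    return width * height


for name, (w, h) in resolutions.items():
    n = pixel_count(w, h)
    print(f"{name}: {w} x {h} = {n:,} samples ({n / 1e6:.2f} megapixels)")
```

The high-end camera works out to roughly 102 megapixels, more than 300 times the NTSC frame.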
Camera Model: Objectives
• Mathematically model what a camera does
• Also understand what the model means
• Getting the model for a real-world camera
• Estimation from real world measurements
• Special imaging configurations with simpler properties
• Simpler relationships
• General theory on fitting linear models under noisy observations
• Techniques that work across problems
What does a Camera do?
• Form an image on the 2D image
plane of the 3D world visible to it.
• Image is behind the lens; the
scene is in front.
• 3D world is projected down to a
2D plane.
• Significant loss of information as
one dimension is dropped.
• Mathematical depiction of this
projection ...
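
The loss of information can be seen with the pinhole relation y = f Y / Z from earlier in the deck: every 3D point along one ray through the camera centre lands on the same image point, so depth cannot be recovered from a single image point. A minimal sketch, with the hypothetical helper `project` and made-up coordinates:

```python
def project(point, f=1.0):
    """Pinhole projection of (X, Y, Z) to (f*X/Z, f*Y/Z); assumes Z > 0."""
    X, Y, Z = point
    return (f * X / Z, f * Y / Z)


near = (1.0, 2.0, 4.0)
far = (2.0, 4.0, 8.0)  # same direction from the camera, twice as far away

print(project(near))  # (0.25, 0.5)
print(project(far))   # (0.25, 0.5)
```

Both points produce the same image coordinates, which is the ambiguity that later topics such as stereo and structure from motion try to resolve with additional views.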
Questions?
