assignment1

Uploaded by

Stella

ASSIGNMENT #1: HUMAN VISION AND BASIC IMAGE PROCESSING

Due January 22, 2008 (in lecture)

Reflection Ideation Exercise Bonus Challenge

The Man Who Mistook His Wife for a Hat (6 Points)

When Oliver Sacks has his initial encounter with “the man who mistook his wife for a hat,” he explains
that Dr P. did not look at him in the normal way, but rather “made sudden strange fixations – on my
nose, on my right ear, down to my chin, up to my right eye – as if noting (even studying) these individual
features, but not seeing my whole face, its changing expressions, 'me' as a whole.”

Dr P. could experience the world only as small individual features. He was unable to group these low-
level features into high-level constructs. Sacks writes that he “had no sense whatever of a landscape or
a scene,” and when it came to recognizing people, “in the absence of obvious ‘markers,’ he was utterly
lost.” In many ways, Dr P. functioned like a computer, construing the world “by means of key features
and schematic relationships… without the reality being grasped at all.”

What tasks could Dr P. still accomplish by perceiving the world in this way? What tasks presented him
with the most difficulty? What does this suggest about the capabilities of computer vision?

Vision by Man and Machine (6 Points)

Poggio describes the layout of ganglion cells in the retina as the biological equivalent of a center-
surround filter, “approximating the Laplacian of a Gaussian.” From the readings, the lecture material, or
from an outside source, cite two additional examples of components or processes in human vision that
have equivalents in Computer Science or Electrical Engineering. These could be particular
mathematical equations, algorithms, or electrical circuits, for example. How are the biological and
electronic implementations of these techniques the same, and how do they differ? Is one
implementation more flexible or robust than the other?
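
As a rough illustration of the center-surround filter in Poggio's description, the following MATLAB sketch builds a small Laplacian-of-Gaussian kernel by hand and applies it to a synthetic step edge. The kernel radius, the value of sigma, and the test image are assumptions chosen only for illustration; they do not come from the readings.

```matlab
% Sketch: a Laplacian-of-Gaussian (center-surround) kernel built by hand.
sigma = 1.4;                      % assumed Gaussian width, in pixels
r = 4;                            % assumed kernel radius (gives a 9x9 kernel)
[x, y] = meshgrid(-r:r, -r:r);
g = exp(-(x.^2 + y.^2) / (2 * sigma^2));
g = g / sum(g(:));                % normalized Gaussian
logKernel = g .* (x.^2 + y.^2 - 2 * sigma^2) / sigma^4;
logKernel = logKernel - mean(logKernel(:));   % force the weights to sum to zero

% Synthetic test image: dark left half, bright right half.
img = zeros(32, 32);
img(:, 17:end) = 1;

% 'valid' keeps only outputs where the kernel fits entirely inside the image.
response = conv2(img, logKernel, 'valid');

% Uniform regions give (near) zero response; the edge gives a
% positive/negative pair with a zero crossing between them.
disp(response(1, :));
```

With this sign convention the center weight is negative and the surround is positive; negating the kernel gives the complementary on-center arrangement.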

The Case of the Colorblind Painter (4 Points)

Land’s “Mondrian” experiments, described in the reading, used two identical displays of sheets of
colored paper mounted on boards. Each “Mondrian” was illuminated with its own set of three
projectors. These projectors used bandpass filters and brightness controls to carefully adjust the
intensities and wavelengths of light striking each board. A telescopic photometer could be pointed at
any area to measure the flux, one wave band at a time.

In a typical experiment, the illuminators were adjusted so that an area of the Mondrian at the left and
some differently colored area of the Mondrian at the right were both sending the same triplet of
radiant energies to the eye. In one such example, the eye receives exactly the same triplet of long-,
middle-, and short-wave energies from the red area (left) and the blue area (right).

In this experiment, what color sensations would a normal subject consciously perceive? What does this
say about the relationship between reflectance and illumination of the objects in our world (the energy
reaching our eyes) and the sensation of color?
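
The setup can be made concrete with a little arithmetic. The sketch below assumes a simplified model in which the flux in each wave band is the illuminant intensity in that band multiplied by the paper's reflectance in that band. All reflectance and target values are hypothetical numbers chosen for illustration, not measurements from Land's experiments.

```matlab
% Sketch: two different papers can send the same energy triplet to the eye.
% Simplified model (an assumption): flux in each wave band =
%   illuminant intensity in that band * paper reflectance in that band.
reflectRed  = [0.70 0.20 0.10];   % hypothetical [long middle short] reflectances
reflectBlue = [0.10 0.25 0.60];   % hypothetical values for a second paper
target      = [5.0 5.0 5.0];      % arbitrary triplet both areas should send

% Projector intensities needed for each board, band by band.
illumLeft  = target ./ reflectRed;
illumRight = target ./ reflectBlue;

% Energy actually reaching the eye from each area.
fluxLeft  = illumLeft  .* reflectRed;
fluxRight = illumRight .* reflectBlue;

fprintf('Left illuminant:  %s\n', mat2str(illumLeft, 4));
fprintf('Right illuminant: %s\n', mat2str(illumRight, 4));
fprintf('Flux from left:   %s\n', mat2str(fluxLeft, 4));
fprintf('Flux from right:  %s\n', mat2str(fluxRight, 4));
```

The two illuminants come out very different, yet the triplets reaching the eye are identical, which is the condition the experiment sets up.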

Digital Video Capture (8 Points)

Begin this exercise by capturing a short segment of digital video (2-5 minutes in length) of an
interesting environment or sequence of events in the real world. This could be the view out your
apartment window, what you see when riding your bicycle between classes, footage of a sporting
event, video of your pet or your friends… the possibilities are unlimited. Just make sure that there is
something interesting going on in the video (no videos of blank walls). Try to come up with an example
where you might appreciate using a computer as an “extra set of eyes” to watch things for you – for
example, warning you when a pot is boiling over in the kitchen, or when someone has removed a book
from your bookshelf. Here are some examples of the type of video you might create:

http://cs377s.stanford.edu/assignments/sushi.avi
http://cs377s.stanford.edu/assignments/skateboard.avi
http://cs377s.stanford.edu/assignments/crosswalk.avi

To play these videos, you may need to install the DivX codec, available from http://divx.com/.

You can capture your video in any of the following ways:

1. Many handheld point-and-shoot digital cameras are capable of capturing short, low-
resolution video clips directly in AVI, MPG, or MOV format. If you or one of your friends
has a digital camera with this capability, you can use it to capture a video.
2. Most USB webcams will allow you to capture digital video, using a program like Windows
Movie Maker or VB VidCap. If you have a USB webcam, you can capture your video this
way (though it will restrict you to capturing your video in the same location as your
computer).
3. If you have a handheld DV camera with a Firewire connection, you can use it to record
video, and copy the video to your computer using a program like Adobe Premiere,
Windows Movie Maker, or iMovie. If you have access to a video camera but no way to get
the video onto a computer, you can bring your camera to office hours to have your video
transferred to a computer.

a) Once you have captured your video, watch it a few times and note the frames that
contain the events or information that you are interested in. For example, you might create a
list of events like this:

Timecode    Event
0:35        A Train arrives
0:44        A Train departs
2:23        C Train arrives
2:56        C Train departs
3:33        A Train arrives

If you are interested in a continuous parameter or value rather than discrete events, you can
create a graph instead, like this:

[Figure: line graph titled "Amount of Coffee in Coffeepot". Vertical axis: Amount of Coffee (oz), 0 to 60. Horizontal axis: Time (sec), 0 to 420.]

b) From what you know so far about human visual processing, how is your visual system picking
out the event, value, or object of interest in your video?
c) From what little you know about computer vision, how might the same event, value, or object
be extracted by a computer algorithm? If you’re not sure, just hazard your best guess.
d) Post your video online, preferably by uploading the original video file to your personal
Stanford web space. If you have difficulty doing this, you can use a video sharing site like
YouTube instead, but the original source video is preferred. Include the URL of the video in
your assignment hand-in.
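
For part (c), one very simple starting point is frame differencing: compare each frame with the previous one and flag frames where the picture changes a lot. The sketch below is only one possible guess at an approach, and it runs on a synthetic stack of frames so that it is self-contained. The frame size, the threshold, and the variable names are all assumptions. For a real clip, the frames could be read with MATLAB's VideoReader and readFrame instead.

```matlab
% Sketch: flag frames where the picture changes a lot (frame differencing).
% Synthetic stand-in for a video: five 48x64 grayscale frames (values 0-255).
frames = zeros(48, 64, 5);
frames(10:20, 10:20, 4:5) = 255;   % a bright object appears in frame 4

changeThreshold = 5;   % assumed mean absolute change per pixel; tune per video

% Mean absolute difference between each frame and the one before it.
frameDiffs = abs(diff(frames, 1, 3));
scores = squeeze(mean(mean(frameDiffs, 1), 2));

% scores(k) compares frame k+1 with frame k, hence the +1.
eventFrames = find(scores > changeThreshold) + 1;
disp(eventFrames);
```

A detector this crude also fires on camera shake and lighting changes, so it is best treated as a baseline to compare against your hand-marked event list.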

MATLAB Introduction (6 Points)

This exercise is a very basic introduction to image processing in MATLAB that will prepare you for a
more detailed walkthrough in lecture.

a) Your first task is to learn how to run MATLAB, preferably on your own computer. You can run
MATLAB in one of four ways:
1. MATLAB is already installed on the machines in the Myth cluster (Gates B08). You can
complete this exercise on one of the Myth machines, but you will not be able to follow
along during the in-class tutorial. Once you log in, type matlab at any prompt to begin.
2. You can run MATLAB remotely, but display it on your machine, using the computers in
Stanford’s Remote Computing facility. Information on the facility is available here:
http://www.stanford.edu/services/unixcomputing/environments.html#remote
Instructions for remotely running X-Windows programs such as MATLAB can be found
here:
http://www.stanford.edu/services/unix/moreX.html
3. You can run an online trial of MATLAB in your web browser at the MathWorks website:
http://www.mathworks.com/programs/trials/online_trials/index.html
However, your trial is limited to two hours in length, and you will not be able to upload,
save, or print your work, so this method is not recommended.
4. If you are a member of the Stanford Graphics Lab, you can install MATLAB on your
personal machine and use the Graphics Lab license server. The installation files are shared
as:
\\blur\Matlab installers\
License files and installation instructions are available here:
http://graphics.stanford.edu/lab/soft/matlab/
b) Watch the six-minute demo video entitled “Introduction to the Image Processing Toolbox,”
available on the MathWorks website:
http://www.mathworks.com/products/image/demos.html
This video gives you an introduction to some of the image processing capabilities in MATLAB.
c) In this exercise we will write a MATLAB routine to detect round objects in an image. Begin by
downloading the following image:
http://cs377s.stanford.edu/assignments/tennis.jpg
Load the image into MATLAB with the following commands:
RGB = imread('tennis.jpg');
imshow(RGB);
d) Now follow the step-by-step instructions in the MATLAB demo entitled “Identifying Round
Objects,” available here:
http://www.mathworks.com/products/demos/shipping/images/ipexroundness.html
Since you have already loaded your image, you can begin from Step 2 of the instructions.
e) How well does this algorithm perform? Where does it break down?
f) See if you can adjust any of the parameters in the sequence of commands to improve the
output. For example, the threshold is chosen automatically using the commands
threshold = graythresh(I);
bw = im2bw(I,threshold);
But you could also choose a threshold manually, like this:
bw = im2bw(I,0.5);
g) Save your output image as a JPEG file. Include a printout of your output image with your
assignment hand-in, and explain any changes you made to the code in order to produce it.
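
To experiment with thresholds and roundness without the tennis image, the following sketch runs on a synthetic image containing one disk and one rectangle. It assumes the Image Processing Toolbox is installed. The image contents and the manual threshold are made up for illustration, and the roundness measure 4*pi*area/perimeter^2 is a common choice used here as an assumption rather than a transcription of the demo's code.

```matlab
% Sketch: threshold a synthetic image, then score each object's roundness.
% Requires the Image Processing Toolbox.
[x, y] = meshgrid(1:200, 1:200);
I = 0.2 * ones(200, 200);                        % dark background
I((x - 60).^2 + (y - 100).^2 <= 40^2) = 0.9;     % bright disk
I(60:140, 120:180) = 0.6;                        % dimmer rectangle

autoLevel = graythresh(I);      % Otsu's method, returns a level in [0, 1]
fprintf('Automatic threshold: %.3f\n', autoLevel);

bw = im2bw(I, 0.5);             % manual threshold, as in step (f)

stats = regionprops(bw, 'Area', 'Perimeter');
metrics = zeros(numel(stats), 1);
for k = 1:numel(stats)
    % 4*pi*area / perimeter^2 is 1 for an ideal circle and smaller for
    % other shapes; pixelated perimeters make the value approximate.
    metrics(k) = 4 * pi * stats(k).Area / stats(k).Perimeter^2;
    fprintf('Object %d: roundness %.2f\n', k, metrics(k));
end
```

Raising the manual threshold above 0.6 drops the rectangle from the binary image, which is a quick way to see how sensitive the result is to that one parameter.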

Where’s Waldo? (6 Points)

This is an open-ended exercise based on the “Where’s Waldo?” book series. You are asked to create an
automated “Waldo Detector” using image processing techniques in MATLAB.

We provide the following series of images for you to analyze:

http://cs377s.stanford.edu/assignments/waldo/wheresWaldo1.jpg
http://cs377s.stanford.edu/assignments/waldo/wheresWaldo2.jpg
http://cs377s.stanford.edu/assignments/waldo/wheresWaldo3.jpg
http://cs377s.stanford.edu/assignments/waldo/wheresWaldo4.jpg

In each image, Waldo is hiding in the midst of a busy crowd. Waldo always wears the same red and
white striped sweater and hat. However, he may be carrying a stack of books that vary from scene
to scene. Here are a few sample images of Waldo that you may find helpful in solving this problem:

http://cs377s.stanford.edu/assignments/waldo/waldo0.jpg
http://cs377s.stanford.edu/assignments/waldo/waldo1.jpg
http://cs377s.stanford.edu/assignments/waldo/waldo2.jpg
http://cs377s.stanford.edu/assignments/waldo/waldo3.jpg
http://cs377s.stanford.edu/assignments/waldo/waldo4.jpg
http://cs377s.stanford.edu/assignments/waldo/waldo5.jpg

Unfortunately, there are also several Waldo lookalikes in each scene. Try not to be fooled by these
impostors! Here are some fake Waldos to look out for:

http://cs377s.stanford.edu/assignments/waldo/waldoLookAlikes.jpg

Write a MATLAB function

FindWaldo(filename)

that loads the image specified by filename and displays the image with the detected location of
Waldo marked by a rectangle. You may implement any of the techniques discussed in class or in this
assignment, or invent your own approach. For example, you might try template matching, examining
color distributions, or looking for colored stripes or circles. You may assume that each input image
contains only one valid Waldo.

In your assignment hand-in, include the code listing for your FindWaldo function and a printout of its
output on one of the input images.
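
As a starting point for the template matching idea, the sketch below shows the mechanics of normalized cross-correlation on synthetic data, so it runs without the Waldo images. It assumes the Image Processing Toolbox (for normxcorr2), and the scene, template, and planted location are all made up. It is not a complete FindWaldo: normalized cross-correlation is not scale invariant, so a sample image whose size differs from Waldo's size in the scene would likely need resizing first.

```matlab
% Sketch: locate a known patch in a larger image with normalized
% cross-correlation. Requires the Image Processing Toolbox.
rng(0);                                 % fixed seed so the run is repeatable
scene = rand(120, 160);                 % synthetic grayscale "crowd"
template = rand(15, 10);                % synthetic patch to search for
top = 40;                               % where the patch is planted (rows)
left = 70;                              % where the patch is planted (columns)
scene(top:top+14, left:left+9) = template;

c = normxcorr2(template, scene);        % correlation surface
[~, idx] = max(c(:));
[ypeak, xpeak] = ind2sub(size(c), idx);

% The peak marks the template's bottom-right corner in scene coordinates.
foundTop  = ypeak - size(template, 1) + 1;
foundLeft = xpeak - size(template, 2) + 1;
fprintf('Best match at row %d, column %d\n', foundTop, foundLeft);

% Mark the detection with a rectangle, as the assignment asks.
imshow(scene); hold on;
rectangle('Position', [foundLeft, foundTop, size(template, 2), size(template, 1)], ...
          'EdgeColor', 'y', 'LineWidth', 2);
hold off;
```

For color scenes, converting both images to grayscale or correlating a single color channel is one option; combining the correlation peak with a check for red and white stripes may help reject the lookalikes.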
