Computer vision technology is a crucial part of AI that helps build machines with the ability to look at an image or video, understand it, and respond to it.
Challenges in Computer Vision:
An image is stored in digital form as an array of pixel values. Deep learning
techniques are required to extract insights from this data.
A very large data set is required to train a system to identify
objects from various angles and under various environmental
conditions.
Time-based decision making. Example: a surveillance robot must
generate an alert when someone crosses a railway line while a
train is approaching; otherwise, the crossing should be
considered normal.
In the case of living objects, the ability to differentiate between
the live object, a statue of the object, and a life-size poster or
photo of the object.
Understanding an object in its context.
What is Computer Vision?
Computer vision tasks include methods for acquiring, processing,
analyzing, and understanding digital images, and for extracting
high-dimensional data from the real world in order to produce
numerical or symbolic information, e.g. in the form of decisions.
PURPOSE OF COMPUTER VISION
Object Classification
Object Identification
Object Verification
Object Detection
Object Landmark Detection
Object Segmentation
Object Recognition
Visualization
The purpose is to observe objects that are not visible in an image
Image Sharpening and Restoration
The purpose is to create a better image
Image Retrieval
The purpose is to search for images of interest
Measurement of Pattern
The purpose is to measure various objects in an image
Image Recognition
The purpose is to distinguish the objects in an image
Changing color spaces:
Color spaces define how the values of pixels are represented
for different mediums. Some of the widely used color spaces
include RGB, HSV, CMYK, LAB, YCrCb, etc.
Example: by default, digital images are in the RGB (Red, Green,
Blue) color space. In RGB, the pixels carry only color-component
information and have no direct details about brightness or
saturation. This poses complexity if we wish to process the
brightness of the image. Similarly, some kinds of processing can
give poor results in certain color spaces. Hence, we might choose
to convert an image to a different color space based on the
requirement.
CMYK - Cyan, Magenta, Yellow and blacK - the color space which
is widely used in printers.
HSV - Hue, Saturation, Value - the preferred color space in high-
quality graphics, because it has separate components for color
(hue), color purity (saturation), and brightness (value). This makes
it easy to preserve the color and alter only the brightness of the
images.
Geometric Transformations:
As the name indicates, geometric transformation refers to altering
the geometric aspects of an image, such as its size or orientation,
without affecting the actual contents of the image. Some of the
geometric transformation techniques are listed below:
Scaling:
Generally, images are scaled down to minimize computational
time and resources, while taking into consideration that the
feature details are not lost.
Rotation/Reflection/Translation:
Images can be rotated by different angles so that they can be
analysed from different perspectives.
Mirroring can also be done to obtain insights about the image.
Image translation refers to moving the pixels of an image from one
position to another, with the aim of changing an object's position.
IMAGE SEGMENTATION
In computer vision, segmentation is the process of extracting
related pixels from an image. Segmentation algorithms usually
take an image and produce either a group of contours (the
boundaries of objects that have well-defined edges in the image)
or a mask in which each set of related pixels is assigned a unique
color value to identify it.
The main purpose of image segmentation is to partition an
image into collections of pixels that correspond to:
– Meaningful regions (coherent objects)
– Linear structures (line, curve, …)
– Shapes (circles, ellipses, …)
A very simple example of image segmentation is the method
based on thresholding.
Binary Segmentation
Suppose you have an image and you want to segment it into two
parts (light and dark). You can use a threshold value, for example
100. After segmenting, the image is split into two parts: pixels
with intensity higher than 100 (which can be set to 255, the
maximum intensity value, white) and pixels with intensity of 100
or less (set to 0, the minimum intensity value, black). This is called
binary segmentation. If you use more thresholds, you get more
segments.
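The binary segmentation just described can be sketched in a few lines of NumPy. The gradient image is a stand-in for real input; OpenCV's cv2.threshold performs the same operation.

```python
import numpy as np

# Synthetic grayscale image: each row is a gradient from
# dark (0, left) to light (199, right)
img = np.tile(np.arange(200, dtype=np.uint8), (100, 1))

# Binary segmentation with threshold 100:
# pixels above 100 become 255 (white), the rest become 0 (black)
threshold = 100
binary = np.where(img > threshold, 255, 0).astype(np.uint8)
```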
Feature
A feature is a piece of information which is relevant for solving the
computational task related to a certain application. Features may
be specific structures in the image such as points, edges or
objects. Features may also be the result of a general
neighborhood operation or feature detection applied to the
image.
Main Component Of Feature Detection And Matching
Detection:
Identify the Interest Point in the image.
Some features lie at specific locations in the image, such as
mountain peaks, building corners, doorways, or interestingly
shaped patches of snow. These kinds of localized features are
often called keypoint features (or even corners) and are often
described by the appearance of the patch of pixels surrounding
the point location.
Features that can be matched based on their orientation and
local appearance (edge profiles) are called edges; they can
also be good indicators of object boundaries and occlusion events
in an image sequence.
Description:
The local appearance around each feature point is described in
some way that is (ideally) invariant under changes in illumination,
translation, scale, and in-plane rotation. We typically end up with
a descriptor vector for each feature point.
Matching:
Descriptors are compared across the images to identify similar
features. For two images we may get a set of pairs (Xi, Yi) ↔ (Xi′,
Yi′), where (Xi, Yi) is a feature in one image and (Xi′, Yi′) is its
matching feature in the other image.
IMAGE RECOGNITION
Recognition is one of the toughest challenges in computer vision.
For the human eye, recognizing an object's features or attributes
is very easy; humans can recognize multiple objects with very
little effort. However, this does not apply to a machine. It is very
hard for a machine to recognize or detect an object because
objects vary in terms of viewpoint, size, and scale.
Object Recognition
Object recognition is used to identify an object in an image or
video. It is a product of machine learning and deep learning
algorithms. Object recognition tries to reproduce this innate
human ability to understand certain features or visual details of
an image.
The output of object recognition will include the identified object
category along with the probability of correctness.
Object recognition refers to identification of what is present in the
image, while object detection refers to locating where it is present
in the image.
Object recognition through deep learning can be achieved by
training models from scratch or by utilizing pre-trained deep
learning models. To train a model from scratch, the first thing
you need to do is collect a large labeled dataset. Then you need
to design an architecture that will be used to create the model.
Besides deep learning, object recognition is also possible through
classical algorithmic approaches.
The following are commonly used approaches:
HOG feature extraction
Bag of words model
Viola-Jones algorithm
IMAGE DETECTION
Image or Object Detection is a technique that processes the
image and detects objects in it.
When it comes to applying deep learning to image detection,
developers use Python along with open-source libraries like
OpenCV, Open Detection, Luminoth, ImageAI, and others. These
libraries simplify the learning process and offer a ready-to-use
environment.
The commonly used techniques for Object Detection are
• Haar cascade algorithm
• Viola-Jones algorithm
Object detection uses an object's features to classify its class.
For example, when looking for circles in an image, the machine
will detect any round object. To recognize instances of an object
class, the algorithm uses learning techniques and features
extracted from the image.