We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 13
Computer Vision Class 10
Notes
Introduction to Computer Vision
Computer vision is a field of artificial
see, observe and make sense of visual
data(like images & videos). Computer vision
enables computers and systems to extract
useful information from digital photos,
videos, and other visual inputs and to
execute actions or make recommendations
based on that information.fe Vision and Artificial Intelligence
Computer vision is a field of artificial intelligence (Al).
Al enables computers to think, and computer vision enables Al to see,
of visual data(like images & videos).
observe and make sense
Artificial Intelligence
Deep
Learnin
Machine
Learning
Computer
VisionComputer Vision Vs. Image Processing
[ Computer Vision
input
Computer Vision is a superset of
ImageProcessing.
+ Examples - Object
Handwritingrecognition, etc.
detection,
Computer vision deals with extracting
information from the input images or
videos toinfer meaningful information and
understanding them to predict the visual
Image Processing
Image processing is mainly focused on
processing the raw input images to
enhance them or preparing them to do
other tasks
Image Processing is
ComputerVision.
Examples - Rescaling image, Correcting
brightness, Changing tones, etc.
a subset ofHow the computer vision work?
Computer vision analyzes visual data using
complex algorithms. The computer vision
algorithm broke the images into pixels and
processed them using a machine learning
technique and compared them with the
dataset to find the pattern or objects.
Applications of Computer Vision
In the 1970s, computer vision as a concept
was first introduced. Everyone was excited
by the new uses for computer vision.
However, a considerable technological
advance in recent years has elevated
computer vision to the top of many
companies’ priority lists. Let's examine a few
of them:¢ Facial Recognition — Security being
the most important application
involves use of Computer Vision for
facial recognition. It can be either
guest recognition or log maintenance
of the visitors.
Face Filters - Modern-day apps like
Instagram and Snapchat have face
filter applications, which capture the
face using a camera and a computer
vision algorithm that is able to identify
the facial dynamics of the person.
Google's Search by Image — Google
has an interesting feature of getting
search results through an image.
Computer vision takes the input image
from the users, compares it with the
database of images, and gives us the
search result.e Computer Vision in Retail — Retailers
can use Computer Vision techniques
to track customers’ movements
through stores, analyze navigational
routes and detect walking patterns.
Self-Driving Cars - Computer vision is
the fundamental technology behind
developing autonomous vehicles.
Most leading car manufacturers in the
world are investing money in artificial
intelligence for developing niall
versions of hands-free technolog¢ Medical Imaging — Computer vision
supported physicians. The medical
image is used to read and convert 2D
scan images into interactive 3D
models that enable medical
professionals to gain a detailed
understanding of a patient's health
condition.
Google Translate App — If anyone
wants to read signs in a foreign
language, point your phone's camera
at the words and let the Google
Translate app tell you what it means in
your preferred language almost
instantly.5.2 Computer Vision Tasks
‘The various applications of Computer Vision are based on a certain number of tasks that are
performed to get certain information from the input image which can be directly used for
predictionor forms the base for futher analysis. The tasks used in a computer vision application are:
station Sone
even
‘center | seoemeraton
Classification
The image Cassiication problem is the task of assigning an input image one label from a fixed
set ofcategories. Ths is one of the core problems in CV that, despite its simplicity, has a large
variety of practical applications.
Classification* Localisation
Thisis the task that involves both processes of identifying what object is present in the image
and at the same time identifying at what location that object is present in that image. It is
used only for single objects.
Object Detecti
Object detection is the process of finding instances of real-world objects such as faces,
bicycles, and buildings in images or videos. Object detection algorithms typically use
extracted features andlearning algorithms to recognize instances of an object category. It is
commonly used in applications such as image retrieval and automated vehicle parking
systems.
Instance Segmentation
Instance Segmentation is the process of detecting instances of the objects, giving them a
category, andthen giving each pixel a label based on that. A segmentation algorithm takes an
image as input and outputs a collection of regions (or segments)
Classification Instance
Classification + Localization lect Detection — segmentation
CAT, 006, DUCK CAT, DOG, DUCK
Sgt tec saanse cece2. Basics of Images-Pixel, Resolution,
Pixel value, grayscale and RGB images
1. Basics of Pixels
The word “pixel” means a picture element.
Every photograph, in digital form, is made up
of pixels. They are the smallest unit of
information that make up a picture. Usually
round or square, they are typically arranged
in a 2-dimensional grid.
In the image above, one portion has been
magnified many times over so that you can
see its individual composition in pixels.
2. Resolution
The number of pixels in an image is
sometimes called the resolution. When the
term is used to describe pixel count, on
convention is to express resolution as ee
width by the height, for example a monitor3. Pixel value
Each of the pixels that represents an image
which describes how bright that pixel is,
and/or what colour it should be. The most
common pixel format is the byte image,
where this number is stored as an 8-bit
integer giving a range of possible values
from 0 to 255. Typically, zero is to be takeny
as no colour or black and 255 is taken t ie
full colour or white.4. Grayscale Images
A grayscale image is a digital image that
contains only shades of black, gray, and
white. In a grayscale image, the highest,
darkest shade is black, which has a zero
value of pixels, and the lightest possible
shade is white, which has 255 values of
pixels.
Let us look at an image to understand about
grayscale images.
Here is an example of a grayscale image. as
you check, the value of pixels is within the
images we see in the form of these numbers.5. RGB Images
All the images that we see around are
coloured images. These images are made up
of three primary colours Red, Green and Blue.RGB Images
All the images that we see around us are coloured images. These images are
made up of three primary colours Red, Green, and Blue.
All the colours that are present can be made by combining different
intensities of red, green, and blue.