Digital Image Processing
Javier Montenegro Joo
www.VirtualDynamicsSoft.com
[email protected]
This document contains the class notes of the course on Digital Image Processing given by
Prof. Montenegro Joo to Science and Engineering graduate-level students.
Experiments based on the theory in this document may be executed with Imagery, the Digital
Image Processing EduVirtualLab authored by Prof. Montenegro Joo
Classroom lectures make use of the Imagery EduVirtualLab as a visual aid. Out of class,
students practice on their own images with Imagery, installed on the university Computer
Room machines.
Images, image transformations and operations on images shown in this document have all
been generated with the Imagery EduVirtualLab.
This is a computer-assisted course. When the Imagery icon appears next to a subject heading,
the reader will find experimental support about that theme in the Imagery EduVirtualLab.
The student taking this course on Digital Image Processing must have knowledge of, and experience with, a Computer Programming Language, Matrices, Analytic Geometry, Derivatives, Integrals, Gradients and the Laplacian.
Contents.-
1. Introduction.
2. Applications of DIP.
3. Two examples of situations that demand an efficient DIP.
4. Images from the point of view of DIP (Binary, Grey-level and Colour images).
5. Conversion of Colour to Grey-level images.
6. Digitization.
7. Pixels and their Neighbourhoods.
8. Geometric Transformations on Images (Translation, Rotation, Scaling, Shearing).
9. Problems generated by discretization.
10. Straight lines in DIP.
11. Variation of Darkness and brightness in an image
12. Image Colour Inversion.
13. Rotating and Flipping images.
14. Image subtraction.
15. Segmentation of images
16. Histograms. (Grey-leveled images and color images).
17. Histogram-based Binarization of grey-leveled images
18. Boundary (Edge) Detection of Binary Images.
19. Histogram thresholding (Detection of edges in grey-levelled images)
20. Spatial Operators, Box filters, Windows, Templates and Masks.
21. User defined convolution filters
22. Smoothing filters.
23. Noise-Reduction Median Filters.
24. Unsharp Masking Filter.
25. Detection of Discontinuities in Digital Images (Points, Lines, Edges).
26. The Gradient.
27. Edge Detection by First Derivatives and Gradient.
28. Edge Enhancement by Gradient. The Sobel Operators.
29. Generalised Sobel Operators.
30. Edge Detection with the Laplacian.
31. High-boost Filter.
Acknowledgement.-
Introduction
Optics, a branch of Physics, deals with the radiation emitted by objects. When this radiation lies in the visible spectrum it is called (visible) light and, when it encounters an opaque surface, it renders an image characteristic of the object it comes from. Optical instruments such as lenses, prisms and mirrors are used by physicists to visualize and to study this radiation. Consequently, Digital Image Processing (DIP) may be regarded as just another tool of Optics, because the algorithms of DIP are simply virtual instruments for manipulating and studying the images produced by the radiation generated by objects.

DIP deals with the algorithms used to transform images. Images may need to be transformed merely for aesthetic reasons, but also in order to extract information from them, as is the case with images used in medical applications and in Pattern Recognition.
Applications of DIP
DIP finds its main applications in image reconstruction and in pattern recognition, especially autonomous (machine-based) recognition. It is highly probable that many successful pattern recognition applications are kept secret for commercial or security reasons.

Common applications of DIP include automatic industrial inspection (quality control), radar and detection systems, autonomous robots, optical character recognition (text recognition), geophysical data analysis, chromosome classification, electrocardiogram analysis, radiography, fingerprint recognition and military target recognition.

Obviously, industrial and medical applications are much easier to perform than military ones, because in the former there is controlled illumination and no camouflage.

In automatic industrial quality control, hundreds of products must be checked in a short time, and those presenting imperfections must be automatically identified and separated.

In automatic defence systems, a dot in the sky approaching a vessel at sea must be identified in a very short time so that the ship can activate its defences and shoot it down if it is identified as an enemy aircraft.

In the past, the two situations just mentioned used to be handled manually and therefore demanded a long time; nowadays, thanks to the research and development in DIP, they have become much more manageable.

In medical applications there is also the need to improve the quality of some images or to reduce the level of noise present in them; here, however, the time factor is not as crucial as in the examples mentioned above.

An image distorted by camera shake may be corrected to some degree by means of DIP techniques.
Images from the point of view of DIP

DIP regards an image as a function z = f(x,y), where the value of z at the point (x,y) represents the intensity of the light there, that is, the colour. The values of x and y are limited by the image width and height.

In the field of DIP an image -which may be a photograph- is discretized as a two-dimensional light-intensity function f(x,y), where (x,y) are the coordinates of every point and the value of the function f at (x,y) is the colour at that point. In this way an image is a matrix whose rows and columns are indicated by x and y, and whose elements store the colour.
Digital image researchers have developed mathematical operations on functions like f(x,y) so as to transform them. A few examples of these transformations are (1) cleaning a noisy image, (2) detecting straight lines (illegal airports) in aerial photographs taken on a cloudy day, and (3) detecting contours in poor-quality images. DIP deals with the transformations that can be applied to images in order to extract information from them.
Binary images
In a Binary Image (one in strict black and white) each point (x,y) has one of two values, 1 or 0; usually 0 represents the white background and 1 the black silhouette, although the opposite convention is also possible.
Grey-level images
In a grey-level image the light-intensity values go from 0 through 255, making a total of 256 grey levels at each point (x,y).
Two versions of the same image: the one at the left is in (strict) black and white and is called a Binary Image; the image at the right is in grey levels. Obviously the grey-level image has many more details (more information) than the binary image, but it demands much more storage space and much more memory when displayed on a computer screen. A colour image requires even more memory and storage.
Colour images
Colour images are generally represented in the RGB system (Red, Green, Blue); in this case every point (x,y) of the image is associated with three values R, G and B, each varying from 0 (0%) to 255 (100%). The consequence is that colour images can contain up to 256 x 256 x 256 = 16,777,216 different colours. When R = G = B = 0 the resulting colour is black, and when R = G = B = 255 the resulting colour is white.
Conversion of Colour to Grey-level Images

In colour images each colour has three components, R, G and B, each varying from 0 to 255. In grey-level images the colour has only a single component, which varies from 0 to 255. There are several algorithms to carry out the conversion from colour to grey levels; the one proposed by the NTSC (National Television System Committee) states that each grey level has 56% Red, 33% Green and 11% Blue; notice that 56% + 33% + 11% = 100%.

Under this criterion the colour RGB(240, 36, 128) becomes the grey level given by 56% of 240 plus 33% of 36 plus 11% of 128, which is 134.4 + 11.88 + 14.08 = 160.36 ≈ 160. The white colour RGB(255,255,255) becomes the grey level 142.8 + 84.15 + 28.05 = 255, and the black colour RGB(0,0,0) becomes the grey level 0.

There are other proposals to carry out the conversion from colour to grey level, like the 3-6-1 rule, which proposes 30% Red, 60% Green and 10% Blue. In general, anyone may propose his or her own conversion rule. Notice that if the blue component is assigned a high percentage, the grey-level image may turn out rather dark; for this reason the red and green components are given the high percentages in the usual conversions.
The figure above shows three colour to grey-level transformations achieved with Imagery. Image (A) is the colour input image; images B, C and D are the corresponding grey-level transformations. Transformations B and C are standard, while D is user-defined. The RGB percentages used in the transformations are, respectively: image B: RGB(0.56, 0.33, 0.11), image C: RGB(0.30, 0.60, 0.10) and image D: RGB(0.20, 0.30, 0.50).

Obviously colour is a matter of individual taste; hence everyone may define his or her own colour to grey-level transformation rule.
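As an illustration, the following Python sketch applies a conversion rule of this kind to a single RGB pixel; the weights shown follow the 0.56 / 0.33 / 0.11 rule quoted above, and the sample pixels are just examples.

# Convert one RGB pixel to a grey level using user-defined weights.
def rgb_to_grey(r, g, b, wr=0.56, wg=0.33, wb=0.11):
    # any weight triple adding up to 1.0 may be used instead
    return int(round(wr * r + wg * g + wb * b))

print(rgb_to_grey(240, 36, 128))    # -> 160, as in the worked example above
print(rgb_to_grey(255, 255, 255))   # -> 255 (white stays white)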
Return to Index
Digitization
The digitization of an image generates a mapping of the image onto a grid of discrete coordinates x and y. This has a huge influence on the digital processing of the image, because operations like the derivative, which in ordinary Calculus are carried out over a space of continuous coordinates X-Y, must here be executed over a discrete set of coordinates, where X and Y take only discrete values.

Every point of the two-dimensional matrix or grid representing an image is called a Pixel (or Pel). It is very important to bear in mind that the coordinates x and y of a pixel are discrete, that is, their values can only be x, y = 0, 1, 2, 3, … The value stored in the pixel (x,y) is the image colour at that position.
Part (a) of the figure above shows a (central) pixel C and its four nearest neighbours,
identified as North, South, East and West, or Top, Bottom, Left and Right. Part (b) of the
figure shows the coordinates of a (central) Pixel (x,y) and its eight neighbours.
Neighbourhoods of a pixel

Every pixel (x,y) has four nearest neighbours (Top, Bottom, Right and Left); these are at unit distance. There are also four next-nearest neighbours along the diagonals; these are slightly farther away, at a distance of √2 ≈ 1.4142. Some Digital Image Processing applications demand the full 9-pixel (3x3) neighbourhood, others only the 4-pixel neighbourhood.
Return to Index
Geometric Transformations on Images

The most common geometric transformations are Translation, Rotation, Scaling and Shearing; these are accomplished by operating on the pixel coordinates of the image.

In the two-dimensional cases shown below, after a geometric transformation the position (x,y) of a pixel is changed to (xnew, ynew). The coefficients Dx, Dy, Sx and Sy do not necessarily have integer values.

In Rotation, the pixel (x,y) is rotated by an angle with respect to the origin of coordinates (0,0). Slightly different equations in x and y can generate rotations with respect to a different reference point.

In Scaling, the pixel (x,y) is displaced to the position (x·Sx, y·Sy), measured from (0,0). When this operation is carried out over the pixels of a polygon, the polygon becomes larger or smaller, depending on the values of Sx and Sy. If Sx = Sy the change of size is uniform (the same in all directions); if they are different, the size change is not uniform.

In Shearing, the coefficients in x and y are not necessarily equal; this means that the shearing may not be uniform in x and y.
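The transformation equations themselves were given in figures that are not reproduced here; the Python sketch below shows the standard 2-D forms they normally take (translation, rotation about the origin, scaling and shearing). The shear coefficients Shx and Shy are names chosen for this example.

import math

def translate(x, y, Dx, Dy):
    return x + Dx, y + Dy                        # shift by (Dx, Dy)

def rotate(x, y, angle_deg):
    a = math.radians(angle_deg)                  # rotation about the origin (0,0)
    return (x * math.cos(a) - y * math.sin(a),
            x * math.sin(a) + y * math.cos(a))

def scale(x, y, Sx, Sy):
    return x * Sx, y * Sy                        # uniform only when Sx == Sy

def shear(x, y, Shx, Shy):
    return x + Shx * y, y + Shy * x              # shearing along x and along y

In a digital image the resulting coordinates must finally be rounded to integer pixel positions, which is the source of the discretization problems discussed next.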
Return to Index
Problems Generated by Discretization

The main problem in DIP is that, because pixels have only discrete (integer) coordinates, some geometric transformations produce an image that is only an approximation of the original one. This can be visualized by rotating a point P by an angle A and then rotating the resulting point by -A. Mathematically this operation recovers the original point P; in DIP, however, the original point is not always recovered.

For example, rotate a point P by 45°; since images have only discrete coordinates, the rotated coordinates are rounded and the resulting point is (-1,16). As an attempt to recover the original point, rotate (-1,16) by -45°: this generates the image point (11,12), and it can be seen that the original point is not recovered.
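A minimal Python sketch of this round trip is shown below. The starting point P = (10, 12) is an assumption made here because it reproduces the intermediate values quoted above ((-1,16) and then (11,12)); the original notes do not state which point was used.

import math

def rotate_discrete(x, y, angle_deg):
    # rotate about the origin and round to the nearest pixel position
    a = math.radians(angle_deg)
    xn = x * math.cos(a) - y * math.sin(a)
    yn = x * math.sin(a) + y * math.cos(a)
    return round(xn), round(yn)

p = (10, 12)                         # assumed starting point
q = rotate_discrete(*p, 45)          # -> (-1, 16)
r = rotate_discrete(*q, -45)         # -> (11, 12), not the original (10, 12)
print(p, q, r)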
Straight Lines in DIP

Mathematically, a straight line is a succession of connected points, all with the same slope. In DIP this is not necessarily true: because of the discretization of pixel coordinates, a line may appear as a succession of line segments that are not necessarily connected.

In some applications it is necessary to have a full line, one without gaps. If there is a chance that the line appears as a set of aligned line segments separated by gaps, like objects G and H in the figure, then an algorithm is needed to automatically detect the gaps, if they exist, and fill them. This is the case, for example, when it is essential to know the exact number of pixels in a line, a situation that arises in several pattern recognition algorithms.
Return to Index
Variation of Darkness and Brightness in an Image

The grey levels in a grey-levelled image range from 0 through 255, and the darker the image, the lower its grey levels. Colour images have R, G and B components, each ranging from 0 through 255, and here also the brighter the image, the higher its R, G and B components.
The algorithm to change the degree of brightness or darkness in a colour image is:
Darkness:
New_Red = Red - DeltaDarkness
New_Green = Green - DeltaDarkness
New_Blue = Blue - DeltaDarkness
Brightness:
New_Red = Red + DeltaBrightness
New_Green = Green + DeltaBrightness
New_Blue = Blue + DeltaBrightness
In a grey-levelled image, simply increase (or reduce) its grey levels in order to make it brighter (or darker).
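A small Python sketch of this adjustment is given below; the clamping to the 0-255 range is an implementation detail added here (components cannot leave that range), not something spelled out in the notes.

def adjust_brightness(r, g, b, delta):
    # delta > 0 brightens, delta < 0 darkens; results are clamped to 0..255
    clamp = lambda v: max(0, min(255, v))
    return clamp(r + delta), clamp(g + delta), clamp(b + delta)

print(adjust_brightness(200, 120, 40, 80))    # -> (255, 200, 120)
print(adjust_brightness(200, 120, 40, -60))   # -> (140, 60, 0)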
Return to Index
Image Colour Inversion

The colours of an image can be "inverted" by means of the following operation on its Red, Green and Blue colour components. The figure shows the original colour image, its grey-level version and the inverted image.
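The inversion rule itself appeared only in a figure; the usual form, assumed here, replaces each colour component v by 255 - v:

def invert_pixel(r, g, b):
    # each colour component is replaced by its complement with respect to 255
    return 255 - r, 255 - g, 255 - b

print(invert_pixel(240, 36, 128))   # -> (15, 219, 127)
print(invert_pixel(0, 0, 0))        # black becomes white: (255, 255, 255)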
Rotating and Flipping Images

By operating on every image pixel (x,y), images may be rotated and flipped with the transformations shown next; additional transformations can easily be devised. For all of these transformations the origin of coordinates is placed at the top-left corner of the image.

Original:           (x,y)
Horizontal Mirror:  (x,y) >>> (-x,y)
Upside down:        (x,y) >>> (x,-y)
90° Rotation:       (x,y) >>> (y,-x)
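Because pixel indices cannot be negative, an implementation folds the sign changes back into the valid index range. A Python sketch with the image stored as a list of rows (an assumed layout) is:

def mirror_horizontal(img):
    # (x,y) >>> (-x,y): reverse each row
    return [row[::-1] for row in img]

def upside_down(img):
    # (x,y) >>> (x,-y): reverse the order of the rows
    return img[::-1]

def rotate_90(img):
    # (x,y) >>> (y,-x), with the result folded back into non-negative indices
    h, w = len(img), len(img[0])
    return [[img[y][x] for y in range(h)] for x in range(w - 1, -1, -1)]

print(rotate_90([[1, 2, 3],
                 [4, 5, 6]]))   # -> [[3, 6], [2, 5], [1, 4]]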
Image Subtraction
[ Imagery module: Transformations – Image Subtraction]
Given two images f(x,y) and g(x,y), the difference image h(x,y) is given by

h(x,y) = f(x,y) - g(x,y)

and it is computed by taking the difference between pairs of corresponding pixels in images f(x,y) and g(x,y). The resulting image h(x,y) contains only the regions in which f(x,y) and g(x,y) differ.
Image subtraction has important applications in image enhancement and in image
segmentation. Image subtraction is commonly used in radiography enhancement, a medical
imaging application.
Return to Index
Image Segmentation
[ Imagery module: Segmentation – Binary image segmentation through wrapping ]
Image segmentation, that is, the process of identifying individual pixels of an image matrix as members of different objects or regions in a scene, is an essential constituent of Machine Vision, a topic that Artificial Intelligence deals with.

In simple words, image segmentation is the process of dividing an image into regions. For instance, if an image shows an apple, an orange, a book and a pen, segmenting it may generate four new images, each showing one of the mentioned objects. Once an image has been segmented, the generated images may be used to accomplish tasks in pattern recognition.

The algorithm introduced here to segment binary images consists in surrounding (encapsulating) each object of the image with a capsule or wrapping and then extracting each capsule. The figure below shows some limitations of the algorithm.
In order to wrap every object of the image in a capsule, a top-down, left-to-right sweep of the primary (original) image containing the objects is executed. As soon as a pixel different from the background is detected, the algorithm walks along the border of the object and marks the surrounding pixels; these marked pixels -once the first and the last are joined- become the capsule wrapping the object. The process is repeated for as many pixels different from the background as are detected, using a different mark (capsule or wrapping) for every object. Subsequently the capsules are detected by simply scanning the image for the marked pixels, and the contents of each capsule are extracted.

The proposed algorithm performs rather well; however, there are some restrictions, which arise especially if the algorithm is to be applied to automatic pattern recognition:

(1) The elements in the image must be separated, that is, the objects must be well individualized. The proposed algorithm does not operate well with overlapping objects, because these may be regarded as a single object.

(2) The borders (frontiers) of the elements of the image must be well defined.

(3) Inside the rectangle that tightly surrounds each object there must be only one object, even though objects are not wrapped in rectangular capsules. This problem may be appreciated in frame 2-2 above, which includes a pistol and a rectangle; notice that the rectangle appears in frames 2-2 and 2-3.
In order to read more about this algorithm a published paper is included in the annex.
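The wrapping algorithm itself is detailed in the paper mentioned above. As a simpler stand-in that illustrates the same goal of isolating each object of a binary image, the following Python sketch uses plain connected-component labelling (it is not the author's encapsulation method). The image is assumed to be a list of rows with 0 for background and 1 for object pixels.

from collections import deque

def label_objects(img):
    # assign a different label (2, 3, 4, ...) to each 8-connected object
    h, w = len(img), len(img[0])
    labels = [[0] * w for _ in range(h)]
    current = 1
    for y0 in range(h):
        for x0 in range(w):
            if img[y0][x0] == 1 and labels[y0][x0] == 0:
                current += 1
                labels[y0][x0] = current
                queue = deque([(x0, y0)])
                while queue:
                    x, y = queue.popleft()
                    for dx in (-1, 0, 1):
                        for dy in (-1, 0, 1):
                            nx, ny = x + dx, y + dy
                            if (0 <= nx < w and 0 <= ny < h and
                                    img[ny][nx] == 1 and labels[ny][nx] == 0):
                                labels[ny][nx] = current
                                queue.append((nx, ny))
    return labels, current - 1     # label matrix and number of objects found

Each label can then be copied into its own output image, which plays the role of the extracted capsules.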
Return to Index
Histograms
[ Imagery module: Histograms – Histograms (Grey-levelled images) ]
The histogram of a grey-level digital image gives the number of pixels per grey level in the
image.
The histogram also gives information about the probability of finding a given grey level in the image: the larger the number of pixels with a given grey level, the higher the probability of finding that grey level in the image, and vice versa.
Histogram equalization: A histogram has been equalized when it has been normalized
between 0 and 1, with 0 representing black and 1 representing white. In this way the grey
levels may be regarded as random quantities in the interval [0, 1].
Imagery allows simultaneously visualizing and comparing the histograms of three grey-leveled images.
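A grey-level histogram is simply a count of pixels per level; a minimal Python sketch (image assumed to be a list of rows of grey values 0-255) is:

def grey_histogram(img):
    # hist[k] = number of pixels whose grey level is k
    hist = [0] * 256
    for row in img:
        for grey in row:
            hist[grey] += 1
    return hist

def normalized_histogram(img):
    # fraction of pixels per grey level, i.e. the probability of each level
    hist = grey_histogram(img)
    total = sum(hist)
    return [count / total for count in hist]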
Colour images in the RGB (red, green and blue) representation build each pixel colour as a combination of red, green and blue intensities, each ranging from 0 through 255.

The histogram of a colour image gives the number of pixels for each intensity of Red, Green and Blue in the image.

A colour image generates three histograms, corresponding to the red, green and blue colour components. In general these histograms differ from one another, unless the image has the same amount and distribution of red, green and blue.

When the histogram of a colour image has been equalized, it has been normalized between 0 and 1, with 0 representing the darkest intensity of the colour it is associated with and 1 the brightest intensity of that colour.
The figure shows Lenna’s color image and the associated red, green and blue histograms obtained
with Imagery.
Since the pixels of a grey-levelled image have the same intensity of red, green and blue, the R, G and B histograms generated by a grey-levelled image are all equal. Hence a grey-levelled image may be regarded as a colour image whose pixels have equal quantities of red, green and blue.
Return to Index
Histogram-based Binarization of Grey-levelled Images

Sometimes it is necessary to work with a binary (strict black and white) version of a grey-levelled image; this happens especially in pattern recognition applications, which usually operate on binary images because in those cases only the silhouette of an object is needed. The histogram of a grey-levelled image may be used to set a binarization threshold. When binarizing a grey-levelled image, those pixels whose grey levels are above the chosen threshold are highlighted, for example by showing them in white on a black background.
In the image above the input grey-levelled image to be binarized is the side view of a head,
which resulted from Nuclear Magnetic Resonance (NMR) scanning. The associated
histogram appears in green color, and at the bottom are three binarization instances,
obtained with thresholds of 75, 120 and 150, respectively.
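Once a threshold has been chosen from the histogram, the binarization itself is a one-line test per pixel, as in this Python sketch (the threshold value and the tiny test image are illustrative):

def binarize(img, threshold):
    # pixels above the threshold become 1 (white), the rest become 0 (black)
    return [[1 if grey > threshold else 0 for grey in row] for row in img]

print(binarize([[10, 200], [90, 130]], threshold=120))   # -> [[0, 1], [0, 1]]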
Return to Index
Histogram Thresholding (Detection of Edges in Grey-levelled Images)

The figure below shows how the histogram of a grey-levelled image can be used to set a threshold so that the image is binarized (turned into strict black and white); the edge pixels are then detected by means of the algorithm that looks for incomplete neighbourhoods around pixels.

The binarized image displays (in white) only those pixels whose grey level is above the binarization threshold; pixels whose grey levels are below this threshold are discarded.

At the top of the figure the grey-levelled input image is shown along with its histogram (in green). At the bottom, the input image binarized with a threshold of 89 is shown, accompanied by the edge images obtained with the four- and eight-neighbourhood techniques, respectively. When the four-neighbour technique is used the edge image has 1528 pixels; when the eight-pixel neighbourhood is considered, the edge image contains 2035 pixels.
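The incomplete-neighbourhood criterion can be sketched as follows: an object pixel is an edge pixel when at least one of its neighbours (4 or 8, depending on the chosen neighbourhood) is background. The Python function below is an illustration of that idea, not the exact Imagery implementation.

def edge_pixels(binary, use_eight=False):
    # binary: list of rows with 1 = object, 0 = background
    h, w = len(binary), len(binary[0])
    offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    if use_eight:
        offsets += [(-1, -1), (-1, 1), (1, -1), (1, 1)]
    edges = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if binary[y][x] == 1:
                for dx, dy in offsets:
                    nx, ny = x + dx, y + dy
                    # outside the image or a background neighbour -> edge pixel
                    if not (0 <= nx < w and 0 <= ny < h) or binary[ny][nx] == 0:
                        edges[y][x] = 1
                        break
    return edges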
Return to Index
Spatial Operators, Box Filters, Windows, Templates and Masks

Spatial Operators are also known as Box Filters, Windows, Masks and Templates. When a spatial operator T operates on an image f(x,y) it generates the image g(x,y):

g(x,y) = T[ f(x,y) ]

These operators are usually 3x3 matrices (they may be smaller or larger) containing a weight factor in each cell. By means of a discrete convolution, the centre of the filter matrix is placed on each image pixel, and the new value of that pixel is the weighted sum of the pixels in its 3x3 neighbourhood.
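A minimal Python sketch of this discrete convolution with a 3x3 mask is shown below; copying the border pixels unchanged is just one possible border policy, chosen here for simplicity.

def convolve3x3(img, mask):
    # img: list of rows of grey values; mask: 3x3 list of weights
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]            # border pixels keep their values
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            total = 0.0
            for j in range(3):
                for i in range(3):
                    total += mask[j][i] * img[y + j - 1][x + i - 1]
            out[y][x] = total
    return out

Any of the 3x3 masks discussed in the following sections can be run through a routine of this kind.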
Return to Index
User-defined Convolution Filters

The image below shows a module of the Imagery EduVirtualLab that allows the user to operate on grey-levelled images with his or her own 3x3 filters; the module makes it possible to investigate the effect of user-defined filters and to visualize and compare, on screen, the effect of three different user-defined filters.
Return to Index
Smoothing Filters
[ Imagery module: Convolution – Spatial Operators ]
This filter is used for noise removal by means of neighbourhood averaging; it is given by the mask shown next.
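The averaging mask itself appeared as a figure; the usual 3x3 neighbourhood-averaging (box) mask, assumed here, is

          1  1  1
(1/9)  x  1  1  1
          1  1  1

so that each pixel is replaced by the average of the nine pixels in its 3x3 neighbourhood (itself included).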
Noise-Reduction Median Filters

Median filters are classified as nonlinear filters. They achieve noise reduction with little blurring: the grey level of each pixel in the image is replaced by the median of the grey levels in the neighbourhood of that pixel, and not by the neighbourhood average, as smoothing filters do. The size and shape of the neighbourhood depend on the application; the filter may be applied to an image in any of its four versions: Square, Cross, Vertical Strip and Horizontal Strip.

Given a set of values, the median m is such that half of the values are less than m and half are greater than m. A median filter forces pixels with distinct grey levels to be more like their neighbours.

To apply a median filter, order from minimum to maximum the grey levels of the pixels in the neighbourhood of each pixel p (including p itself), take the median of this set, and replace the grey level of p with that median.

The shape of the median filter, that is, its type of neighbourhood, strongly affects its filtering effect. The most common shapes are Square, Cross, Horizontal Strip and Vertical Strip.
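A Python sketch of a median filter with a square 3x3 neighbourhood (the other shapes only change which offsets are visited) is:

def median_filter_3x3(img):
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]            # border pixels are left unchanged
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            neighbourhood = sorted(img[y + j][x + i]
                                   for j in (-1, 0, 1) for i in (-1, 0, 1))
            out[y][x] = neighbourhood[4]     # the median of the 9 sorted values
    return out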
Return to Index
Unsharp Masking Filter

This filter is used for image enhancement and it is based on subtracting a blurred (smoothed) image from the original.

Let G(x,y) be the enhanced image obtained from f(x,y) by means of

G(x,y) = f(x,y) - Smooth[ f(x,y) ]

where Smooth[f(x,y)] is the smoothed version of f(x,y), obtained as the local average of the eight neighbouring pixels surrounding, but not including, each pixel (x,y).
The drawback of the Unsharp Masking filter is that it enhances noise and introduces some
ringing around noisy dots.
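A self-contained Python sketch of the operation G = f - Smooth[f], with the eight-neighbour average described above, is:

def unsharp_mask(img):
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]            # border pixels are left unchanged
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # average of the eight neighbours, the central pixel excluded
            smooth = (sum(img[y + j][x + i]
                          for j in (-1, 0, 1) for i in (-1, 0, 1))
                      - img[y][x]) / 8.0
            out[y][x] = img[y][x] - smooth   # G(x,y) = f(x,y) - Smooth[f(x,y)]
    return out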
Detection of Discontinuities in Digital Images

The three basic discontinuities in a digital image are Points, Lines and Edges. The easiest way of detecting a discontinuity in an image is by convolving the image with a 3x3 mask or filter. With the mask centred on a given image pixel, the result of the operation at that pixel is

R = Σ (n = 1..9) Wn Zn

where Zn is the grey level of pixel n in the 3x3 neighbourhood and Wn is the weight of cell n in the mask. The result R is assigned to the position of the central pixel of the mask.
Detection of Points
[ Imagery module: Masks – Point & Small hole detection mask ]
The grey level of an isolated point is quite different from the grey levels of its neighbours; a filter to detect such points is shown next.
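The mask itself was given in a figure; the standard point-detection mask, assumed here, weights the central pixel against its eight neighbours:

-1  -1  -1
-1   8  -1
-1  -1  -1

With this mask, R is large only where the grey level of the central pixel differs markedly from that of its surroundings.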
and the result R of evaluating the filter at every pixel position must satisfy

| R | > Threshold

where Threshold is non-negative; obviously, the detection results depend on the value of Threshold. This filter detects points and small holes.

Return to Index
Detection of Lines
[ Imagery module: Masks – Line detection masks ]
In order to detect horizontal, vertical and diagonal (45° and 135°) lines, the following filters may be used; each of them is evaluated at every pixel position as

R = Σ (n = 1..9) Wn Zn
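The four masks appeared as figures; the standard line-detection masks, assumed here, are the following (horizontal, vertical, 45° and 135°, respectively):

Horizontal:
-1 -1 -1
 2  2  2
-1 -1 -1

Vertical:
-1  2 -1
-1  2 -1
-1  2 -1

45°:
-1 -1  2
-1  2 -1
 2 -1 -1

135°:
 2 -1 -1
-1  2 -1
-1 -1  2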
Suppose the four line-detection masks are run through an image. If at a given pixel (xo,yo) it happens that |Result m| > |Result n|, this means that the pixel (xo,yo) is more likely to belong to a line in the direction of mask m than in the direction of mask n. For example, if for a given pixel (xo,yo) the horizontal mask gives the largest of the four absolute responses, that pixel most likely belongs to a horizontal line.
Return to Index
Detection of Edges
The Gradient
The gradient of a function f(x,y) is a vector that points in the direction of the maximum rate of change of f(x,y). The magnitude G of the gradient gives that maximum rate of change of f(x,y), measured along the direction of the gradient vector.

First-derivative operators Dx (with respect to x) and Dy (with respect to y), and the gradient Dx + Dy, are used to detect edges. Dx and Dy detect mainly edges perpendicular to their own directions, that is, Dx detects mainly vertical edges and Dy detects mainly horizontal edges. The gradient Dx + Dy, however, is an isotropic edge detector: it detects edges independently of their orientation.

The magnitude of the digital gradient (first derivative) can be used to detect edges in an image, and the sign of the Laplacian (second derivative) can be used to determine whether an edge pixel lies on the dark or on the light side of the edge. The second derivative at an edge pixel is positive when the pixel lies on the dark side of the edge and negative when it lies on the light side.
The figure above shows edge detection by first derivatives (the gradient). The input image (frames.bmp), containing lines with different orientations, has been processed with the Imagery module that computes the gradient of an image. The other three images are output images, in which the detected pixels have been highlighted. Image Dx displays the derivative with respect to x, detecting mainly verticals; image Dy shows the derivative with respect to y, detecting mainly horizontals; in image Dx+Dy, edges of all orientations have been detected.
In the figure the Gradient threshold has been set to 0.50. The lower the threshold, the higher
the number of detected pixels. Only image pixels whose gradient is above the gradient
threshold are detected. When the threshold is too high, no pixel is detected and the output
image is blank.
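A first-derivative edge detector along these lines can be sketched in Python with simple pixel differences; using |Dx| + |Dy| as the gradient-magnitude estimate and the particular threshold handling are choices made for this example.

def gradient_edges(img, threshold):
    h, w = len(img), len(img[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(h - 1):
        for x in range(w - 1):
            dx = img[y][x + 1] - img[y][x]      # derivative along x
            dy = img[y + 1][x] - img[y][x]      # derivative along y
            if abs(dx) + abs(dy) > threshold:   # gradient-magnitude estimate
                edges[y][x] = 1
    return edges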
Return to Index
Edge Enhancement by Gradient: The Sobel Operators

The gradient is used to detect edges, but it also enhances noise. Edge enhancement by gradient is optimal when the image is noise-free; if the image contains noise, the noise is enhanced as well, and in such cases taking the derivative of the image in order to boost the object borders may not produce the desired results.

The Sobel edge detectors operate not only on binary images; they also operate on grey-scale images.

The image above displays the Sobel edge enhancement by gradient (first derivative). The input (grey-levelled) image is a cut of a human brain. The output (binary) images are the corresponding edge profiles under three different thresholds. It can be seen that the lower the threshold, the higher the number of detected edge pixels; only those image pixels whose gradients are above the threshold are detected and highlighted.
The figure above shows the noise-enhancing effect of the Sobel derivative operator. It can be seen that when the threshold is high, object edges may become imperceptible, while at the other extreme, when the threshold is low, noise is enhanced.

Notice that finding the right threshold so that only object edges are enhanced in a noisy image may be relatively easy when done manually; however, in computer vision applications -where most image processing is performed automatically by a computer- and in a case like the one shown in the image, finding the correct threshold may not be so easy.
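The Sobel operators are the familiar pair of 3x3 masks shown in the sketch below; thresholding |Gx| + |Gy| as an estimate of the gradient magnitude is one common choice, assumed here.

SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]   # responds to vertical edges
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]   # responds to horizontal edges

def sobel_edges(img, threshold):
    h, w = len(img), len(img[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = sum(SOBEL_X[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            gy = sum(SOBEL_Y[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            if abs(gx) + abs(gy) > threshold:    # gradient-magnitude estimate
                edges[y][x] = 1
    return edges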
Return to Index
Edge Detection with the Laplacian

The Laplacian is a second-derivative operator used to detect edges; like the gradient, it also enhances noise.
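A commonly used 3x3 Laplacian mask (assumed here, since the notes do not reproduce one) is

 0 -1  0
-1  4 -1
 0 -1  0

The image is convolved with it exactly as with the other 3x3 masks, and the sign of the response tells on which side of an edge a pixel lies, as discussed in the Gradient section.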
High-boost Filter
[ Imagery module: Edges – High boost filter ]
An original image f(x,y) may be represented as the sum of two components,

f(x,y) = Highpass + Lowpass

so that Highpass = f(x,y) - Lowpass, which is a kind of sharpening mask.

Let A be an image amplification factor; then

High boost = A f(x,y) - Lowpass

Adding and subtracting f(x,y):

High boost = A f(x,y) - f(x,y) + f(x,y) - Lowpass
High boost = (A-1) f(x,y) + f(x,y) - Lowpass
High boost = (A-1) f(x,y) + Highpass
Notice that when A = 1, then the standard Highpass image is obtained. When A > 1, part of
the original image is added back to the Highpass result, then the High boost image looks
more like the original image and includes some degree of edge-enhancement that depends
on the value of A.
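A self-contained Python sketch of the high-boost filter, with the 3x3 neighbourhood average playing the role of the Lowpass image (an assumption made for this example), is:

def high_boost(img, A):
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]              # border pixels are left unchanged
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            lowpass = sum(img[y + j][x + i]
                          for j in (-1, 0, 1) for i in (-1, 0, 1)) / 9.0
            out[y][x] = A * img[y][x] - lowpass   # High boost = A*f - Lowpass
    return out

With A = 1 this reduces to the standard Highpass image, as noted above.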
Return to Index
Dilation expands a region whereas Erosion erodes (shrinks) it. Dilation and Erosion
masks may be either 4-pixel or 8-pixel neighbourhoods around a central pixel.
Dilation
[ Imagery module: Transformations – Dilation of binary images ]
In Dilation, a pixel becomes (or remains) an object pixel if there is at least one object pixel within its predefined (4- or 8-pixel) neighbourhood; all other pixels are left as background, so object regions grow outwards.

With Dilation, small holes or cracks become filled and contour lines become smoother. Shapes or objects dilated with an 8-pixel mask come out much more dilated than those dilated with a 4-pixel mask.
Erosion
[ Imagery module: Transformations – Erosion of binary images ]
Erosion removes boundary pixels: only pixels having a full object neighbourhood survive an Erosion operation, and all other pixels vanish. Eroded objects (shapes) are thinner than their originals.

With Erosion some noise may be eroded away and disappear. Groups of pixels connected by a small bridge become disconnected after erosion, and objects smaller than the mask disappear completely.

Erosion with an 8-pixel mask erodes much more than erosion with a 4-pixel mask.
The Opening operation removes small objects in an image; it is achieved by Erosion followed
by Dilation.
Erosion eliminates small objects in an image but it also shrinks all the remaining objects. In
order to avoid this shrinking, the image may be dilated after erosion.
The Closing operation can be used to refill holes and cracks; it is achieved by Dilation
followed by Erosion.
Dilation refills small holes and cracks, but enlarges the objects in an image. This
enlargement may be reversed by eroding the image after it has been dilated.
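Dilation and Erosion of a binary image can be sketched in Python as follows (8-pixel neighbourhood; the 4-pixel version only changes the list of offsets). Opening and Closing then follow directly as the compositions described above.

OFFSETS_8 = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
             (0, 1), (1, -1), (1, 0), (1, 1)]

def dilate(binary, offsets=OFFSETS_8):
    h, w = len(binary), len(binary[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # a pixel is set if it, or any neighbour in the mask, is an object pixel
            if binary[y][x] == 1 or any(
                    0 <= y + dy < h and 0 <= x + dx < w and binary[y + dy][x + dx] == 1
                    for dy, dx in offsets):
                out[y][x] = 1
    return out

def erode(binary, offsets=OFFSETS_8):
    h, w = len(binary), len(binary[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # only object pixels with a full object neighbourhood survive
            if binary[y][x] == 1 and all(
                    0 <= y + dy < h and 0 <= x + dx < w and binary[y + dy][x + dx] == 1
                    for dy, dx in offsets):
                out[y][x] = 1
    return out

def opening(binary):
    return dilate(erode(binary))     # Erosion followed by Dilation

def closing(binary):
    return erode(dilate(binary))     # Dilation followed by Erosion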
Return to Index
The (absolute) difference between two images f and g is given by d = | f – g |; the absolute
value of the difference is considered because the difference may be positive or negative,
depending on the grey-level of the corresponding pixels on both images.
When the difference d = | f – g | = 0, both f and g are exactly equal images, without the
slightest difference between them.
Since the grey levels vary from 0 through 255, a given pixel at (xo,yo) in two images f and g may differ by some value between 0 and 255, and a controlled difference between the images may be obtained. This controlled difference detects only those pixels satisfying

| f(xo,yo) - g(xo,yo) | ≥ T

where T is a threshold. When T = 1, the minimum (absolute) difference between the grey levels of corresponding pixels at (xo,yo) in f and g is detected; when T is greater than the difference between the grey levels at (xo,yo), no difference is detected at that pixel.

Once the differing pixels have been detected in images f and g, they may be reproduced on the image that does not include them; in this way a new image h, being the fusion of images f and g, is generated. Notice that the new image h is a controlled fusion of f and g.
Return to Index
The weighted average of two images generates an image that contains a prescribed amount
of information from both input images. In this case, user-defined percentages of the two input
images are combined according to the following algorithm:
h(x,y) = P1 · f(x,y) + P2 · g(x,y) ,      with P1 + P2 = 100%

where P1 and P2 are the user-defined contribution percentages of the two input images, and the same weighted average is applied to each of the R, G and B components of every pixel.

As usual, the devil hides in the details: although the resulting image has information from both input images, the drawback of this algorithm is that the output image has -with regard to colour- less information than either of the two constituent images, because every pixel colour has been averaged.

In the Imagery Virtual Lab the drawback mentioned above has to some extent been avoided by adding the option of ignoring (disregarding) a colour during image fusion. When a colour to disregard is selected, that colour is assigned a contribution percentage of 0 during fusion; in this way, any colour that fuses with the selected colour contributes 100% to the image fusion.
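A Python sketch of this weighted fusion for one pair of corresponding RGB pixels is given below; the percentages and the optional colour to disregard follow the description above, although the exact Imagery behaviour may differ.

def fuse_pixels(p1, p2, percent1, ignore=None):
    # p1, p2: (R, G, B) tuples; percent1 is the contribution of p1 in percent
    if ignore is not None:
        if p1 == ignore:                  # a disregarded colour contributes 0%
            percent1 = 0
        elif p2 == ignore:
            percent1 = 100
    w1 = percent1 / 100.0
    return tuple(int(round(w1 * c1 + (1 - w1) * c2)) for c1, c2 in zip(p1, p2))

print(fuse_pixels((200, 0, 0), (0, 0, 200), 50))   # -> (100, 0, 100)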
Return to Index
Pattern Recognition
Pattern Recognition is the collection of techniques that allow classifying objects or signals into a group of predefined categories.
Animals constantly carry out pattern recognition when they identify objects and persons (by
sight) and when they identify sounds (by ear).
Invariant Pattern Recognition is the collection of techniques (algorithms and strategies) that allow classifying or recognizing an object within an image independently of its position, orientation and size. Pattern recognition makes use of digital image processing.
Computer Vision
Computer Vision, also known as Cybernetic Vision, is the area of Artificial Intelligence aimed
at recognition and classification of objects within images. The goal of Computer Vision is
the autonomous and automatic application of algorithms and techniques belonging to digital
image processing, so as to replace the human eye and accomplish the functions of the
human visual system.
Return to Index
The Pattern Centroidal Profile (Signature)

The Pattern Centroidal Profile reduces the 2-D representation of the pattern (object) boundary to a much simpler 1-D functional representation, the "Signature" of the pattern.

Given a pattern (object) in the input image, its centroid (geometric centre) is detected by means of the Physics equation for the centre of mass which, with every object pixel given the same unit mass, reduces to

xc = (1/N) Σ xi ,      yc = (1/N) Σ yi

and then the whole pattern is displaced so as to put its centroid at the origin of coordinates (0,0).

Since neither rotation nor scaling is carried out, the orientation and size of the translated pattern are the same as the original ones. Next, the angle and the distance of every border point (x,y) with respect to (0,0) are computed; this constitutes the Centroidal Profile Representation (Signature) of the pattern.

The centroidal-profile representation is possible only as long as the object is not solid but edged (only its boundary is used), has no holes, and the image is noiseless.
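A Python sketch of the signature computation is shown below; representing the boundary as a list of (x, y) pairs is an assumption made for this example.

import math

def centroidal_profile(boundary):
    # boundary: list of (x, y) border points of the pattern
    n = len(boundary)
    xc = sum(x for x, _ in boundary) / n          # centroid (centre of mass)
    yc = sum(y for _, y in boundary) / n
    signature = []
    for x, y in boundary:
        dx, dy = x - xc, y - yc                   # point referred to the centroid
        angle = math.degrees(math.atan2(dy, dx))
        distance = math.hypot(dx, dy)
        signature.append((angle, distance))
    return signature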
Return to Index
Invariant Moments

There are two types of invariant moments: the Invariant Moments created by M. K. Hu in 1961, which operate on all the pixels of the object to be recognized, and the Improved Invariant Moments created by C. C. Chen in 1993, which operate only on the boundary (edge) pixels of the object. Here the former are referred to as Massive Invariant Moments and the latter as Boundary Invariant Moments.

The geometrical moments of an object f(x,y) are defined as

m_pq = ∫∫ x^p y^q f(x,y) dx dy ,      p, q = 0, 1, 2, …      (1)

These geometrical moments are not invariant. The double integrals are to be considered over the whole area of the object, including its boundary; this implies a computational complexity of order O(N²). The density-distribution function f(x,y) gives the intensity (colour) of the point (x,y) in image space; in simpler words, f(x,y) is the colour of point (x,y) in the image. In practical pattern recognition applications the image space is reduced to a binary version, and in such a case f(x,y) takes the value 1 when the pixel (x,y) represents an object (or even noise) and 0 when it is part of the background.

Notice that

m_00 = ∫∫ f(x,y) dx dy

is the total area of the object f(x,y), that is, its total number of pixels.

When the geometrical moments m_pq of equation (1) are referred to the object centroid or centre of mass (x_c, y_c), they become the Central Moments μ_pq, and they are invariant to translation:

μ_pq = ∫∫ (x - x_c)^p (y - y_c)^q f(x,y) dx dy      (2)

The Central Moments may be normalized so as to become invariant also to area scaling (change of size) through the relation
η_pq = μ_pq / (μ_00)^γ ,      with γ = (p + q)/2 + 1      (3)
The set of the seven lowest-order Rotation, Translation and Scale (RTS) invariant functions φ_i includes invariants up to the third order; it is given by:

φ_1 = η_20 + η_02

φ_2 = (η_20 - η_02)² + 4 η_11²

φ_3 = (η_30 - 3η_12)² + (3η_21 - η_03)²

φ_4 = (η_30 + η_12)² + (η_21 + η_03)²

φ_5 = (η_30 - 3η_12)(η_30 + η_12)[(η_30 + η_12)² - 3(η_21 + η_03)²]
      + (3η_21 - η_03)(η_21 + η_03)[3(η_30 + η_12)² - (η_21 + η_03)²]

φ_6 = (η_20 - η_02)[(η_30 + η_12)² - (η_21 + η_03)²] + 4 η_11 (η_30 + η_12)(η_21 + η_03)

φ_7 = (3η_21 - η_03)(η_30 + η_12)[(η_30 + η_12)² - 3(η_21 + η_03)²]
      - (η_30 - 3η_12)(η_21 + η_03)[3(η_30 + η_12)² - (η_21 + η_03)²]      (4)

In practical pattern recognition applications, equations (1) and (2) are discretized for binary images according to

m_pq = Σ_x Σ_y x^p y^q f(x,y)      (5)

μ_pq = Σ_x Σ_y (x - x_c)^p (y - y_c)^q f(x,y)      (6)

In practice, when the set of equations (4) is applied to a group of n images containing different (rotation, translation and scale) instances of the same object, seven numbers φ_1 … φ_7 are obtained from each image (instance). These numbers are, if not exactly equal, at least close to each other for every instance:

φ_i(image 1) ≈ φ_i(image 2) ≈ … ≈ φ_i(image n) ,      i = 1, …, 7
As an example of the application of the Invariant Moments, the following six different RTS (Rotation, Translation and Size) instances of a holder were submitted to the Massive Invariant Moments module in the Imagery Virtual Lab.

The following table shows the seven massive invariant moments for the six instances of the holder. In order to avoid dealing with huge numbers, their logarithms were used.

As can be seen, the invariant moments are not exactly equal for different instances of the same object; there exists a range of variation, and in pattern recognition applications the range of variation of the invariant moments must be taken into account.
The table below shows the range of variation of the invariant moments for the holders used
in this example.
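As a compact illustration of equations (5), (6), (3) and the first two functions of (4), the following Python sketch computes φ_1 and φ_2 for a binary image stored as a list of rows (1 = object pixel, 0 = background); it is a minimal sketch, not the Imagery implementation.

def moments_phi1_phi2(binary):
    # raw moments m_pq over all object pixels, equation (5)
    def m(p, q):
        return sum((x ** p) * (y ** q)
                   for y, row in enumerate(binary)
                   for x, v in enumerate(row) if v == 1)

    m00 = m(0, 0)                                 # object area (number of pixels)
    xc, yc = m(1, 0) / m00, m(0, 1) / m00

    # central moments mu_pq, equation (6)
    def mu(p, q):
        return sum(((x - xc) ** p) * ((y - yc) ** q)
                   for y, row in enumerate(binary)
                   for x, v in enumerate(row) if v == 1)

    # normalized central moments, equation (3): gamma = (p + q)/2 + 1
    def eta(p, q):
        return mu(p, q) / (m00 ** ((p + q) / 2 + 1))

    phi1 = eta(2, 0) + eta(0, 2)
    phi2 = (eta(2, 0) - eta(0, 2)) ** 2 + 4 * eta(1, 1) ** 2
    return phi1, phi2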
Boundary Invariant Moments

C. C. Chen introduced a method to compute a set of slightly different invariant functions based on computations carried out only along the boundary of the object. In this way the computational complexity of the problem, and the computer time, are reduced from O(N²) to O(N). The computations needed to obtain the Boundary Moments are much simpler than those needed for the Massive Moments; however, they require pre-processing in order to extract the boundaries of the objects.

C. C. Chen uses the same RTS invariant functions (4) originally deduced by Hu; however, he introduces a new scaling factor, used instead of the one of equation (3), to achieve invariance to boundary-length scaling:

η_pq = μ_pq / (μ_00)^(p+q+1)      (12)

Notice that in this case m_00 is the length of the curve C, the edge of the object.

For the Boundary Moments to become invariant to translation they must be referred to the object centroid:

μ_pq = ∮_C f(x,y) (x - x_c)^p (y - y_c)^q dl      (10)

These are the Boundary Central Moments, and the integral must be evaluated along the edge C of the object. After discretization this becomes

μ_pq = Σ_{(x,y) ∈ C} f(x,y) (x - x_c)^p (y - y_c)^q      (11)

and it can be seen that, after discretization, it is not necessary to carry out the sum in any particular order; this means that the points (x,y) ∈ C can be taken in any order, for example as they are met when sweeping the image space top-down and left-to-right.
Return to Index
The Hough Transform

In the Hough Transform for straight lines, every point (x,y) of the input space is mapped onto the sinusoid

ρ = x cos θ + y sin θ

in the ρ-θ parameter space. The Accumulator Space ρ-θ is discretized in cells of coordinates (ρ, θ), and the sinusoid associated with a point (x,y) contributes votes to the accumulator cells it passes through. Even noise dots in the input space generate a sinusoid in the Accumulator.

Aligned points in the X-Y space generate sinusoids in the Accumulator that intersect in at least one point (ρ, θ), whose coordinates may be used to identify and reconstruct that set of aligned points, that is, the line.

As a sinusoid passes through different cells (ρ, θ) of the accumulator, the vote count stored in each of those cells is incremented by one; in this way, the value stored in each cell of the Accumulator is the number of sinusoids crossing that cell.

Since all the sinusoids corresponding to a set of aligned points (a line) pass through a given cell, the count stored in that cell is the number of dots in the line. After Hough-transforming an input space (input image), the highest counts in the accumulator correspond to aligned points (lines), and the lowest counts are associated with noise or with sets of only a few aligned dots.

In Pattern Recognition applications, after Hough-transforming the input space X-Y, the accumulator cells containing the highest values are identified, cropped and processed so as to extract information from them; this automatically discards noise and also short lines.

For instance, after Hough-transforming a Cartesian input space X-Y containing a rectangle, many cells of the accumulator will store different integer values, but the four highest stored values will correspond to the four sides of the rectangle. The cells containing these highest values are easily detected; consequently the rectangle is represented by only four accumulator pairs (ρ, θ). Noise dots, if present in the input space, will produce only low values in the accumulator cells.
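A compact Python sketch of this voting scheme is given below; the accumulator resolution (1° in θ and 1 pixel in ρ) is an arbitrary choice made for the example.

import math

def hough_lines(points, width, height):
    # points: list of (x, y) object pixels in the input space
    rho_max = int(math.hypot(width, height))          # largest possible |rho|
    # acc[theta][rho + rho_max] = number of sinusoids passing through that cell
    acc = [[0] * (2 * rho_max + 1) for _ in range(180)]
    for x, y in points:
        for theta in range(180):                      # one sinusoid per point
            t = math.radians(theta)
            rho = int(round(x * math.cos(t) + y * math.sin(t)))
            acc[theta][rho + rho_max] += 1
    return acc

acc = hough_lines([(i, i) for i in range(50)], 100, 100)
votes, theta, rho_cell = max((v, t, r) for t, row in enumerate(acc)
                             for r, v in enumerate(row))
print(votes, theta, rho_cell - int(math.hypot(100, 100)))
# the 50 aligned points give a single cell with 50 votes, at theta = 135°, rho = 0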
Return to Index
References
This Digital Image Processing course is based on the following material:
[1] J. Montenegro Joo. Geometric-Transformations Invariant Pattern Recognition in the Hough Space,
Doctoral Degree Project. Cybernetic Vision Research Group, Instituto de Física de Sao Carlos
(IFSC), Dpto. de Física e Informática, Universidade de Sao Paulo (USP), Sao Carlos, SP, Brazil.
(August 1994)
[2] J. Montenegro Joo. Invariant Boundary moments in Pattern Recognition. The method of C.C.
Chen. Doctoral Qualification Exam (April 1994). Cybernetic Vision Research Group, Instituto de
Física de Sao Carlos (IFSC), Dpto. de Física e Informática, Univ. de Sao Paulo (USP), Brazil.
[4] J. Montenegro Joo. Invariant Recognition of Rectangular Biscuits through an Algorithm Operating
exclusively in Hough Space. Flawed Pieces Detection. RIF-UNMSM, Vol 5 (2002)
[6] J. Montenegro Joo. Improved Moment Invariants Know How, Why and When,
RIF-UNMSM., Vol. 8, No 2, 2005
[8] J. Montenegro Joo. Boundary Geometric Moments and its application to automatic quality control
in the Industry. JMJ, Industrial Data, Vol. 9, No 1, 2006
[9] Javier Montenegro Joo, Hough-Transform based algorithm for the automatic invariant recognition of
rectangular chocolates. Detection of defective pieces. Industrial Data Vol 9, No 2, 2006.
[10] J. Montenegro Joo. Hough-Transform based Automatic Invariant Recognition of Metallic Corner-
Fasteners. Industrial Data, Vol. 10 - No 1 – 2007
[11] Javier Montenegro Joo. Automatic Classification of Products in the Industry via Invariant
Boundary Moments. Industrial Data, Vol 10, No 2, 2007
[12] Javier Montenegro Joo. Image Segmentation through Encapsulation of its Constituents.
Industrial Data, Vol 13, No 1, 2010