Computer Vision (CSA401)
UNIT-3 (Lecture-2)
Image preprocessing: Discrete Cosine Transform (DCT)
The Discrete Cosine Transform (DCT) is a mathematical technique used to convert a signal or
image from the spatial domain to the frequency domain. It is particularly popular in image and
signal processing due to its ability to concentrate most of the signal's information in a small
number of coefficients, making it an efficient way to represent data.
Types of DCT: There are several types of DCT, including DCT-I, DCT-II, DCT-III, and DCT-
IV, each with slightly different definitions and applications. DCT-II is the most commonly used
type, especially in image compression.
Example:
Solution:
RAVIKANT NIRALA Page 1
Computer Vision (CSA401)
RAVIKANT NIRALA Page 2
Computer Vision (CSA401)
Note:
The first coefficient X0=10 represents the average value of the signal (DC component).
The other coefficients X1, X2, and X3 represent the amplitudes of the cosine waves at
different frequencies.
Applications:
Image Compression: DCT is widely used in JPEG compression, where it helps reduce
the amount of data required to represent an image by transforming the image into a
frequency domain and then compressing the less important frequencies.
Audio Compression: Used in audio codecs like MP3 to compress sound by eliminating
frequencies that are less perceptible to the human ear.
Signal Processing: Helps in various signal processing applications where data needs to
be represented in a compact form.
Properties:
Energy Compaction: DCT has the ability to pack most of the energy in a signal into a
few low-frequency components, making it effective for compression.
Orthogonality: DCT basis functions are orthogonal, meaning they do not interfere with
each other, which helps in reconstructing the original signal without loss of information
when all components are used.
RAVIKANT NIRALA Page 3