0% found this document useful (0 votes)
81 views17 pages

CIS679: Multimedia Basics: Multimedia Data Type Basic Compression Techniques

This document discusses multimedia compression techniques. It describes different multimedia data types like audio, images, and video. Compression is needed due to high data rates of multimedia. Techniques include lossless compression using Huffman coding and run length encoding, and lossy compression using discrete cosine transform and quantization. Compression involves preprocessing, transformation, quantization, and entropy encoding to remove spatial, spectral and perceptual redundancies.

Uploaded by

nayeem1952754
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
81 views17 pages

CIS679: Multimedia Basics: Multimedia Data Type Basic Compression Techniques

This document discusses multimedia compression techniques. It describes different multimedia data types like audio, images, and video. Compression is needed due to high data rates of multimedia. Techniques include lossless compression using Huffman coding and run length encoding, and lossy compression using discrete cosine transform and quantization. Compression involves preprocessing, transformation, quantization, and entropy encoding to remove spatial, spectral and perceptual redundancies.

Uploaded by

nayeem1952754
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd

CIS679: Multimedia Basics

 Multimedia data type


 Basic compression techniques
Multimedia Data Type
 Audio
 Image
 Video
Audio
 Digitization
 Sampling
 Quantization
 Coding

 Higher sampling rate -> higher quality


 Nyquist sampling theorem: for lossless digitization, the
sampling rate should be at least twice the maximum
frequency responses
 Higher bits per sample -> higher quality
 Sampling at 8 KHz, 8 bit samples -> 64kbits/sec
 CD-quality audio
 Sampling at 44.1KHz, 16 bit samples -> 705.6 kbits/sec
Image/Video
 Digitization
 Scan a picture frame
 Digitize every pixel

 Color represented by RGB


 Normally converted to Y (black and white TV), U
and V
 Luminance Y = 0.30R + 0.59G + 0.11 R
 Chrominance U = (B-Y) * 0.493
V = (R-Y) * 0.877
Video Transmission Standards
 NTSC
 Y = 0.30R + 0.59G + 0.14B
 I = 0.60R + 0.28G + 0.32B
 Q = 0.21R + 0.52G + 0.21B

 PAL
Studio-quality TV
 NTSC
 525 lines at 30 frames/second
 Y sampled at 13.5 MHz, Chrominance values at 6.75 MHz
 With 8-bit samples,
 Data rate = (13.5 + 6.75 + 6.75) * 8 = 216 Mbps
Summary of Multimedia Data Types
 Audio data rate = 64kbps, and 705.6kbps
 Video date rate = 216 Mbps
 Compression is required!
Can Multimedia Data Be Compressed?
 Redundancy can be exploited to do compression!
 Spatial redundancy
 correlation between neighboring pixels in image/video
 Spectral redundancy
 correlation among colors

 Psycho-visual redundancy
 Perceptual properties of human visual system
Categories of Compression
 Lossless
 No distortion of the original content
 Used for computer data, medical images, etc.

 Lossy
 Some distortion
 Suited for audio and video
Compression Techniques
Run-length Coding

Entropy
Huflfman Coding
Encoding

Arithmetic Coding

DPCM

Prediction DM

FFT

Transformation DCT

Source Coding Bit Position

Layered Coding Subsampling

Sub-band Coding

Vector Quantization

J PEG

MPEG

Hybrid Coding H.261

DVI RTV, DVI PLV


Entropy Encoding Techniques
 Lossless compression
 Run-length encoding
 Represent stream as (c1, l1), (c2, l2),…, (ck, lk)
 1111111111333332222444444 = (1, 10) (3, 5) (2,4) (4, 5)
 Or ABCCCCCCCCDEFGGG = ABC!8DEFGGG
 Pattern Substitution
 Substitute smaller symbols for frequently used patterns
Huffman Coding
 Use variable length codes
 Most frequently used symbols coded with fewest
bits
 Codes are stored in a codebook
 Codebook transferred with the compressed
stream
Source Encoding Techniques
 Transformation encoding
 Transform the bit-stream into another domain
 Data in the new domain more amenable to compression
 Type of transformation depends on data

 Image/video transformed from time domain into


frequency domain (DCT)
Differential/Predictive Encoding
 Encoding the difference between actual value and
a prediction of that value
 Number of Techniques
 Differential Pulse Code Modulation (DPCM)
 Delta Modulation (DM)
 Adaptive Pulse Code Modulation (APCM)
 How they work?
 When consecutive change little
 Suited for audio and video
Vector Quantization
 Divide the data stream into blocks or vectors
 One or two dimensional blocks

 Use codebooks
 Find the closest symbol in codebook for a given
sample
 Transmit the reference to that symbol
 Codebook present at sender/receiver
 When no exact match, could send the error
 Lossy or lossless
 Useful with known signal characteristics
 Construct codebooks that can match a wide range
of symbols
Major Steps of Compression
 Preparation
 Uncompressed analog signal -> sampled digital form

 Processing
 Source coding
 DCT typically used: Transform from time domain ->
frequency domain
 Quantization
 Quantize weights into integer codes
 Could use different number of bits per coefficient

 Entropy encoding
 Lossless encoding for further compression
Conclusion
 Multimedia data types
 Why multimedia can be compressed?
 Categories of compression
 Compression techniques
 Entropy encoding
 Source encoding
 Hybrid coding

 Major steps of compression


 What’s next?
 JPEG
 MPEG

You might also like