ECE421 Digital Communication
Huffman and Shannon Fano
Coding
Lecturer: Dr. Reham Samir
Lossless and Lossy Compression
n Lossless compression refers to the process of encoding data more efficiently so that it occupies fewer bits or bytes, but in such a way that the original data can be reconstructed, bit for bit, when the data is decompressed.
n Lossy compression techniques achieve compression by discarding
some of the original data.
n Lossless compression techniques produce an exact duplicate of the
original data but cannot achieve high levels of compression.
n Example: an RGB color image with an original size of 9.9 megabytes can only be reduced to 6.5 megabytes using the lossless PNG format.
Huffman Coding
n Huffman coding is a source coding method for creating instantaneous codes with minimum average code length; it is therefore called a compact code.
n Any other code for the same alphabet cannot have a lower expected
length than the code constructed by the algorithm.
Huffman Coding
n It is a lossless data compression technique that generates variable-length codes for different symbols.
n It uses the frequency/probability of the symbols to generate the codes.
n The length of the code for a character is inversely related to the frequency of its occurrence (the more often a symbol occurs in the original data, the shorter the binary string used to represent it in the compressed data).
n No codeword is a prefix of another codeword.
Huffman Coding
n Given
n Set of symbol probabilities [Pi], i = 1, 2, 3, ..., N, where N is the number of symbols.
n D is the base of representation: D = 2 for c = {0, 1} (binary code), D = 3 for c = {0, 1, 2} (ternary code).
n The number of Huffman stages = (N - D)/(D - 1).
n If this number is not an integer, add a dummy symbol (see the sketch after this list).
n The dummy symbols have probability 0 and are inserted only to complete the tree.
n At each stage, the number of symbols is reduced by D - 1.
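A small hypothetical helper (not from the lecture) that carries out this bookkeeping: given N and D, it returns how many zero-probability dummy symbols must be added and how many merge stages follow.

    def huffman_stage_count(num_symbols, d):
        """Return (dummies, stages) for a D-ary Huffman code.

        Dummy symbols of probability 0 are added until (N - D) is a
        multiple of (D - 1); each stage then merges D symbols into one.
        """
        n = num_symbols
        dummies = 0
        while (n - d) % (d - 1) != 0:
            n += 1
            dummies += 1
        return dummies, (n - d) // (d - 1)

    # Examples from the following slides:
    print(huffman_stage_count(5, 2))  # (0, 3)  -> 3 binary stages
    print(huffman_stage_count(5, 3))  # (0, 1)  -> 1 ternary stage
    print(huffman_stage_count(6, 3))  # (1, 2)  -> 1 dummy symbol, 2 ternary stages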
Huffman Coding
n Huffman steps
1. Sort the symbols from the largest probability to the smallest.
2. Combine the D smallest probabilities into one new symbol.
3. Repeat steps 1 and 2 until only D symbols remain.
4. Assign the digits of c to the D symbols (e.g. for c = {0, 1}, assign 0 to the higher branch and 1 to the lower branch).
5. Work backwards through the stages, expanding each combined symbol and appending one more code digit to each of the symbols it was built from (see the sketch below).
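A minimal Python sketch of the binary (D = 2) case of these steps; the helper uses a heap in place of explicit re-sorting and is an illustrative assumption, not the lecturer's implementation.

    import heapq

    def huffman_code(probs):
        """Binary (D = 2) Huffman code for a dict {symbol: probability}."""
        # Each heap entry: (probability, tie-breaker, {symbol: partial codeword})
        heap = [(p, i, {sym: ""}) for i, (sym, p) in enumerate(probs.items())]
        heapq.heapify(heap)
        tie = len(heap)
        while len(heap) > 1:
            p_low, _, low = heapq.heappop(heap)    # smallest probability (lower branch)
            p_high, _, high = heapq.heappop(heap)  # next smallest (higher branch)
            # Step 4/5: prepend 0 to the higher branch, 1 to the lower branch
            merged = {s: "0" + c for s, c in high.items()}
            merged.update({s: "1" + c for s, c in low.items()})
            heapq.heappush(heap, (p_low + p_high, tie, merged))
            tie += 1
        return heap[0][2]

    # First example on the following slides: P = (0.25, 0.25, 0.2, 0.15, 0.15)
    probs = {1: 0.25, 2: 0.25, 3: 0.2, 4: 0.15, 5: 0.15}
    code = huffman_code(probs)
    avg = sum(probs[s] * len(code[s]) for s in probs)
    print(code)   # codeword lengths (2, 2, 2, 3, 3)
    print(avg)    # average length = 2.3 bits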
Huffman Coding
n Example: Consider a random variable X taking values in the set X
= {1, 2, 3, 4, 5} with probabilities 0.25, 0.25, 0.2, 0.15, 0.15,
respectively. Create a compact code, then calculate the average code length and the entropy. Use the binary code c = {0, 1}.
Huffman Coding
n Sol: The number of stages = (5 - 2)/(2 - 1) = 3 stages (an integer, so no dummy symbol is needed). The resulting codeword lengths are (2, 2, 2, 3, 3).
n This code has an average length of L = 2.3 bits.
n Entropy: H(X) = 2.286 bits (both worked out below).
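Worked out from the given probabilities and the codeword lengths (2, 2, 2, 3, 3):

    L = 0.25(2) + 0.25(2) + 0.2(2) + 0.15(3) + 0.15(3) = 2.3 bits
    H(X) = -[2(0.25) log2 0.25 + 0.2 log2 0.2 + 2(0.15) log2 0.15]
         = 0.5 + 0.5 + 0.464 + 0.821 ≈ 2.286 bits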
Huffman Coding
n Example: Repeat the last example using the ternary code c = {0, 1, 2}.
n Sol: The number of stages = (5 - 3)/(3 - 1) = 1 stage (an integer, so no dummy symbol is needed).
n This code has an average length of 1.5 ternary digits.
n Entropy: H(X) = 1.442 ternary digits.
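The single stage merges the three smallest probabilities (0.2 + 0.15 + 0.15), giving codeword lengths (1, 1, 2, 2, 2); worked out:

    L = 0.25(1) + 0.25(1) + 0.2(2) + 0.15(2) + 0.15(2) = 1.5 ternary digits
    H(X) = 2.286 / log2 3 = 2.286 / 1.585 ≈ 1.442 ternary digits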
Huffman Coding
n Example: Consider a random variable X taking values in the set X
= {1, 2, 3, 4, 5, 6} with probabilities 0.25, 0.25, 0.2, 0.1, 0.1, 0.1
respectively. Create a compact code, then calculate the average code length. Use the ternary code c = {0, 1, 2}.
n Sol:
n In this example N = 6 and D = 3, so (N - D)/(D - 1) = 3/2 is not an integer; we need to add a dummy symbol of probability 0.
n Then N = 7 and D = 3, so (N - D)/(D - 1) = 2 stages.
Huffman Coding
n Sol:
n This code has an average length of 1.7 ternary digits.
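With the dummy symbol included, the two stages give codeword lengths (1, 1, 2, 3, 3, 2) for the probabilities (0.25, 0.25, 0.2, 0.1, 0.1, 0.1); which of the tied 0.1 symbols receives length 2 depends on the ordering, but the average is the same. Worked out:

    L = 0.25(1) + 0.25(1) + 0.2(2) + 0.1(3) + 0.1(3) + 0.1(2) = 1.7 ternary digits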
Huffman Coding
n Example: Using Huffman coding, calculate the average codeword length required to encode the message ‘mississippi’. Use the binary code c = {0, 1}.
Huffman Coding
n Sol: The number of stages = (4 - 2)/(2 - 1) = 2 stages (an integer, so no dummy symbol is needed).
Symbol   P       Code     Stage 1: P   Code     Stage 2: P   Code
i        4/11    1        4/11         1        7/11         0
s        4/11    00       4/11         00       4/11         1
p        2/11    010      3/11         01
m        1/11    011
n The average length of code = 1(4/11) + 2(4/11) + 3(2/11) + 3(1/11) = 21/11 ≈ 1.909 bits
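A quick check of this result with a short Python snippet (an illustrative helper, not from the lecture): count the letters of ‘mississippi’ and average the codeword lengths taken from the table above.

    from collections import Counter

    freq = Counter("mississippi")               # {'i': 4, 's': 4, 'p': 2, 'm': 1}
    lengths = {"i": 1, "s": 2, "p": 3, "m": 3}  # codeword lengths from the table
    total = sum(freq.values())                  # 11 letters
    avg = sum(freq[ch] * lengths[ch] for ch in freq) / total
    print(avg)                                  # 21/11 ≈ 1.909 bits per symbol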
Huffman Coding
n Note that: depending on where one places a merged probability in the sorted list, the Huffman coding procedure can result in different sets of codeword lengths (with the same average length).
n Example: Consider a random variable X with a distribution
n The Huffman coding procedure results in codeword lengths of (2, 2, 2, 2) or (1, 2, 3, 3) with the binary code c = {0, 1}. Prove this.
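The distribution itself is not reproduced above; a distribution commonly used for this exercise, assumed here purely for illustration, is p = (1/3, 1/3, 1/4, 1/12). Under that assumption both sets of lengths are valid Huffman codes with the same average length:

    L(2, 2, 2, 2) = 2(1/3 + 1/3 + 1/4 + 1/12) = 2 bits
    L(1, 2, 3, 3) = 1(1/3) + 2(1/3) + 3(1/4) + 3(1/12) = 1/3 + 2/3 + 3/4 + 1/4 = 2 bits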
Huffman Coding
n Disadvantages:
n Huffman coding requires two passes: one to build a statistical model of the data and a second to encode it, so it is a relatively slow process. This in turn means that lossless encoding techniques that use Huffman coding are notably slower than other techniques when reading or writing files.
n Another disadvantage of Huffman coding is that the binary strings or codes in the encoded data all have different lengths. This makes it difficult for the decoding software to determine when it has reached the last bit of data, and whether the encoded data has been corrupted.
Shannon-Fano-Elias (S-F-E) Coding
n It is a lossless coding scheme used in digital communication.
n Compared to Huffman encoding, the Shannon-Fano-Elias coding method is simpler.
n S-F-E steps
1. Sort the symbols.
2. Calculate the cumulative probability.
n We will use the cumulative distribution function to allot codewords.
n We can take X = {1, 2, ..., m}. Assume that p(x) > 0 for all x.
n The cumulative distribution function F(x) is defined as:
F(x) = Σ_(a ≤ x) p(a)
Shannon-Fano-Elias (S-F-E) Coding
n S-F-E steps
3. Calculate the modified cumulative distribution function:
F̄(x) = Σ_(a < x) p(a) + p(x)/2
n F̄(x) is the sum of the probabilities of all symbols less than x plus half the probability of the symbol x itself.
4. Write the modified cumulative distribution F̄(x) in binary.
5. Find the length of the codeword:
l(x) = ⌈log2(1/p(x))⌉ + 1
n A codeword of l(x) bits suffices to describe F̄(x).
6. Generate the codeword: take the first l(x) bits of the binary expansion of F̄(x) (a minimal Python sketch of these steps is given below).
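A minimal Python sketch of steps 1-6; the function name and the example distribution are illustrative assumptions, not taken from the slides.

    import math

    def sfe_code(probs):
        """Shannon-Fano-Elias code for a dict {symbol: probability}.

        Symbols are processed in the given (already sorted) order. For each
        symbol the modified CDF F-bar(x) is computed, the codeword length
        l(x) = ceil(log2(1/p(x))) + 1 is chosen, and the codeword is the
        first l(x) bits of the binary expansion of F-bar(x).
        """
        code = {}
        cumulative = 0.0
        for sym, p in probs.items():
            f_bar = cumulative + p / 2                 # step 3: modified CDF
            length = math.ceil(math.log2(1 / p)) + 1   # step 5: codeword length
            bits = ""                                  # steps 4 and 6: binary expansion
            frac = f_bar
            for _ in range(length):
                frac *= 2
                bit = int(frac)
                bits += str(bit)
                frac -= bit
            code[sym] = bits
            cumulative += p
        return code

    # Illustrative distribution (not from the slides):
    probs = {"a": 0.5, "b": 0.25, "c": 0.125, "d": 0.125}
    print(sfe_code(probs))   # {'a': '01', 'b': '101', 'c': '1101', 'd': '1111'}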
Shannon-Fano-Elias (S-F-E) Coding
n The average codeword length is:
L = Σ p(x) l(x) = Σ p(x) [⌈log2(1/p(x))⌉ + 1] < H(X) + 2
n Thus, this coding scheme achieves an average codeword length that is within 2 bits of the entropy (a short derivation follows).
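The bound follows by dropping the ceiling (a standard step filled in here):

    L = Σ p(x) (⌈log2(1/p(x))⌉ + 1) < Σ p(x) (log2(1/p(x)) + 1 + 1) = H(X) + 2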
n Example: Create a Shannon-Fano-Elias code for the source given in the following table, then calculate the average codeword length and the entropy.
Shannon-Fano-Elias (S-F-E) Coding
n Sol:
n The average codeword length is 2.75 bits
n The entropy is 1.75 bits.
n Note that:
n For the Huffman code, the average codeword length is 1.75 bits. Prove this.
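The slide's table is not reproduced here; a distribution consistent with the quoted results, assumed purely for illustration, is p = (0.25, 0.5, 0.125, 0.125). A quick check under that assumption:

    import math

    # Hypothetical distribution (the slide's table is not reproduced)
    p = {"x1": 0.25, "x2": 0.5, "x3": 0.125, "x4": 0.125}

    lengths = {x: math.ceil(math.log2(1 / px)) + 1 for x, px in p.items()}
    avg_len = sum(px * lengths[x] for x, px in p.items())
    entropy = -sum(px * math.log2(px) for px in p.values())
    print(lengths)           # {'x1': 3, 'x2': 2, 'x3': 4, 'x4': 4}
    print(avg_len, entropy)  # 2.75 bits (S-F-E), 1.75 bits (entropy)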
Shannon-Fano-Elias (S-F-E) Coding
n Example: Create a Shannon-Fano-Elias code for the source given in the following table, then calculate the average codeword length.
Shannon-Fano-Elias (S-F-E) Coding
n Sol:
n Note that: we denote by ⌊F̄(x)⌋_l(x) the truncation of F̄(x) to l(x) bits; this truncation is used as the codeword.
n The average codeword length is 3.5 bits.
n Compared with this, the Huffman code has an average codeword length of 2.3 bits; the Shannon-Fano-Elias code is therefore 1.2 bits longer on average than the Huffman code.
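The table for this example is not reproduced; if it reuses the earlier five-symbol distribution (0.25, 0.25, 0.2, 0.15, 0.15), an assumption consistent with the quoted 3.5-bit and 2.3-bit averages, the numbers work out as:

    S-F-E lengths l(x) = ⌈log2(1/p(x))⌉ + 1 = (3, 3, 4, 4, 4)
    L(S-F-E)    = 0.25(3) + 0.25(3) + 0.2(4) + 0.15(4) + 0.15(4) = 3.5 bits
    L(Huffman)  = 0.25(2) + 0.25(2) + 0.2(2) + 0.15(3) + 0.15(3) = 2.3 bits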
Shannon-Fano-Elias (S-F-E) Coding
n Note that:
n The average code length of the Huffman code is never longer than the average code length of the Shannon-Fano-Elias code, since Huffman coding is optimal.