
Data Compression Lecture 3

Derivation of Average Information
• Given a set of independent events A1, A2, …, An with probabilities pi = P(Ai), we
desire the following properties in the measure of average information H:
– We want H to be a continuous function of the probabilities pi. That is, a small
change in pi should only cause a small change in the average information.
– If all events are equally likely, that is, pi = 1/n for all i, then H should be a
monotonically increasing function of n. The more possible outcomes there are,
the more information should be contained in the occurrence of any particular
outcome.
– Suppose we divide the possible outcomes into a number of groups. We indicate
the occurrence of a particular event by first indicating the group it belongs to,
then indicating which particular member of the group it is. Thus, we get some
information first by knowing which group the event belongs to and then we get
additional information by learning which particular event (from the events in
the group) has occurred. The information associated with indicating the
outcome in multiple stages should not be any different than the information
associated with indicating the outcome in a single stage.
• Suppose we have an experiment with three outcomes A1, A2, and A3, with
corresponding probabilities p1, p2, and p3. The average information associated
with this experiment is simply a function of the probabilities:

H = H(p1, p2, p3)
Shannon showed that the only way all these conditions could be satisfied was if

H = -K Σ pi log pi

where K is an arbitrary positive constant.
• In other words, writing A(n) for the average information of n equally likely
outcomes, the grouping condition applied to n = k·l such outcomes gives
A(k·l) = A(k) + A(l).
• We can generalize this for the case of n = k^m as

A(k^m) = m·A(k)

whose continuous, monotonically increasing solutions are of the form A(n) = K log n.
By convention we pick K to be 1, and we have the formula

H = - Σ pi log2 pi

where the base-2 logarithm gives the average information in bits.
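The following minimal Python sketch (not part of the original slides) implements this formula with K = 1 and base-2 logarithms, and numerically checks the grouping condition from the earlier slide: describing the outcome in two stages (first the group, then the member) carries the same average information as describing it in one stage. The probabilities are chosen only for illustration.

```python
import math

def entropy(probs):
    """Average information H = -sum(pi * log2 pi), with K = 1 (result in bits)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Three outcomes A1, A2, A3 with illustrative probabilities.
p1, p2, p3 = 0.5, 0.3, 0.2

# Single-stage description of the outcome.
h_direct = entropy([p1, p2, p3])

# Two-stage description: first indicate "A1" or "the group {A2, A3}",
# then, if it was the group, indicate which member occurred.
q = p2 + p3
h_grouped = entropy([p1, q]) + q * entropy([p2 / q, p3 / q])

print(h_direct, h_grouped)  # both are about 1.485 bits, as the grouping condition requires
```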
Models

Physical Models
• If we know something about the physics of the data generation process,
we can use that information to construct a model.
– For example, if residential electrical meter readings at hourly
intervals were to be coded, knowledge about the living habits of the
populace could be used to predict when electricity usage would be
high and when it would be low. Then, instead of the actual readings,
the difference (residual) between the actual readings and those
predicted by the model could be coded (see the sketch below).
– In general, however, the physics of data generation is simply too
complicated to understand, let alone use to develop a model.
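A minimal sketch (with made-up numbers, not from the slides) of the residual idea in the meter-reading example: instead of coding the readings directly, code the differences between the readings and the values a usage model predicts. If the model is reasonable, the residuals are small values clustered around zero, which are cheaper to code, and the decoder can reverse the step exactly because it knows the same model.

```python
# Hypothetical hourly meter readings and a model's predictions (illustrative values only).
readings    = [120, 118, 95, 60, 55, 58, 90, 130]
predictions = [118, 117, 96, 62, 54, 60, 88, 128]   # from some physical/usage model

# Code the residuals instead of the raw readings.
residuals = [r - p for r, p in zip(readings, predictions)]
print(residuals)   # small values such as [2, 1, -1, -2, 1, -2, 2, 2]

# The decoder, which knows the same model, recovers the readings exactly.
recovered = [p + e for p, e in zip(predictions, residuals)]
assert recovered == readings
```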

Probability Models
• The simplest statistical model for the source is to assume that each
letter that is generated by the source is independent of every other
letter, and each occurs with the same probability. We could call this
the ignorance model, as it would generally be useful only when we
know nothing about the source.
• The next step up in complexity is to keep the independence
assumption, but remove the equal probability assumption and assign
a probability of occurrence to each letter in the alphabet.
• For a source that generates letters from an alphabet A = {a1, a2, …, aM}, we can
have a probability model P = {P(a1), P(a2), …, P(aM)}.
• Given a probability model (and the independence assumption), we can compute the
entropy of the source as

H = - Σ P(ai) log2 P(ai)

• If the assumption of independence does not fit with our observation of the data,
we can generally find better compression schemes if we discard this assumption.
When we discard the independence assumption, we have to find some way to describe
the dependence of the elements of the data sequence on each other.
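A minimal sketch of such a probability model: estimate P(ai) for each letter from a sample of source output (the sample string below is purely illustrative) and compute the entropy in bits per letter under the independence assumption.

```python
from collections import Counter
import math

def probability_model(text):
    """Relative frequency of each letter: P = {P(a1), ..., P(aM)}."""
    counts = Counter(text)
    total = sum(counts.values())
    return {letter: n / total for letter, n in counts.items()}

def entropy(model):
    """H = -sum(P(ai) * log2 P(ai)), assuming independent letters."""
    return -sum(p * math.log2(p) for p in model.values())

sample = "this is just an illustrative sample of source output"
model = probability_model(sample)
print(f"{entropy(model):.3f} bits/letter")  # the ignorance model would give log2(len(model)) bits
```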
Markov Models
• What is a Markov model?
– A Markov model is a stochastic method for modeling randomly changing
systems that possess the Markov property: at any given time, the next
state depends only on the current state and is independent of anything
in the past.
• Two commonly applied types of Markov model are used when the system
being represented is autonomous, that is, when it is not influenced by
an external agent.
Markov Models: Types
 Markov chains. These are the simplest type of Markov model and are used
to represent systems where all states are observable. Markov chains show all
possible states and, between states, the transition rate, which is the
probability of moving from one state to another per unit of time. This is the
type used in data compression (see the sketch below).
 Hidden Markov models. These are used to represent
systems with some unobservable states. In addition to
showing states and transition rates, hidden Markov
models also represent observations and observation
likelihoods for each state.
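A minimal sketch of a Markov chain with fully observable states, as referenced above: a transition table gives, for each current state, the probability of each possible next state, and the next state is drawn using the current state alone (the Markov property). The states and probabilities are illustrative, not from the slides.

```python
import random

# States and a transition table P[current][next] (illustrative probabilities).
transitions = {
    "sunny": {"sunny": 0.8, "rainy": 0.2},
    "rainy": {"sunny": 0.4, "rainy": 0.6},
}

def step(state):
    """Pick the next state using only the current state (the Markov property)."""
    next_states = list(transitions[state].keys())
    weights = list(transitions[state].values())
    return random.choices(next_states, weights=weights)[0]

state = "sunny"
chain = [state]
for _ in range(10):
    state = step(state)
    chain.append(state)
print(chain)
```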

Markov Chain in Data Compression
• A special conditional probability model called a Markov model (MM), also
known as a discrete-time Markov chain, is used in data compression. A sequence
{xn} is said to follow a kth-order Markov model if

P(xn | xn-1, …, xn-k) = P(xn | xn-1, …, xn-k, xn-k-1, …)

• In other words, knowledge of the past k symbols is equivalent to knowledge
of the entire past history of the process. The values taken on by the set
{xn-1, xn-2, …, xn-k} are called the states of the process.
• If we assumed that the dependence was introduced in a linear manner, we
could view the data sequence as the output of a linear filter driven by white
noise. The output of such a filter can be given by the difference equation

xn = Σ ρi xn-i + εn

where {εn} is a white noise process. This model is often used when developing
coding algorithms for speech and images.
• The use of the Markov model does not require the assumption of linearity.
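A minimal sketch of the difference equation above for the first-order case (k = 1): each output sample is a scaled copy of the previous sample plus a white-noise term. The coefficient and noise level are illustrative.

```python
import random

def ar1_sequence(n, rho=0.9, noise_std=1.0, seed=0):
    """x_n = rho * x_{n-1} + eps_n, with {eps_n} white (here Gaussian) noise."""
    rng = random.Random(seed)
    x, out = 0.0, []
    for _ in range(n):
        x = rho * x + rng.gauss(0.0, noise_std)
        out.append(x)
    return out

samples = ar1_sequence(20)
print(samples[:5])   # neighbouring samples are strongly correlated when rho is near 1
```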
A two-state Markov model for binary images

• The entropy of a finite state process with states Si is simply the average
value of the entropy at each state:

H = Σ P(Si) H(Si)
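A minimal sketch of this formula for a two-state (white/black) model, assuming illustrative transition probabilities P(b|w) and P(w|b): the state probabilities consistent with those transitions weight the per-state entropies.

```python
import math

def h2(p):
    """Binary entropy of a two-outcome distribution (p, 1 - p)."""
    return 0.0 if p in (0.0, 1.0) else -(p * math.log2(p) + (1 - p) * math.log2(1 - p))

# Illustrative transition probabilities for a binary (white/black pixel) source.
p_b_given_w = 0.01   # P(black | current state white)
p_w_given_b = 0.30   # P(white | current state black)

# State probabilities consistent with these transitions.
p_w = p_w_given_b / (p_w_given_b + p_b_given_w)
p_b = 1.0 - p_w

# H = P(Sw) * H(Sw) + P(Sb) * H(Sb): the average of the per-state entropies.
H = p_w * h2(p_b_given_w) + p_b * h2(p_w_given_b)
print(f"{H:.3f} bits/pixel")   # far below 1 bit because long runs of one colour dominate
```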
Markov Models in Text Compression
• Markov models are particularly useful in text compression, where the
probability of the next letter is heavily influenced by the preceding
letters.
• The kth-order Markov models are more widely known as finite context models,
with the word context being used for what we have earlier defined as state.
• Consider the word preceding. Suppose we have already processed
precedin and are going to encode the next letter. If we take no account
of the context and treat each letter as a surprise, the probability of the
letter g occurring is relatively low. If we use a first-order Markov model
or single-letter context (that is, we look at the probability model given
n), we can see that the probability of g would increase substantially. As
we increase the context size (go from n to in to din and so on), the
probability of the alphabet becomes more and more skewed, which
results in lower entropy.
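A minimal sketch of a finite context model built by counting: for each context of length k, count which letter follows it in a small sample, and observe that as the context grows from nothing to n to in, the distribution over the next letter becomes more skewed and its entropy drops. The sample text is purely illustrative.

```python
from collections import Counter, defaultdict
import math

def entropy(counts):
    """Entropy in bits of the distribution implied by a table of counts."""
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def context_counts(text, k):
    """For each k-letter context, count which letter follows it."""
    table = defaultdict(Counter)
    for i in range(k, len(text)):
        table[text[i - k:i]][text[i]] += 1
    return table

text = "preceding precedes preceded unprecedented receding proceeding"
for k, context in [(0, ""), (1, "n"), (2, "in")]:
    counts = context_counts(text, k)[context]
    print(k, repr(context), dict(counts), f"H = {entropy(counts):.2f} bits")
```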

Composite Source Model
• In many applications, it is not easy to use a single model to
describe the source.
• A composite source can be viewed as a combination or
composition of several sources, with only one source being
active at any given time.
• A switch selects a source Si with probability Pi.
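A minimal sketch of a composite source: at each step a switch selects component source Si with probability Pi, and only that source emits the next symbol. The two component sources and the switch probabilities are illustrative.

```python
import random

rng = random.Random(0)

# Two illustrative component sources with different symbol statistics.
def source_text():
    return rng.choice("etaoinshr ")        # letter-like symbols

def source_digits():
    return rng.choice("0123456789")        # numeric symbols

sources = [source_text, source_digits]
switch_probs = [0.7, 0.3]                  # Pi: probability the switch selects source Si

def composite_symbol():
    """Only one source is active at any given time."""
    source = rng.choices(sources, weights=switch_probs)[0]
    return source()

print("".join(composite_symbol() for _ in range(40)))
```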
