Matching Method (PSM) - Mbarara. Toko

The document discusses matching methods, particularly propensity score matching (PSM), which is used to create a comparison group for evaluating program impacts when random assignment is not possible. It emphasizes the importance of observed characteristics in matching and highlights limitations such as the assumption of no unobserved differences that could bias results. Additionally, it outlines the steps involved in PSM and the challenges faced when trying to find suitable matches for treatment units.


Matching Method (PSM)
DME WDK KAMPALA
Jimmy Toko
[email protected]
0701422116/0782399762
Intro
• Matching methods can be applied in the context of almost any
program assignment rules, so long as a group exists that has not
participated in the program.
• Matching methods typically rely on observed characteristics to
construct a comparison group, and so the methods require the
strong assumption of no unobserved differences in the
treatment and comparison populations that are also associated
with the outcomes of interest.
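• In notation (the symbols are supplied here for clarity and are not from the slides), this is the standard "unconfoundedness" or conditional independence assumption: given the observed characteristics X, enrolment T is as good as random with respect to the potential outcomes, written (Y0, Y1) ⊥ T | X.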
Matching (cont.)
• Because of that strong assumption, matching methods are
typically most useful in combination with one of the other
methodologies such as the DiD.
• Matching uses statistical techniques to construct an artificial
comparison group by identifying for every possible observation
under treatment a non-treatment observation (or set of non-
treatment observations) that has the most similar characteristics
possible.
Key Concept:

• Matching uses large data sets and heavy statistical techniques to construct the best possible artificial comparison group for a given treatment group.
Example:
Consider a case in which you are attempting to evaluate the impact of a
program and have a data set that contains both households that enrolled in
the program and households that did not enrol, for example, the
Demographic and Health Survey.
• The program that you are trying to evaluate does not have any clear
assignment rules (such as randomized assignment or an eligibility index)
that explain why some households enrolled in the program and others did
not.
• In such a context, matching methods will enable you to identify the set of
non enrolled households that look most similar to the treatment
households, based on the characteristics that you have available in your
data set.
• These “matched” non-enrolled households then become the
comparison group that you use to estimate the counterfactual.
• Finding a good match for each program participant requires
approximating as closely as possible the variables or
determinants that explain that individual’s decision to enrol in
the program. Unfortunately, this is easier said than done.
• If the list of relevant observed characteristics is very large, or if
each characteristic takes on many values, it may be hard to
identify a match for each of the units in the treatment group.
• As you increase the number of characteristics or dimensions
against which you want to match units that enrolled in the
program, you may run into what is called “the curse of
dimensionality.”
• E.g. if you use only three important characteristics to identify
the matched comparison group, such as age, gender, and region
of birth, you will probably find matches for all program
enrolees in the pool of non-enrolees, but you run the risk of
leaving out other potentially important characteristics.
• The figure below illustrates matching based on four
characteristics:
• age,
• gender,
• months unemployed, and
• secondary school diploma.
• However, if you increase the list of variables, say, to include
number of children, number of years of education, age of the
mother, age of the father, and so forth, your database may not
contain a good match for most of the program enrolees, unless
it contains a very large number of observations.
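To make the curse of dimensionality concrete, the short sketch below (a hypothetical illustration, not from the slides; the DataFrame `df`, the 0/1 `enrolled` column, and the covariate names are assumptions) counts how many enrolled units still have at least one exact match among non-enrolled units as more characteristics are required for the match.

```python
# Hypothetical illustration of the curse of dimensionality in exact matching.
# Assumes a pandas DataFrame `df` with a 0/1 column `enrolled` and the covariates below.
import pandas as pd

def exact_match_rate(df: pd.DataFrame, covariates: list) -> float:
    """Share of enrolled units that have at least one non-enrolled unit
    with identical values on every covariate in `covariates`."""
    treated = df[df["enrolled"] == 1]
    controls = df[df["enrolled"] == 0]
    control_profiles = set(map(tuple, controls[covariates].to_numpy()))
    matched = treated[covariates].apply(tuple, axis=1).isin(control_profiles)
    return float(matched.mean())

# With three coarse characteristics, most enrolees usually find an exact match ...
few = ["age", "gender", "region_of_birth"]
# ... but the match rate collapses as more (or finer) characteristics are required.
many = few + ["months_unemployed", "secondary_diploma", "num_children",
              "years_education", "age_mother", "age_father"]
# print(exact_match_rate(df, few), exact_match_rate(df, many))
```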
• The curse of dimensionality can be quite easily solved using a method
called “propensity score matching” (Rosenbaum and Rubin 1983).
• In this approach, we no longer need to try to match each enrolled unit
to a non enrolled unit that has exactly the same value for all observed
control characteristics.
• Instead, for each unit in the treatment group and in the pool of non
enrolees we compute the probability that a unit will enrol in the
program based on the observed values of its characteristics, the so-
called propensity score.
• Once the propensity score has been computed for all units, then units
in the treatment group can be matched with units in the pool of non
enrolees that have the closest propensity score.
• These “closest units” become the comparison group and are used to
produce an estimate of the counterfactual.
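A minimal sketch of this idea, under the same assumptions as above (hypothetical DataFrame `df`, 0/1 `enrolled` column, illustrative covariate names): a logit model gives each unit a propensity score, and each enrolled unit is paired with the non-enrolled unit whose score is closest.

```python
# Minimal propensity-score sketch (column names and DataFrame `df` are assumed).
import pandas as pd
import statsmodels.api as sm

covariates = ["age", "gender", "months_unemployed", "secondary_diploma"]

# Estimate Pr(enrol = 1 | X) on the pooled sample with a logit model.
X = sm.add_constant(df[covariates])
logit = sm.Logit(df["enrolled"], X).fit(disp=0)
df["pscore"] = logit.predict(X)

# Pair each enrolled unit with the non-enrolled unit whose score is closest.
treated = df[df["enrolled"] == 1]
controls = df[df["enrolled"] == 0]
matches = {
    i: (controls["pscore"] - score).abs().idxmin()
    for i, score in treated["pscore"].items()
}
```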
• The propensity score matching method tries to mimic the
randomized assignment to treatment and comparison groups by
choosing for the comparison group those units that have similar
propensities to the units in the treatment group.
• This score is a single number ranging from 0 to 1 that summarizes all
of the observed characteristics of the units as they influence the
likelihood of enrolling in the program.
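• In standard notation (supplied here for clarity, not from the slides), with T the enrolment indicator and X the vector of observed characteristics, the propensity score is e(X) = Pr(T = 1 | X), a single number between 0 and 1 for each unit.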
• Since propensity score matching is not a real randomized
assignment method, but tries to imitate one, it belongs to the
category of quasi-experimental methods.
• The difference in outcomes (Y) between the treatment or
enrolled units and their matched comparison units produces the
estimated impact of the program.
• In summary, the program’s impact is estimated by comparing
the average outcomes of a treatment or enrolled group and the
average outcome among a statistically matched subgroup of
units, the match being based on observed characteristics
available in the data at hand.
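• Written out (notation supplied for clarity, not from the slides): if enrolled unit i has outcome Y_i and its matched comparison unit(s) have average outcome Y_i^M, the estimated impact, often called the average treatment effect on the treated (ATT), is approximately (1/N_T) × Σ_i (Y_i - Y_i^M), where N_T is the number of matched enrolled units.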
• For propensity score matching to produce externally valid
estimates of a program’s impact, all treatment or enrolled units
need to be successfully matched to a non-enrolled unit.
• It may happen that for some enrolled units, no units in the pool
of non-enrolees have similar propensity scores. In technical
terms, there may be a “lack of common support,” or lack of
overlap, between the propensity scores of the treatment or
enrolled group and those of the pool of non enrolees.
The figure shows the distribution of propensity scores separately for enrolees and non
enrolees.
• Crucially, these distributions do not overlap perfectly. In the
middle of the distribution, matches are relatively easy to find
because enrolees and non enrolees have similar characteristics.
• However, units with predicted propensity scores close to 1
cannot be matched to any non enrolees with similar propensity
scores.
• Intuitively, units who are highly likely to enrol in the program
are so dissimilar to non enrolling units that we cannot find a
good match for them.
• A lack of common support thus appears at the extremes, or
tails, of the distribution of propensity scores.
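A common way to operationalize this check is to keep only units whose scores fall inside the region where the two distributions overlap. The sketch below continues the hypothetical example above (it assumes `df["pscore"]` already holds the estimated propensity scores).

```python
# Restrict the sample to the region of common support (illustrative sketch;
# assumes `df["pscore"]` holds the propensity scores computed earlier).
treated_scores = df.loc[df["enrolled"] == 1, "pscore"]
control_scores = df.loc[df["enrolled"] == 0, "pscore"]

# Overlap region: above the larger of the two minima, below the smaller of the two maxima.
low = max(treated_scores.min(), control_scores.min())
high = min(treated_scores.max(), control_scores.max())

on_support = df[df["pscore"].between(low, high)]
print(f"Units dropped for lack of common support: {len(df) - len(on_support)}")
```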
Matched Comparison Design PSM
The logic behind matching:
• Matching finds, for each observation, a nearly identical observation in the control group based on observable characteristics.
• The project impact is the average of the differences in outcomes between matched pairs of observations.

  Intervention Group (Cases)        → Intervention    → Outcome = O1
        matched via PSM to
  Non-Intervention Group (Controls) → No Intervention → Outcome = O2

• Effect Size = O1 - O2
• Note: the counterfactual is O3, the outcome the intervention group would have experienced without the intervention.
Steps in PSM
• Jalan and Ravallion (2003a) summarize the steps to be taken
when applying propensity score matching.
• 1. You will need representative and highly comparable surveys to
identify the units that enrolled in the program and those that did
not.
• 2. You must pool the two samples and estimate the probability
that each individual enrols in the program, based on individual
characteristics observed in the survey. This step yields the
propensity score.
• 3. You restrict the sample to units for which common support
appears in the propensity score distribution.
Steps (cont.)
• 4. For each enrolled unit, you locate a subgroup of non-enrolled units that have similar propensity scores.
• 5. You compare the outcomes for the treatment or enrolled units and their matched comparison or non-enrolled units. The difference in average outcomes for these two subgroups is the measure of the impact that can be attributed to the program for that particular treated observation.
• 6. The mean of these individual impacts yields the estimated average treatment effect (a compact sketch of these steps follows below).
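Putting the six steps together, here is a compact end-to-end sketch. It is hypothetical: it assumes a pooled survey in a pandas DataFrame `data` with a 0/1 `enrolled` column, an outcome column `y`, and the illustrative covariate names shown; it illustrates the procedure described above rather than code from the slides.

```python
# Illustrative end-to-end PSM pipeline for steps 1-6 (DataFrame `data`, the outcome
# column `y`, and the covariate names are assumptions for this sketch).
import pandas as pd
import statsmodels.api as sm
from sklearn.neighbors import NearestNeighbors

covariates = ["age", "gender", "months_unemployed", "secondary_diploma",
              "num_children", "years_education"]

# Steps 1-2: pool enrolees and non-enrolees, estimate Pr(enrol | X) by logit.
X = sm.add_constant(data[covariates])
data["pscore"] = sm.Logit(data["enrolled"], X).fit(disp=0).predict(X)

treated = data[data["enrolled"] == 1]
controls = data[data["enrolled"] == 0]

# Step 3: keep enrolled units whose scores lie inside the range covered by non-enrolees.
treated = treated[treated["pscore"].between(controls["pscore"].min(),
                                            controls["pscore"].max())]

# Step 4: match each enrolled unit to its nearest non-enrolled unit on the score.
nn = NearestNeighbors(n_neighbors=1).fit(controls[["pscore"]])
_, idx = nn.kneighbors(treated[["pscore"]])
matched_controls = controls.iloc[idx[:, 0]]

# Steps 5-6: take the outcome difference for each matched pair, then average.
individual_impacts = treated["y"].to_numpy() - matched_controls["y"].to_numpy()
att = individual_impacts.mean()
print(f"Estimated average treatment effect on the treated: {att:.2f}")
```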
• Overall, it is important to remember two crucial issues about
matching.
• First, matching must be done using baseline characteristics.
• Second, the matching method is only as good as the
characteristics that are used for matching, so that having a large
number of background characteristics is crucial.
Limitations of the Matching Method

• Although matching procedures can be applied in many settings, regardless of a program’s assignment rules, they have several serious shortcomings.
• First, they require extensive data sets on large samples of units,
and even when those are available, a lack of common support
between the treatment or enrolled group and the pool of
nonparticipants may appear.
• Second, matching can only be performed based on observed
characteristics; by definition, we cannot incorporate
unobserved characteristics in the calculation of the propensity
score.
• So for the matching procedure to identify a valid comparison
group, we must be sure that no systematic differences in
unobserved characteristics between the treatment units and the
matched comparison units exist that could influence the
outcome (Y).
• Since we cannot prove that no such unobserved characteristics
that affect both participation and outcomes exist, we have to
assume that none exist.
• This is usually a very strong assumption. Although matching
helps to control for observed background characteristics, we
can never rule out bias that stems from unobserved
characteristics.
• In summary, the assumption that no selection bias has occurred stemming from unobserved characteristics is very strong and, most problematically, it cannot be tested.
• Matching is generally less robust than the other
evaluation methods.
• For instance, randomized selection methods do not
require the untestable assumption that there are no
unobserved variables that explain both participation in
the program and outcomes.
• They also do not require such large samples or as
extensive background characteristics as propensity score
matching.
• In practice, matching methods are typically used when
randomized selection, regression discontinuity design, and
difference-in-differences options are not possible.
• Many authors use so-called ex-post matching when no baseline
data are available on the outcome of interest or on background
characteristics.
• They use a survey that was collected after the start of the
program (that is, ex-post) to infer what people’s background
characteristics were at baseline (for example, age, marital
status), and then match the treated group to a comparison
group using those inferred characteristics.
• Generally we note that impact evaluations are best
designed before a program begins to be
implemented.
• Once the program has started, if one has no way to
influence how it is allocated and no baseline data
have been collected, very few, or no, solid options
for the evaluation will be available.
Discussion
• Consider a program that provides loans to poor farmers, so that they can buy
fertilizer to increase their maize production.
• In the year before the program started, the farmers who later enrolled in the
program harvested an average of 1,000 kg of maize per hectare (ha). One
year after the program started, their maize yields increased to 1200 kg/ha.
This increase over time is the before-and-after estimator of program impact:
200 (=1,200 – 1,000) kg/ha.
• Before enrolment: 1,000 kg/ha
• After enrolment: 1,200 kg/ha
• Before-and-after estimator of impact: 200 (= 1,200 – 1,000) kg/ha
• One year after the program started, we observe that farmers who enrolled in
the program harvested an average of 1,100 kg of maize per ha, while
farmers who did not enroll harvested an average of 1,000 kg/ha. The cross-section (enrolled versus non-enrolled) estimator of a program’s impact is the
difference in the yields of these two groups:
• 100 (=1,100 – 1,000) kg/ha.
• Enrolled farmers: before, no measurement taken; after, 1,100 kg/ha
• Non-enrolled farmers: before, no measurement taken; after, 1,000 kg/ha
• Cross-section (enrolled versus non-enrolled) estimator of program impact: 100 (= 1,100 – 1,000) kg/ha
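For reference, the two estimators above restated as plain arithmetic:

```python
# The two naive estimators restated as arithmetic (kg of maize per hectare).
before_after_impact = 1200 - 1000   # before-and-after: same farmers, over time -> 200
cross_section_impact = 1100 - 1000  # cross-section: enrolled vs non-enrolled, after -> 100
```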
• Questions for discussion:
• Is this a plausible estimate of the program impact?
• If not, is it likely to underestimate or overestimate the true impact? Why?
• To see the problem with the cross-sectional estimator, consider
the following two scenarios:
