
RELIABILITY

BY: HINA SAMREEN


RELIABILITY
• Synonym for dependability or consistency (Gatewood & Field, 2001).
• Refers to consistency in measurement and stability of results on a measure (Schwab, 2005).
• “The degree to which the results can be replicated under similar conditions” (McBride, 2010).
• Reliability – not an all-or-none matter.
• A test may be reliable in one context and unreliable in another.
• Reliability Coefficient – an index of reliability; a proportion that indicates the ratio
between the true score variance on a test and the total variance.
• Reliability is expressed as a coefficient ranging from 0 to 1.
• If we use X to represent an observed score, T to represent a true score,
and E to represent error, then the observed score equals the true score
plus error:
X = T + E
• VARIANCE – a measure of the variability of scores. Variance in scores affects the reliability of
the test.
• Variance from true differences is true variance and variance from
irrelevant, random sources is error variance.
• The term reliability refers to the proportion of the total variance attributed to
true variance; the greater this proportion, the more reliable the test (a short simulation sketch follows below).
• Because true differences are assumed to be stable, they are presumed to yield
consistent scores on repeated administrations of the same test as well as
on equivalent forms of the test.
• Error variance may increase or decrease a test score by varying
amounts – consistency of the test score or reliability may be affected.
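
To make the variance decomposition concrete, here is a minimal simulation sketch (not from the lecture; all numbers are hypothetical) showing that the reliability coefficient is the proportion of observed-score variance that is true-score variance:

```python
import numpy as np

# Minimal sketch: simulate X = T + E for a group of test takers and show that
# the reliability coefficient is the ratio of true-score variance to total
# observed-score variance. All values are hypothetical.
rng = np.random.default_rng(0)

true_scores = rng.normal(loc=50, scale=10, size=10_000)  # T: stable true differences
error = rng.normal(loc=0, scale=5, size=10_000)          # E: random error variance
observed = true_scores + error                           # X = T + E

reliability = true_scores.var() / observed.var()  # true variance / total variance
print(f"reliability coefficient: {reliability:.2f}")  # approx. 100 / (100 + 25) = 0.80
```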
• Measurement Error – all of the factors associated with the process of
measuring some variable, other than the variable being measured.
• Example: Consider an English-language test on the subject of 12th-grade algebra being
administered, in English, to a sample of 12th-grade students newly arrived in the United
States from China. The students in the sample are known to be whiz kids in algebra, yet all of
the students receive failing grades on the test for some reason.
• Do these failures indicate that these students are not whiz kids at
all? Perhaps this group of students did not do well on the algebra test
because they could neither read nor understand what was required
of them.
• In fact, the test was written and administered in
English – a source of measurement error. (The test should have been translated and administered
in the language of the test takers.)

Two Categories of Measurement Error

Measurement error may be random or systematic.
Random error. A source of error in measuring a targeted variable caused by
unpredictable fluctuations and inconsistencies of other variables in the measurement
process (examples: noise, unanticipated events happening within the test taker, etc.).
Systematic error. A source of error in measuring a variable that is typically
constant or proportionate to what is presumed to be the true value of the variable being
measured (example: the measuring instrument itself may be found to be a source of systematic
error).
Types of Reliability
• Test-Retest Reliability. An estimate of reliability obtained by correlating pairs
of scores from the same sample on two different administrations of the same test.
• Purpose of this type of reliability – to check whether participants’ scores are consistent across
multiple administrations of the same test over time.
• The test-retest measure is appropriate when evaluating the reliability of a test
that purports to measure something that is relatively stable over time, e.g., a
personality trait.
• If the characteristic being measured is assumed to fluctuate over time, test-retest
reliability is not suitable.
• As time passes, people change; e.g., people may learn new things, forget
some things, and acquire new skills.
• The main issue with test-retest reliability (Sturman et al., 2005):
– Differences in measures between the first and second administrations could impact the
reliability due to the following factors:
1. The time interval between test administrations (the passage of time can be a source of variance – the
longer the time interval between administrations of the same test, the lower the correlation between the
scores obtained on each testing).

2. The test itself or other factors associated with the participant.


• Test-retest reliability is appropriate for gauging the reliability of tests that employ outcome measures,
e.g., reaction time or perceptual judgements.
• Even when the time period between the two administrations of a test is relatively small,
various factors such as experience, fatigue, practice, and memory may intervene and confound
reliability (see the correlation sketch below).
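
A minimal sketch of the test-retest computation (the score arrays are hypothetical): the same group takes the same test twice, and the correlation between the two sets of scores serves as the reliability estimate.

```python
import numpy as np

# Minimal sketch: test-retest reliability as the correlation between the same
# participants' scores on two administrations of the same test (hypothetical data).
time_1 = np.array([23, 31, 28, 19, 35, 27, 30, 22])  # first administration
time_2 = np.array([25, 30, 27, 21, 34, 28, 29, 24])  # second administration, later

r_test_retest = np.corrcoef(time_1, time_2)[0, 1]
print(f"test-retest reliability estimate: {r_test_retest:.2f}")
```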
Parallel-Forms/Equivalent-Forms Reliability
• When a researcher creates two different but similar tests that measure the
same construct.
• Alternate forms are simply different versions of a test that have been
constructed so as to be parallel.
• The idea behind parallel or equivalent-forms reliability is to have two
conceptually identical tests that utilize separate questions to measure the
same construct of interest.
• Alternate forms of a test are typically designed to be equivalent with
respect to variables such as content and level of difficulty.
• Alternate forms reliability refers to an estimate of the extent to which
these different forms of the same test have been affected by item
sampling error or other error.
• Obtaining estimates of alternate-forms reliability is similar to test-retest reliability
in two ways:
– Two test administrations with the same group are required.
– Test scores may be affected by factors such as motivation, fatigue, or intervening
events like practice, learning, memory, or therapy.
An additional source of error variance is item sampling – test takers may do better or
worse on a specific form of the test not as a function of their true ability but
simply because of the particular items that were selected for that form.
• Having multiple items to measure the same construct can be a benefit of
using parallel or equivalent forms to create similar but different tests. The
challenge when multiple tests are created to measure the same construct is
that the items on the two versions of the test may not actually measure the
same construct (see the sketch below).
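
A minimal sketch of the parallel-forms computation, using hypothetical scores for one group that took both Form A and Form B. Comparing means and standard deviations is a rough check on equivalence of difficulty; the correlation between the forms is the reliability estimate.

```python
import numpy as np

# Minimal sketch: parallel-forms reliability for one group that took two
# alternate forms of the same test (hypothetical data).
form_a = np.array([42, 37, 45, 29, 38, 41, 33, 40])
form_b = np.array([40, 36, 46, 31, 37, 43, 34, 38])

print(f"Form A: mean={form_a.mean():.1f}, sd={form_a.std(ddof=1):.1f}")
print(f"Form B: mean={form_b.mean():.1f}, sd={form_b.std(ddof=1):.1f}")
print(f"parallel-forms reliability estimate: {np.corrcoef(form_a, form_b)[0, 1]:.2f}")
```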
Strengths & Limitations
• Developing alternate forms is time consuming and expensive.
• Test scores may be affected by test-taker variables.
• Once developed, alternate forms are advantageous for the test user:
– They minimize the effect of memory for the content of a previously
administered test, removing the carryover effect.
Split-half Reliability
• The purpose of split-half reliability is to divide the test/measure into two halves and test
the internal consistency of the items used.
• An estimate of split-half reliability is obtained by correlating two pairs of scores
obtained from two equivalent halves of a single test administered once.
• Split-half reliability differs from parallel or equivalent-forms reliability in a
couple of ways:
– Parallel or equivalent-forms reliability requires two versions of a test.
– With split-half, researchers conduct a single administration of the test/measure and split the test
into two equal halves (e.g., by the odd-even method).

• A useful measure of reliability when it is impractical to assess reliability with two tests
or to administer a test twice because of factors such as time and expense.
• Recommended for homogenous tests.
• One common criticism of this technique is the difficulty of determining where to split the test,
because the reliability estimate depends on how the items are divided within the measure.
• There is more than one way to split a test – but there are some ways a test should
never be split.
– Simply dividing the test in the middle is not recommended – it is likely that this procedure
would spuriously raise or lower the reliability coefficient.
• One acceptable way to split a test is to randomly assign items to one or the other half of the
test.
• Another acceptable way is to assign odd-numbered items to one half of
the test and even-numbered items to the other half. This method is referred to as
odd-even reliability (a minimal sketch follows below).
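
A minimal sketch of the odd-even split described above, using a hypothetical matrix of item scores from a single administration (rows are test takers, columns are items):

```python
import numpy as np

# Minimal sketch: odd-even split-half reliability from one administration.
# Rows are test takers, columns are eight items scored 0/1 (hypothetical data).
items = np.array([
    [1, 1, 0, 1, 1, 0, 1, 1],
    [0, 1, 0, 0, 1, 0, 1, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [0, 0, 0, 1, 0, 0, 1, 0],
    [1, 0, 1, 1, 1, 0, 1, 1],
])

odd_half = items[:, 0::2].sum(axis=1)   # items 1, 3, 5, 7
even_half = items[:, 1::2].sum(axis=1)  # items 2, 4, 6, 8

r_split_half = np.corrcoef(odd_half, even_half)[0, 1]
print(f"split-half (odd-even) correlation: {r_split_half:.2f}")
```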
Alpha Coefficient
• Internal consistency reliability – coefficient alpha or Cronbach’s alpha.
• A widely reported measure of reliability (Hogan, Benjamin, & Brezinski, 2003).
• Similar to split-half reliability in that it also measures the internal consistency, or
correlation, between the items on a test.
• The main difference between split-half and coefficient alpha: the entire test is used to
estimate the correlation between the items, without splitting the test in half.
• Cronbach (1951) outlined that a coefficient alpha of greater than or equal to 0.7 is
generally acceptable.
• A very high Cronbach’s alpha may indicate redundancy among the items.
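
A minimal sketch of coefficient alpha, computed from all items of a single administration using the standard formula alpha = (k / (k - 1)) * (1 - sum of item variances / variance of total scores); the item scores are hypothetical.

```python
import numpy as np

# Minimal sketch: Cronbach's alpha from one administration, using every item
# without splitting the test (hypothetical Likert-type data, rows = respondents).
items = np.array([
    [3, 4, 3, 4, 3],
    [2, 2, 3, 2, 2],
    [4, 5, 4, 4, 5],
    [1, 2, 1, 2, 1],
    [3, 3, 4, 3, 3],
    [5, 4, 5, 5, 4],
], dtype=float)

k = items.shape[1]
item_variances = items.var(axis=0, ddof=1)
total_score_variance = items.sum(axis=1).var(ddof=1)

alpha = (k / (k - 1)) * (1 - item_variances.sum() / total_score_variance)
print(f"Cronbach's alpha: {alpha:.2f}")  # the lecture cites >= 0.70 as generally acceptable
```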
Inter-rater reliability
• Also known as inter-observer reliability or inter-judge reliability, it
assesses the level of agreement or consistency between multiple raters
or observers when they independently assess the same phenomenon,
event, or data. It is used to determine the extent to which different
raters, who may have different perspectives or judgments, provide
similar assessments or insights.
• Measuring the consistency of ratings across different raters.
• High inter-rater reliability indicates that the judgments made by
different raters are in agreement.
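
A minimal sketch of an inter-rater check with two hypothetical raters who independently scored the same ten observations; simple percent agreement and the correlation between the raters are shown (other agreement indices exist but are not covered here).

```python
import numpy as np

# Minimal sketch: inter-rater reliability for two raters scoring the same
# ten observations on a 1-5 scale (hypothetical data).
rater_1 = np.array([4, 3, 5, 2, 4, 3, 5, 1, 2, 4])
rater_2 = np.array([4, 3, 4, 2, 4, 3, 5, 1, 3, 4])

percent_agreement = np.mean(rater_1 == rater_2)
r_between_raters = np.corrcoef(rater_1, rater_2)[0, 1]

print(f"percent agreement: {percent_agreement:.0%}")
print(f"correlation between raters: {r_between_raters:.2f}")
```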
Intra-rater reliability
• Also known as intra-observer reliability or test-retest reliability.
• It assesses the consistency of ratings or measurements made by the same rater or
observer on two or more occasions when assessing the same phenomenon or data.
• When a researcher examines the consistency of one particular individual’s rating at
multiple points in time.
• The purpose of intra-rater reliability is to determine the stability
of an individual’s ratings at two different points in time.
• It is used to determine whether a single rater’s judgments or measurements are
consistent over time.
• High intra-rater reliability indicates that the rater provides consistent results when
assessing the same thing on different occasions.
Thank you
