Reliability Testing

The document discusses the concepts of validity and reliability in measurement instruments, defining validity as the ability to measure intended constructs and reliability as the consistency of results. It outlines methods for establishing reliability, including test-retest reliability, internal consistency, and various statistical techniques such as Cronbach’s Alpha and the Kuder-Richardson test. Additionally, it highlights factors affecting survey data reliability and provides examples of measuring customer satisfaction through surveys.


VALIDITY AND RELIABILITY

 Validity
– the ability of an instrument to measure what it intends to measure.

 Reliability
– the consistency of the results an instrument produces.
METHODS IN ESTABLISHING
RELIABILITY

a. Test-retest or Stability test

 measures the consistency of a test over time
 the same test is given to the same respondents twice.
 In other words, you give the same test twice to the same people at different times to see whether the scores agree.

For example, test on a Monday, then again the following Monday. The two sets of scores are then correlated.
Test-Retest Reliability Coefficients
Also called coefficients of stability, they vary between 0 and 1, where:

 1 : perfect reliability,
 ≥ 0.9 : excellent reliability,
 ≥ 0.8 < 0.9 : good reliability,
 ≥ 0.7 < 0.8 : acceptable reliability,
 ≥ 0.6 < 0.7 : questionable reliability,
 ≥ 0.5 < 0.6 : poor reliability,
 < 0.5 : unacceptable reliability,
 0 : no reliability.
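As an illustration only, the bands above can be expressed as a small Python helper (the function name is my own, not from the slides):

```python
# Map a test-retest reliability coefficient to the descriptive
# bands listed above (0 to 1, inclusive).
def reliability_label(r: float) -> str:
    if not 0 <= r <= 1:
        raise ValueError("coefficient must be between 0 and 1")
    if r == 1:
        return "perfect"
    if r >= 0.9:
        return "excellent"
    if r >= 0.8:
        return "good"
    if r >= 0.7:
        return "acceptable"
    if r >= 0.6:
        return "questionable"
    if r >= 0.5:
        return "poor"
    if r == 0:
        return "no reliability"
    return "unacceptable"

print(reliability_label(0.85))  # good
```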
To measure the reliability between two test administrations, use the Pearson Correlation Coefficient.

What is Pearson Correlation?

 Correlation between two sets of data is a measure of how well they are related.
 The most common measure of correlation in statistics is the Pearson Correlation.
 Its full name is the Pearson Product Moment Correlation (PPMC).
SUBJECT | AGE (x) | GLUCOSE LEVEL (y) | xy    | x²    | y²
1       | 43      | 99                | 4257  | 1849  | 9801
2       | 21      | 65                | 1365  | 441   | 4225
3       | 25      | 79                | 1975  | 625   | 6241
4       | 42      | 75                | 3150  | 1764  | 5625
5       | 57      | 87                | 4959  | 3249  | 7569
6       | 59      | 81                | 4779  | 3481  | 6561
Σ       | 247     | 486               | 20485 | 11409 | 40022
Step 6: Use the correlation coefficient formula:

r = [n(Σxy) − (Σx)(Σy)] / √{[nΣx² − (Σx)²][nΣy² − (Σy)²]}
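The computation can be sketched in Python, using the age and glucose values from the table above (the column sums come out as on the slide):

```python
import math

# Data from the worked example: x = age, y = glucose level
x = [43, 21, 25, 42, 57, 59]
y = [99, 65, 79, 75, 87, 81]
n = len(x)

sum_x, sum_y = sum(x), sum(y)
sum_xy = sum(a * b for a, b in zip(x, y))
sum_x2 = sum(a * a for a in x)
sum_y2 = sum(b * b for b in y)

# Pearson formula:
# r = [n(Sxy) - (Sx)(Sy)] / sqrt([n*Sx2 - (Sx)^2][n*Sy2 - (Sy)^2])
r = (n * sum_xy - sum_x * sum_y) / math.sqrt(
    (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
)
print(round(r, 4))  # ≈ 0.5298
```

A coefficient of about 0.53 would fall in the "poor" band of the table above.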
The reliability of survey data may depend on the following factors:

 Respondents may not feel encouraged to provide accurate, honest answers.
 Respondents may not feel comfortable giving answers that present them in an unfavorable manner.
 Respondents may not be fully aware of their reasons for any given answer, because of lapses of memory on the subject, or even boredom.
b. Internal Consistency
 the degree of interrelationship or homogeneity among the items on a test, such that they are consistent with one another and measure the same thing.
Example:

 You want to find out how satisfied your customers are with the level of customer service they receive at your call center.
 You send out a survey with three questions designed to measure overall satisfaction.
Choices for each question are:
 Strongly agree
 Agree
 Neutral
 Disagree
 Strongly disagree

1. I was satisfied with my experience.
2. I will probably recommend your company to others.
3. If I write an online review, it would be positive.
If the survey has good internal consistency, respondents should give similar answers to each question, e.g. three "agrees" or three "strongly disagrees."

If very different answers are given, this may be a sign that your questions are poorly worded and are not reliably measuring customer satisfaction.
 There are three main techniques for
measuring the
internal consistency reliability,
depending upon the degree,
complexity and scope of the test.
1. Cronbach’s Alpha Test
2. Split half Test
3. Kuder-Richardson Test
Cronbach’s Alpha Test

 most commonly used when you want to assess the internal consistency of a questionnaire (or survey) that is made up of multiple Likert-type scales and items.
Cronbach’s Alpha Formula

α = [k / (k − 1)] × (1 − Σs²ᵢ / s²y)

Where:
k = number of items
Σs²ᵢ = the sum of the variances of the individual items
s²y = the variance of the Total column (the total scores)
Cronbach’s Alpha turns out to be 0.773.

Values of Cronbach’s Alpha are usually interpreted on the same bands as the test-retest coefficients above: ≥ 0.9 excellent, ≥ 0.8 good, ≥ 0.7 acceptable, ≥ 0.6 questionable, ≥ 0.5 poor, < 0.5 unacceptable.
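As a sketch of the computation, here is Cronbach’s Alpha on invented 1-to-5 Likert responses (the data below is an assumption for illustration, not the slide’s example, which is why the result differs from 0.773):

```python
from statistics import pvariance

# Hypothetical survey data: 5 respondents x 3 Likert items (1-5)
scores = [
    [4, 5, 4],
    [3, 4, 3],
    [5, 5, 5],
    [2, 3, 2],
    [4, 4, 3],
]

k = len(scores[0])                                    # number of items
item_vars = [pvariance(col) for col in zip(*scores)]  # s_i^2 for each item
total_var = pvariance([sum(row) for row in scores])   # s_y^2 of the totals

# alpha = [k / (k - 1)] * (1 - sum(s_i^2) / s_y^2)
alpha = (k / (k - 1)) * (1 - sum(item_vars) / total_var)
print(round(alpha, 3))  # ≈ 0.962
```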
Kuder-Richardson test

 Kuder-Richardson Formula 20, or KR-20, is a measure of reliability for a test with binary variables (i.e. answers that are right or wrong). It should only be used if there is a correct answer for each question.
 used for dichotomous items

The scores for KR-20 range from 0 to 1, where 0 is no reliability and 1 is perfect reliability. The closer the score is to 1, the more reliable the test.

KR-20 = [k / (k − 1)] × (1 − Σpq / s²)

Where:
k = number of items on the test
s² = variance of the total test scores
p = proportion of people passing the item
q = proportion of people failing the item
Σ = sum up (add up)
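The KR-20 computation can be sketched on invented right(1)/wrong(0) answers (the answer matrix below is an assumption for illustration):

```python
from statistics import pvariance

# Hypothetical test results: 5 examinees x 4 dichotomous items
answers = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

k = len(answers[0])            # number of items on the test
n = len(answers)               # number of examinees
sum_pq = 0.0
for item in zip(*answers):
    p = sum(item) / n          # proportion passing the item
    q = 1 - p                  # proportion failing the item
    sum_pq += p * q

var_total = pvariance([sum(row) for row in answers])  # s^2 of total scores

# KR-20 = [k / (k - 1)] * (1 - sum(pq) / s^2)
kr20 = (k / (k - 1)) * (1 - sum_pq / var_total)
print(round(kr20, 3))  # 0.8 for this data
```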
Split Half

 Assesses the internal consistency of the test. It measures the extent to which all parts of the test contribute equally to what is being measured.
 It is commonly used for multiple-choice tests.
 A test covering a single knowledge area is split into two halves, and both halves are given to one group of students at the same time. The scores on the two halves are then correlated.

Since this correlation coefficient describes only half of the test, the reliability of the whole test is calculated from the half-test correlation using the Spearman-Brown formula:

r(full test) = 2 × r(half test) / [1 + r(half test)]
A coefficient of 0 means no reliability and 1.0 means perfect reliability. Generally, if the reliability is above 0.80 the test is said to have good reliability; below 0.50 it would not be considered a very reliable test.
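A minimal sketch of the split-half procedure, assuming invented half-test scores for five students: correlate the two halves, then apply the Spearman-Brown correction to estimate full-test reliability:

```python
import math

# Hypothetical totals of each student on the two halves of a test
half1 = [10, 8, 9, 6, 7]
half2 = [9, 7, 9, 5, 8]
n = len(half1)

# Pearson correlation between the two half-test scores
sx, sy = sum(half1), sum(half2)
sxy = sum(a * b for a, b in zip(half1, half2))
sx2 = sum(a * a for a in half1)
sy2 = sum(b * b for b in half2)
r_half = (n * sxy - sx * sy) / math.sqrt(
    (n * sx2 - sx ** 2) * (n * sy2 - sy ** 2)
)

# Spearman-Brown correction: reliability of the full-length test
r_full = 2 * r_half / (1 + r_half)
print(round(r_half, 3), round(r_full, 3))  # ≈ 0.85 and 0.919
```

Note how the corrected coefficient is higher than the half-test correlation: a longer test of equally good items is more reliable.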
Test-retest
Measuring a property that you
expect to stay the same over time.

Internal consistency
Using a multi-item test where all
the items are intended to measure
the same variable.
You devise a questionnaire to
measure the IQ of a group of
participants.

- Test-retest
A test of color blindness for trainee
pilot applicants should have high
test-retest reliability, because color
blindness is a trait that does not
change over time.
Example:

 You want to find out how satisfied your customers are with the level of customer service they receive at your call center.
 You send out a survey with three questions designed to measure overall satisfaction.

- Internal Consistency
