Understanding Test Score Types

This document covers the different types of test scores: raw scores, percentile ranks, and standard scores (z-scores, T-scores, stanines, and normal curve equivalents). Test scores can be interpreted within norm-referenced or criterion-referenced frameworks: norm-referenced interpretations compare a student's performance to peers, while criterion-referenced interpretations determine whether a student has met a predefined standard of mastery. Both frameworks require defining the achievement domain measured and valid, reliable test items and assessment methods, though they differ in how scores are scaled and how mastery is determined.

UTILIZATION OF ASSESSMENT DATA
Chapter 10
TYPES OF TEST SCORES

Raw and Percentage Scores

• Raw scores are obtained by simply counting the number of correct
responses in a test, following the scoring directions. A percentage
score expresses the raw score as a percent of the total number of items.
Percentile Rank
• A percentile rank gives the percent of scores in a reference sample
that fall at or below a given raw or standard score; it is used to rank
students within that sample. It should not be confused with the
percentage of correct answers.
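The "at or below" definition can be sketched in a few lines of Python; the reference sample below is hypothetical:

```python
def percentile_rank(score, sample):
    """Percent of scores in the reference sample at or below the given score."""
    at_or_below = sum(1 for s in sample if s <= score)
    return 100 * at_or_below / len(sample)

scores = [10, 12, 15, 15, 18, 20, 22, 25, 27, 30]
print(percentile_rank(20, scores))  # 6 of 10 scores are at or below 20 -> 60.0
```

Note that a raw score of 20 here has a percentile rank of 60 even though the percentage-correct score could be something entirely different, which is exactly the confusion the text warns against.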
Standard Scores
• It is difficult to use raw scores when making comparisons between
groups on different tests, since tests may have different levels of
difficulty. To remedy this, raw scores may be transformed into derived
scores.
• A normal curve represents a normal distribution – a symmetric
theoretical distribution. The mean (arithmetic average), median
(score that separates the upper and lower 50%) and mode (most
frequently occurring score) are located at the center of the bell
curve where the peak is.
• The curve approaches the horizontal axis asymptotically. The empirical rule (68-
95-99.7 rule) shows the connection between the normal distribution and the
standard deviation. It states that 68% of the scores fall within 1 standard
deviation of the mean, 95% of the scores within 2 standard deviations, and
almost all (99.7%) of the scores within 3 standard deviations of the mean.
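A quick simulation illustrates the empirical rule. This is only a sketch using pseudo-random draws from a standard normal distribution, with a fixed seed so the proportions are reproducible:

```python
import random

random.seed(0)  # fixed seed for reproducible proportions
sample = [random.gauss(0, 1) for _ in range(100_000)]

# Fraction of draws within 1, 2 and 3 standard deviations of the mean
for k, expected in [(1, 68.0), (2, 95.0), (3, 99.7)]:
    pct = 100 * sum(1 for x in sample if abs(x) <= k) / len(sample)
    print(f"within {k} SD: {pct:.1f}% (rule says ~{expected}%)")
```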
STANDARD SCORES

A. Z-score
• The z-score gives the number of standard deviations a test score lies above
or below the mean. The formula is z = (X − M) / SD, where X is the test score,
M is the average score, and SD is the standard deviation. A negative z-score
means the score is below the average, while a positive z-score means it is
above the average.
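As a minimal sketch, the formula translates directly into code. The mean of 18 and SD of 4 below are hypothetical values, chosen so that a raw score of 12 yields the z-score of −1.5 used in the examples that follow:

```python
def z_score(x, mean, sd):
    """Number of standard deviations x lies above (+) or below (-) the mean."""
    return (x - mean) / sd

# Hypothetical test: mean 18, standard deviation 4
print(z_score(12, 18, 4))  # -1.5, i.e. 1.5 SDs below the mean
```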
B. T-score
• In the T-score scale, the mean is set to 50 instead of 0, and the standard
deviation is 10. To transform a z-score to a T-score, multiply the z-score
by 10 and add 50, i.e., T = 10z + 50. A z-score of −1.5 thus converts to a
T-score of 35. Like the corresponding z-score, a T-score of 35 indicates that
the raw score of 12 is 1.5 standard deviations below the mean, implying that
the student performed below average on the test.
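The linear conversion T = 10z + 50 is a one-liner:

```python
def t_score(z):
    """Rescale a z-score to the T scale (mean 50, SD 10)."""
    return 10 * z + 50

print(t_score(-1.5))  # 35.0, as in the example above
```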
C. Stanine
• Stanine, short for standard nine, is a method of scaling scores on a nine-point
scale. A raw score is converted to a whole number from a low 1 to a high 9.
Unlike the z-score, whose mean and standard deviation are 0 and 1,
respectively, stanines have a mean of 5 and a standard deviation of 2.
• Stanine scores of 1, 2 and 3 are below average; 4, 5 and 6 are average; 7, 8
and 9 are above average. In the previous example, the stanine equivalent of a
raw score of 12 is 2. This can be calculated from the z-score by multiplying
it by 2 and adding 5, i.e. stanine = 2z + 5. A stanine value of 2 shows that
the student’s performance level is below average.
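A sketch of the 2z + 5 shortcut follows. Note that operational stanines are usually assigned from fixed percentile bands of a norm group, so rounding and clamping the linear formula is only an approximation:

```python
def stanine(z):
    """Approximate stanine: round 2z + 5 and clamp to the 1-9 scale."""
    return max(1, min(9, round(2 * z + 5)))

print(stanine(-1.5))  # 2 -> below average, matching the raw score of 12 above
```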
D. Normal Curve Equivalent
• The Normal Curve Equivalent (NCE) is a normalized standard score within
the range 1 – 99. It has a mean of 50 and a standard deviation of 21.06.
Caution should be exercised when converting a raw score to NCE because
the latter requires a representative national sample.
• NCE scores are preferred by some because of their equal-interval property:
differences between tests, and among subtests in a test battery, can be
meaningfully calculated and examined.
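Given a z-score that already comes from a representative national sample, the NCE scale is another linear rescaling. This sketch clamps the result to the 1-99 range:

```python
def nce(z):
    """Normal Curve Equivalent: mean 50, SD 21.06, restricted to 1-99."""
    return max(1.0, min(99.0, 50 + 21.06 * z))

print(round(nce(-1.5), 2))  # 18.41
```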
E. Developmental Scores
• A grade equivalent (GE) describes a learner's developmental growth, giving a
picture of where he/she is on an achievement continuum.
• Note that a GE is an estimate of a learner's location on the developmental
continuum, not the grade level where he/she should be placed (Frechtling &
Myerberg, 1983). One criticism of GE is that it assumes equal learning occurs
throughout the year, which is rarely the case.
• Further, it cannot be used to compare a student’s performance in different tests/subtests.
Analogous to grade equivalents are age equivalent scores. They are interpreted similarly.
• Developmental scores like grade and age equivalents promote typological thinking –
categorizing abilities and performance by age or grade without due consideration
of individual variation.
TYPES OF TEST SCORE INTERPRETATIONS
• A frame of reference is some well-defined performance domain (Mehrens &
Lehmann, 1985). This is needed to make sense of test scores. Scores and marks
may be explained in relation to a norm or criterion. These references were
conceived and differentiated by American psychologist Robert Glaser in 1963.
The use of norm and criterion-reference measures hinges on the purpose of
assessment.
• Rendering a judgement of whether a learner passes or fails is criterion-referenced.
• Selection is made based on norm-referenced scores.
A. NORM-REFERENCED INTERPRETATIONS
• The term “norm” originated from the Latin word norma which means precept or rule. By
definition, it pertains to the average score in a test. Apart from school average norm, there are
other types of norms that can be reported: international, national and local norm groups, and
special norm groups (e.g. students who are visually impaired).
• Norm-referenced interpretations are explanations of a learner’s performance in comparison with
other learners of the same age or grade. A learner’s knowledge is gauged in terms of his/her
position in the norm group. The use of standard scores and percentile rank are common in norm-
referenced interpretations.
• When using percentile ranks, Campbell (1995) cautioned test interpreters that (1) percentile units
are not necessarily equal in size and (2) it is of critical importance that the norms be developed
using a comparable cohort of students.
• As pointed out, norm-referenced evaluations determine the learner's place or rank.
Assessment instruments that lend themselves to this kind of interpretation include
standardized aptitude and achievement tests, teacher-made survey tests, interest
inventories and adjustment inventories.
• According to Kubiszyn & Borich (2010), a norm-referenced assessment tends to be
general as it covers a wider scope of content measuring a variety of skills. Because there
are several objectives covered, only one or two items are sampled for each learning
objective.
• Moreover, the difficulty of test items varies, which is likely to result in greater
variability in test scores. This implies that scores are dispersed, making relative
standings more telling. After all, the purpose of norm-referenced measures is to
discriminate between high and low achievers.
FIVE GUIDELINES WHEN INTERPRETING
NORM-REFERENCED TEST SCORES

1. Detect any unexpected pattern of scores.
2. Determine the reasons for score patterns.
3. Do not expect surprises for every student.
4. Small differences in subtest scores should be viewed as chance fluctuations.
5. Use information from various assessments and observations to explain
performance on other assessments.
B. CRITERION-REFERENCED INTERPRETATIONS
• The word “criterion” came from the Greek word kriterion, which means standard.
Criterion-referenced interpretations thus give meaning to test scores by describing what
the learner can and cannot do in light of a standard. Hence, test scores allow for
absolute rather than comparative interpretations.
• A learner's performance is explained in relation to a pre-determined criterion of
mastery. Mastery tests generate criterion-referenced interpretations, as do teacher-made
tests, skill tests, competency tests, performance assessments, and licensure examinations.
• Multiple-choice questions, alternate-response items, short-response items and essays
enable criterion-referenced interpretations, since these test formats can be used to
measure a specific body of knowledge or set of skills the students have acquired.
• What matters is the link between the test items and the criteria set forth. Unlike in
norm-referenced interpretations, where most students are categorized as average, it is
quite possible in a criterion-referenced test for most students to achieve an acceptable
or high level of proficiency, or for most to fall short of it.
• Criterion-referenced scores include percentage correct, speed of performance, quality
ratings and precision of performance (Nitko & Brookhart, 2011). When interpreting
standardized test scores, there is an assumption of normality in the distribution of test
scores.
• This is not necessary in a criterion-referenced framework. When the test scores
are negatively skewed (skewed to the left), there are more high scores. This
implies that students performed well and reflects the quality of instruction
they received.
• However, it may also mean that the test items were easy. If the scores are
positively skewed (skewed to the right), there are more low scores: the students
performed poorly, or the test items may have been difficult.
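The direction of skew can be checked numerically. This sketch computes the third standardized moment for a hypothetical set of scores in which most students scored high:

```python
def skewness(scores):
    """Sample skewness (third standardized moment); negative means a tail of low scores."""
    n = len(scores)
    mean = sum(scores) / n
    sd = (sum((x - mean) ** 2 for x in scores) / n) ** 0.5
    return sum(((x - mean) / sd) ** 3 for x in scores) / n

high_heavy = [9, 9, 10, 10, 10, 8, 9, 10, 4, 10]  # most scores high, one low outlier
print(skewness(high_heavy) < 0)  # True: negatively skewed (skewed to the left)
```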
• Criterion-referencing is used in diagnosing students’ needs and monitoring their
progress. It is likewise used in certification and program evaluation. It is the
preferred mode of assessment in an outcome-based education framework.
• The foregoing discussion reveals the differences between the two major frames
of reference. Despite the differences, there are also commonalities. Miller, Linn
& Gronlund (2009) stated that both require specification of the achievement
domain to be measured, as well as a relevant and representative sample of test
items. They added that the same item-writing rules (except for item difficulty)
are followed, and the principles of validity and reliability are still observed.
