8602
Education Assessment
         and Evaluation
Submitted by: Ayesha Khalid
Assignment number 1
B.ED 1.5 year
Allama Iqbal Open University
8/8/2022
Q. 1 What is the role of teacher in classroom assessment? Write a detailed note.
According to Carole Tomlinson “Assessment is today's means of modifying tomorrow's
instruction." It is an integral part of teaching learning process. It is widely accepted that
effectiveness of teaching learning process is directly influenced by assessment. Hamidi
(2010) developed a framework to answer the Why; What, How and When to assess.
This is helpful in understanding the true nature of this concept.
Why to Assess: Teachers have clear goals for instruction and they assess to ensure that
these goals have been or are being met. If objectives are the destination, instruction is
the path to it then assessment is a tool to keep the efforts on track and to ensure that the
path is right. After the completion of journey assessment is the indication that
destination is ahead.
What to Assess: Teachers cannot assess whatever they themselves like. In classroom
assessment, teachers are supposed to assess students' current abilities in a given skill or
task. The teacher can assess students’ knowledge, skills or behaviour related to a
particular field.
Who to Assess: It may seem strange to ask whom a teacher should assess in the
classroom, but the issue is of great concern. Teachers should treat students as 'real
learners', not as course or unit coverers. They should also predict that some students are
more active and some are less active; some are quick at learning and some are slow at
it. Therefore, classroom assessment calls for a prior realistic appraisal of the individuals
teachers are going to assess.
How to Assess: Teachers employ different instruments, formal or informal, to assess
their students. Brown and Hudson (1998) reported that teachers use three sorts of
assessment     methods     –    selected-response    assessments,     constructed-response
assessments, and personal-response assessments. They can adjust the assessment types
to what they are going to assess.
When to Assess: There is a strong agreement of educationists that assessment is
interwoven into instruction. Teachers continue to assess the students learning
throughout the process of teaching. They particularly do formal assessments when they
are going to make instructional decisions at the formative and summative levels, even
if those decisions are small. For example, they assess when there is a change in the
content; when there is a shift in pedagogy, when the effect of the given materials or
curriculum on learning process is examined.
How much to Assess: There is no touchstone to weigh the degree to which a teacher
should assess students. But it doesn't mean that teachers can evaluate their students to
the extent that they prefer. It is generally agreed that as students differ in ability, learning
styles, interests and needs etc so assessment should be limited to every individual's
needs, ability and knowledge. Teachers’ careful and wise judgment in this regard can
prevent teachers from over assessment or underassessment.
Principles of Classroom Assessment Conducted by teacher:
Hamidi (2010) described following principles of classroom assessment.
    1. Assessment should be formative. Classroom assessment should be carried out
        regularly in order to inform on-going teaching and learning. It should be
        formative because it refers to the formation of a concept or process. Teachers
        use it to see how far learners have mastered what they should have learned. So
        classroom assessment needs fully to reach its formative potential if a teacher is
        to be truly effective in teaching.
    2. Should determine planning. Classroom assessment should help teachers plan
        for future work. First, teachers should identify the purposes for assessment –
        that is, specify the kinds of decisions teachers want to make as a result of
        assessment. Second, they should gather information related to the decisions they
   have made. Next, they interpret the collected information—that is, it must be
   contextualized before it is meaningful. Finally, they should make the final, or
   the professional, decisions.
3. Assessment should serve teaching. Classroom assessment serves teaching
   through providing feedback on pupils' learning that would make the next
   teaching event more effective, in a positive, upwards direct. Therefore,
   assessment must be an integral part of instruction. Assessment seems to drive
   teaching by forcing teachers to teach what is going to be assessed. Teaching
   involves assessment; that is, whenever a student responds to a question, offers
   a comment, or tries out a new word or structure, the teacher subconsciously
   makes an assessment of the student’s performance.
4. Assessment should serve learning. Classroom assessment is an integral part
   of learning process as well. The ways in which learners are assessed and
   evaluated strongly affect the ways they study and learn. It is the process of
   finding out who the students are, what their abilities are, what they need to know,
   and how they perceive the learning will affect them.
5. Assessment should be curriculum-driven. Classroom assessment should be
   the servant, not the master, of the curriculum. Assessment specialists view it as
   an integral part of the entire curriculum cycle. Therefore, decisions about how
   to assess students must be considered from the very beginning of curriculum
   design or course planning.
6. Assessment should be student-centered. Since learner-centered methods of
   instruction are principally concerned with learner needs, students are
   encouraged to take more responsibility for their own learning and to choose
   their own learning goals and projects. Therefore, in learner-centered assessment,
   they are actively involved in the process of assessment. Involving learners in
   aspects of classroom assessment minimizes learning anxiety and results in
   greater student motivation.
7. Assessment should be diagnostic. Classroom assessment is diagnostic because
   teachers use it to find out learners' strengths and weaknesses during the in-
       progress class instruction. They also identify learning difficulties. If the purpose
       of assessment is to provide diagnostic feedback, then this feedback needs to be
       provided in a form – either verbal or written – that is for learners to understand
       and use.
   8. Assessment should be exposed to learners. Teachers are supposed to enlighten
       learners' accurate information about assessment. In other words, it should be
       transparent to learners. They must know when the assessments occur, what they
       cover in terms of skills and materials, how much the assessments are worth, and
       when they can get their results and the results are going to be used.
   9. Assessment should be non-judgmental. In the classroom assessment,
       everything focuses on learning which results from a number of such factors as
       student needs, student motivation, teaching style, time on task, study intensity,
       background knowledge, course objectives, etc. So there is no praise or blame
       for a particular outcome of learning.
   10. Assessment should involve reflective teaching. Reflective teaching is an
       approach instruction in which teachers are supposed to develop their
       understanding of teaching (quality) based on data/information obtained and
       collected through critical reflection on their teaching experiences. This
       information can be gathered through formative assessment (i.e., using different
       methods and tools such as class quizzes, questionnaires, surveys, field notes,
       feedback from peers, classroom ethnographies, observation notes, etc) and
       summative assessment (i.e., different types of achievement tests taken at the end
       of the term).
Q.2 Define learning outcomes and objectives. Differentiate between them.
What is Learning Outcome? Learning outcomes are the statements indicating what a
student is expected to be able to do as a result of a learning activity. Major difference
between learning objectives and out comes is that objectives are focused upon the
instruction, what will be given to the students and the outcomes are focused upon the
students what behaviour change they are being expected to show as the result of the
instruction.
What is Learning Objective: A learning objective refers to the statement of what
students will obtain through instruction of certain content. In other words ‘an objective
is a description of a performance you want learners to be able to exhibit before you
consider them competent. An objective describes an intended result of instruction,
rather than the process of instruction itself.’
Difference between Learning Outcomes and Objectives: Learning outcomes and
objectives’ are often used synonymously, although they are not the same. In simple
words, objectives are concerned with teaching and the teacher’s intentions whereas
learning outcomes are concerned with students learning.
Q.3 What are aptitude tests used for? Define the types of aptitude tests.
Aptitude Tests
Aptitude tests assume that individuals have inherent strengths and weaknesses, and are
naturally inclined toward success or failure in certain areas based on their inherent
characteristics.
 Aptitude tests determine a person's ability to learn a given set of information. They do
not test a person's knowledge of existing information. The best way to prepare for
aptitude tests is to take practice tests.
Aptitude and ability tests are designed to assess logical reasoning or thinking
performance. They consist of multiple choice questions and are administered under
exam conditions. They are strictly timed and a typical test might allow 30 minutes for
30 or so questions. Test result will be compared to that of a control group so that
judgments can be made about your abilities.
You may be asked to answer the questions either on paper or online. The advantages of
online testing include immediate availability of results and the fact that the test can be
taken at employment agency premises or even at home. This makes online testing
particularly suitable for initial screening as it is obviously very cost-effective.
Types of Aptitude Test
The following is a list of the different types of aptitude test that are used for assessment
process.
    (a) Critical Thinking:
        Critical thinking is defined as a form of reflective reasoning which analyses and
        evaluates information and arguments by applying a range of intellectual skills
        in order to reach clear, logical and coherent judgments within a given context.
        Critical thinking tests force candidates to analyse and evaluate short passages
        of written information and make deductions to form answers.
    (b) Numerical Reasoning Tests
        Numerical tests, sometimes known as numerical reasoning, are used during the
        application process at all major investment banks and accountancy &
        professional services firms. Test can be either written or taken online. The tests
        are usually provided by a third party. Perceptual Speed Tests Perceptual speed
        is the ability to quickly and accurately compare letters, numbers, objects,
        pictures, or patterns. In tests of perceptual speed the things to be compared may
       be presented at the same time or one after the other. Candidates may also be
       asked to compare a presented object with a remembered object.
   (c) Spatial Visualization Tests
       Spatial visualization ability or Visual-spatial ability refers to the ability to
       mentally manipulate 2-dimensional and 3-dimensional figures. It is typically
       measured with simple cognitive tests and is predictive of user performance with
       some kinds of user interfaces.
   (d) Logical Reasoning Tests
       Logical reasoning aptitude tests (also known as Critical Reasoning Tests) may
       be either verbal (word based, e.g. "Verbal Logical Reasoning"), numerical
       (number based, e.g. "Numerical Logical Reasoning") or diagrammatic (picture
       based, see diagrammatic tests for more information).
   (e) Verbal Reasoning Tests
       Verbal reasoning tests are a form of aptitude test used by interviewers to find
       out how well a candidate can assess verbal logic. In a verbal reasoning test, you
       are typically provided with a passage, or several passages, of information and
       required to evaluate a set of statements by selecting one of the following
       possible answers.
Q.4 Write advantages and disadvantages of matching type last item.
Matching items
According to Cunningham (1998), the matching items consist of two parallel columns.
The column on the left contains the questions to be answered, termed premises; the
column on the right, the answers, termed responses. The student is asked to associate
each premise with a response to form a matching pair.
Matching test items are used to test a student's ability to recognize relationships and to
make associations between terms, parts, words, phrases, clauses, or symbols in one
column with related alternatives in another column. When using this form of test item,
it is a good practice to provide alternatives in the response column that are used more
than once, or not at all, to preclude guessing by elimination. Matching test items may
have either an equal or unequal number of selections in each column. Matching-Equal
Columns. When using this form, providing for some items in the response column to
be used more than once, or not at all, can preclude guessing by elimination.
Good for:
• Knowledge level
• Some comprehension level, if appropriately constructed
Types:
• Terms with definitions
• Phrases with other phrases
• Causes with effects
• Parts with larger units
• Problems with solutions
Advantages:
The chief advantage of matching exercises is that a good deal of factual information
can be tested in minimal time, making the tests compact and efficient. They are
especially well suited to who, what, when and where types of subject matter.
Further students frequently find the tests fun to take because they have puzzle qualities
to them.
• Maximum coverage at knowledge level in a minimum amount of space/prep time
• Valuable in content areas that have a lot of facts
Disadvantages:
 The principal difficulty with matching exercises is that teachers often find that the
subject matter is insufficient in quantity or not well suited for matching terms. An
exercise should be confined to homogeneous items containing one type of subject
matter (for instance, authors-novels; inventions inventors; major events-dates terms –
definitions; rules examples and the like). Where unlike clusters of questions are used to
adopt but poorly informed student can often recognize the ill-fitting items by their
irrelevant and extraneous nature (for instance, in a list of authors the inclusion of the
names of capital cities).
Student identifies connected items from two lists. It is useful for assessing the ability to
discriminate, categorize, and association amongst similar concepts.
• Time consuming for students
• Not good for higher levels of learning
 Q.5 How will you define relaibility of test? Also write its types.
Reliability means Trustworthy. A test score is called reliable when we have reasons for
believing the test score to be stable and objective. For example if the same test is given
to two classes and is marked by different teachers even then it produced the similar
results, it may be considered as reliable. Stability and trustworthiness depends upon the
degree to which score is free of chance error.
According to Merriam Webster Dictionary:
“Reliability is the extent to which an experiment, test, or measuring procedure yields
the same results on repeated trials.”
According to Hopkins & Antes (2000):
“Reliability is the consistency of observations yielded over repeated recordings either
for one subject or a set of subjects.”
Joppe (2000) defines reliability as:
“…The extent to which results are consistent over time and an accurate representation
of the total population under study is referred to as reliability and if the results of a
study can be reproduced under a similar methodology, then the research instrument is
considered to be reliable.”
The more general definition of reliability is:
The degree to which a score is stable and consistent when measured at different times
(test-retest reliability), in different ways (parallel-forms and alternate-forms), or with
different items within the same scale (internal consistency).
Types of Reliability
Reliability is one of the most important elements of test quality. It has to do with the
consistency, or reproducibility, of an examinee's performance in the test. It's not
possible to calculate reliability exactly. Instead, we have to estimate reliability, and this
is always 102 an imperfect attempt. Here, we introduce the major reliability estimators
and talk about their strengths and weaknesses.
There are six general classes of reliability estimates, each of which estimates reliability
in a different way. They are:
   i)      Inter-Rater or Inter-Observer Reliability
           To assess the degree to which different raters/observers give consistent
           estimates of the same phenomenon. That is if two teachers mark same test
           and the results are similar, so it indicates the inter-rater or inter-observer
           reliability.
   ii)     Test-Retest Reliability:
   iii)    To assess the consistency of a measure from one time to another, when a
           same test is administered twice and the results of both administrations are
           similar, this constitutes the test-retest reliability. Students may remember
           and may be mature after the first administration creates a problem for test-
           retest reliability.
   iv)     Parallel-Form Reliability:
           To assess the consistency of the results of two tests constructed in the same
           way from the same content domain. Here the test designer tries to develop
           two tests of the similar kinds and after administration the results are similar
           then it will indicate the parallel form reliability.
   v)      Internal Consistency Reliability:
           To assess the consistency of results across items within a test, it is
           correlation of the individual items score with the entire test.
   vi)     Split half Reliability:
           To assess the consistency of results comparing two halves of single test,
           these halves may be even odd items on the single test.
   vii)    Kuder-Richardson Reliability:
           To assess the consistency of the results using all the possible split halves of
           a test. Let's discuss each of these in turn.