Language Assessment Principles & Practices
Language Assessment Principles & Practices
To measure synthesis (integrate learning from different areas or 1. The Essay-Translation Approach
solve problems by creative thinking), ask these kinds of questions: a. Characteristics and Types of Tests in Essay Translation
Categorize, Combine, Compile, Devise, Design, Explain, Generate, Approach:
Organize, Plan, Rearrange, Reconstruct, Revise, Tell. 1. This approach is commonly referred to as the pre-
To measure evaluation (judging and assessing), ask these kinds of scientific stage of language testing
questions: Appraise, Compare, Conclude, Contrast, Criticize, 2. No special skill or expertise in testing is required.
Describe, Discriminate, Explain, Justify, Interpret, Support. 3. Tests usually consist of essay writing, translation, and
grammatical analysis.
Kinds of Language Test 4. Tests have a heavy literary and cultural bias.
5. Public examinations resulting from the tests using this
1. Proficiency Test approach sometimes have an oral component at the upper
What is language proficiency test? intermediate and advance levels.
b. Strengths of Essay-Translation Approach
•Proficiency refers to the ability to perform an action or function. 1. This approach is easy to follow because teachers will
The language proficiency test is an evaluation of how well a person simply use their subjective judgement.
can use language to communicate in real life. 2. The essay-translation approach may be used for testing
• Proficiency test are designed to measure people’s ability in any level of examinees.
language, regardless of any training they may have had in that 3. The model of tester can easily be modified based on the
language. essentials of the tests.
c. Weaknesses of Essay-Translation Approach
1. Subjective judgement of teachers tends to be biased.
2. As mentioned, the tests have a heavy literary and 4. The Communicative Approach
cultural bias. a. Characteristics and types of tests of communicative Approach:
1. Communicative tests are concerned primarily with how
language is used in communication.
2. Language use is often emphasized to the exclusion of language
usage.
2. The Structuralist Approach
3. The attempt to measure different language skills in communicative
a. Characteristics and Types of Tests of the Structuralist tests is based on a view of language referred to as the divisibility
Approach: hypothesis.
1. This approach views that language learning is chiefly concerned
with a systematic acquisition of a set of habits. 4. The test content should totally be relevant for a particular group of
examinees and the tasks set should relate to real-life situation.
2. The structuralist approach involves structural linguistics which
stresses the importance of constructive analysis and the need to 5. Communicative testing introduces the concept of quantitative
identify and measure the learners mastery of the separate elements modes of assessment in preference to quantitative modes of
of the target language such as phonology, vocabulary and grammar. assessment.
3. Testing the skills of listening, speaking, reading and writing is b. Strengths of Communicative Approach
separate from another as much as possible.
1. Communicative tests are able to measure all integrated skills of
4. The psychometric approach to measurement with its emphasis on students.
reliability and objectivity forms an integral part of structuralist testing.
2. The tests using this approach face students in real life so it will be
b. Strengths of Structuralist Approach very useful for them.
1. In testing students’ capability, this approach may objectively and
surely be used by testers.
3. Because a communicative test can measure all language skills, it
can help students in getting the score. Consider students who have a
2. Many forms of tests can be covered in the test in a short time.
poor ability in using spoken language but may score quite highly on
tests of reading.
3. Using this approach in testing will help students find their strengths
and weaknesses in every skill they study.
4. Detailed statements of each performance level serve to increase
the reliability of the scoring by enabling the examiner to make
c. Weaknesses of Structuralist Approach
decisions according to carefully drawn-up and well-established
1. It tends to be a complicated job for teachers to prepare
criteria.
questionnaires using this approach.
c. Weaknesses of Communicative Approach
2. This approach considers measuring non-integrated skills more
than integrated skills.
1. Unlike the structuralist approach, this approach does not
3. The Integrative Approach emphasize learning structural grammar, yet it may be difficult to
a. Characteristics and types of tests of integrative approach: achieve communicative competence without a considerable mastery
1. This approach involves the testing of language in context and is of the grammar of the language.
thus concerned primarily with meaning and the total communicative
effect of discourse. 2. It is possible for cultural bias to affect the reliability of the tests
being administered.
2. Integrative tests are concerned with a global view of proficiency.
Test Techniques
3. Integrative testing involves functional language but not the use of
functional language. What is a Test Technique?
It means of eliciting behavior from candidates which will tell us about
4. The use, of cloze test, dictation, oral interview, translation and their language abilities.
essay writing are included in many integrative tests. What is the purpose of a test?
• To assess how well students are achieving the stated goals.
• To provide the instructor with an opportunity to reinforce the
b. Strengths of Integrative Approach
stated objectives and highlight what is important for students to
1. The approach to meaning and the total communicative effect of
remember.
discourse will be very useful for students in testing.
• To provide a learning experience for students.
2. This approach can view students proficiency with a global view.
Test Techniques
• DIRECT vs. INDIRECT
3. A model cloze test used in this approach measures the reader’s
• DISCRETE POINT vs. INTEGRATIVE
ability to decode ‘interrupted’ or ‘mutilated’ messages by making the
• NORM-REFERENCED vs. CRITERION-REFERENCED
most acceptable substitutions from all the contextual clues available.
• OBJECTIVE vs. SUBJECTIVE
4. Dictation, another type using this approach, was regarded solely
as a means of measuring students’ skills of listening comprehension. Direct vs. Indirect Testing
c. Weaknesses of Integrative Approach DIRECT TESTING
Even if many think that measuring integrated skills is • Requires the candidate to perform precisely the skill that
better, sometimes there is a need to consider the importance of the test wishes to measure.
measuring skills based on students’ need, such as writing only, • Direct Test Items- is some sort of items or a question you
speaking only, etc. can use in assessment where the students will complete an
authentic action.
Examples: • Sometimes an integrative item is really more a
PROCEDURE than an item.
DIRECT
• Composition writing to know students’ writing skills. Norm-Referenced Vs. Criterion-Referenced Testing
• Objective: How well your students pronounce a language.
Norm-Referenced
• Direct testing method: get them to speak. • Scores are interpreted relative to each other in a normal
• For Speaking: Interiews and presentations distribution scheme.
• For writing: Essay questions • The idea is to spread the students out on a continuum of
• For reading: using reading material and responding to knowledge.
questions verbally and or in writing Criterion-Referenced
• For listening: Following oral directions to complete a task • Interpretation of scores is absolute and may be
representational.
Direct testing has a number of attractions: • Measures students’ ability against the predetermined
1. Relatively straightforward to create conditions. standards, e.g. the learning objectives of a specific course or
2. Assessment and interpretation are also quite unit of a course.
straightforward.
3. There is likely to be helpful backwash effect. Objective Vs. Subject testing
Objective
• No judgment is required– There is only one right answer.
INDIRECT TESTING
• Attempts to measure the abilities that underlie the skills in Subjective
which the test is interested. • If judgment is called for– free composition, looking for a series
Indirect Test Items- assess knowledge without authentic application. of factors (creativity, style, cohesion, grammar, and mechanics.)
EXAMPLES • If the scorer is not looking for any one right answer.
Grammatical knowledge contributes to writing ability, then a grammar
test may be used as an indirect test of writing. EXAMPLES
INDIRECT Objective Test Items- are those that require a specific answer.
• A writing test that requires students to identify grammatical • true/false, yes/no, matching test, fill in the blanks.
errors in sentences.
Close Items: Subjective Test Items- is evaluated by giving an opinion. It can be
Smiling, Tom said that he ____ the cake. Sarah smiled and compared with an objective test, which has right or wrong
gave him a _____. answers and so can be marked objectively.
Paraphrasing:
-”I’m sorry I did not go to the assembly”. • TEST CONSTRUCTION
-I wish _______________________. “The evaluation of pupils progress is a major aspect of the teacher’s
Sentence Re-Ordering job.” Evaluating Educational Outcomes (Oriondo & Antonio)
-Cat the store goes to
Some test are referred to Semi-Direct Test construction
Examples: • Writing items requires a decision about the nature of the item or
Tape recorded speaking skills. question to which we ask students to respond, that is, whether
discrete or integrative, how we will score the item;
Discrete Point vs. Integrative Testing • For ex. Objectively or subjectively, the skill we will purpose to
DISCRETE POINT test, and so on.
• Refers to the testing of one element at a time, item by item • A test item is a specific task test taker are asked to perform.
or series of items each testing a particular grammar Test Items can assess one or more points or objectives, and the
structure. actual item itself may take on a different constellation depending
on the text.
INTEGRATIVE TESTING • For ex. An item may test one point (understanding of a given
• Requires the candidate to combine many language vocabulary word) or several points ( the ability to obtain facts
elements in the completion of a task. from a passage and then make inferences based on the facts).
• Discrete point will always be indirect while • for example. There could be five items all testing one
grammatical point (e.g., tag questions). Items of a similar kind
• Integrative tests will tend to be direct. may also be grouped together to form subtests within a given
test.
EXAMPLES
DISCRETE POINT Classifying Items
• Form of a series of items testing a particular grammatical
structure. • Discrete – A completely discrete-point item would test simply
• Diagnostic test of grammar. one point or objectives such as testing for the meaning of a
• Ex. Choose the correct meaning of the word paralysis. word in isolation.
• a. Inability to survive • For Example:
• b. State of unconsciousness • Choose the correct meaning of the word paralysis.
• c. State of stuck a.) inability to move
b.) state of unconscious
INTEGRATIVE TESTING c.) state of shock
• Writing a composition, making notes while listening to a d.) being in pain
lecture.
• Demonstrate your comprehension of the following words • Integrative- An integrative item would test more than one point
by using them together in a written paragraph: ‘’paralysis”, or objective at a time.
“accident”, and “football”.
For Example:
• Demonstrate your comprehension of the following words by • What type of evaluation to be used? (Diagnostic, formative, or
using them together in a written paragraph: “paralysis,” summative)
“accident,” and “skiing.” • The criteria upon which student learning outcomes will be judge,
• Sometimes an integrate items is really more a procedure than and the most appropriate assessment techniques for gathering
an item, as in the case of a free composition. Which could test a information on a student progress.
number of objectives.
2. ASSESSMENT PHASE
• Select appropriate tools and techniques, then collect and collate
For example: information on student progress.
• Write a one page essay describing three sports and relative
likelihood of being injured while playing them competitively. 3. EVALUATION PHASE
• Interpret the assessment information and make judgments
Objective – A multiple-choice item about student progress.
• Provide information.
Ex. Is objective in that there is only one right answer. • Report progress.
• Encourage student.
Subjective – A free composition may be more subjective in nature if
the scorer is not looking for any one right answer, but rather for a 4. REFLECTION PHASE
series of factors (creativity, style, cohesion, and coherence, Teachers evaluate the utility, equity, and appropriateness of the
grammar, and mechanics). assessment techniques used.
THE SKILL TESTED AND THE INTELLECTUAL SKILLS NEEDED Assists teachers in making decision concerning improvements
• The Language skills that we test include the more receptive or adaptations to subsequent instruction and evaluation.
skills continuum- listening and reading, and the more productive
skills- speaking and writing. there are, other language skills that Stages of test construction
cross-cut these four such as vocabulary.
• Non-verbal Skills, may also be assessed such as gesturing, and TEST CONSTRUCTION
this can be both receptive (interpreting someone else’s It is the set of activities involved in developing and
gestures) and productive (making one’s own gestures). evaluating a test of some psychological function.
Directions: Write True if the statement is correct and False if not. What is Authentic Assessment?
_____1. Founded is the past tense of found. Is an evaluation process that involves multiple forms of
_____2. “Used to doing” and “used to do” mean approximately the performance measurement reflecting the students’ learning,
same thing. achievement, motivation, and attitudes on instructionally-
relevant activities (Daniel Callison).
Example of ranking test Is a subset of alternative evaluation processes.
• Direction: Write the numbers 1,2,3 and 4 according to the In a broader sense, assessment is any systematic approach for
correct order or sequence of events. collecting information on student learning and performance,
• __ Super kid flew down to the tree and tried to grab the cat, usually based on different sources of evidences.
but it scratched her. Examples of Authentic Assessment
• __ Super kid used the blanket to grab the cat and flew it Performance Assessment - consists of any form of assessment
down to the ground. Finally, the cat was safe. in which the student constructs a response orally or in writing.
• __ Super kid was flying around when she saw a cat that Portfolio Assessment – is a systematic collection of student
was stuck in a tree. She wanted to help the cat get down. work that is analyzed to show progress over time with regard to
• __ After the cat tried to scratch her, Super kid got a great instructional objectives.
idea. She went and got a blanket. Self-Assessment – offers opportunities for the student to self-
regulate learning, and the responsibility of appraising his/her
progress
• Example of hotspot test Integrated Assessment - refers to evaluation of multiple skills or
assessment of language and content within the same activity.
Alternative Assessment – involves approaches for finding out
what students know or can do other than through the use of
multiple-choice testing.
Construct validity
A test is said to have a construct validity if scores can be shown
to reflect a theory about the nature of a construct or its relation
to the other constructs.
Content validity
A test is said to have in content validity if the items or tasks of
which it is made up constitute a representative samples of items
or tasks for the area of knowledge or ability to be tested. These
are often related to a syllabus or a course.
Convergent validity
A test is said to have in convergent validity when there
is a high correlation between scores achieved in a different
test measuring the same construct (irrespective of
method). This can be considered an aspect of construct
validity
Criterion-related validity
A test is said to have in criterion-related if a relationship can be
demonstrated between test scores and some external criterion
which is believed to be a measure of the same ability.
Information of criterion relatedness is also used in determining
how well a test predicts future behavior.
Discriminant validity
There are no sources in the current [Link] of testing method. This can be considered an aspect of construct validity.
Techniques for monitoring student progress A. Unstructured- There is no fixed interview schedule, but rather a
o Make a video and audio recordings of a variety of formal and number of themes that are to be addressed.
informal oral language experiences, and then assess these B. Semi structured- The researcher follow a preset schedule but it is
possible to deviate from this when interesting issues arise.
according a pre-determined criterion which are based upon
C. Structured- This type of interview closely resembles a
student need and curriculum objective. questionnaire.
o Use checklists as concise methods of collecting information, and D. One on one- This kind of interview allows the researcher to zoom
rating scales or rubrics to assess student achievement. in on the views of individual respondents.
o Record anecdotal comments to provide useful data based upon E. Group- The advantage interviewing larger numbers at once is that
observation of students’ oral activities. group interactions might spark observations that would have gone
unnoticed.
o Interview student to determine what they believe they do well or
areas in which they need to improve. F. REFERENCING
Think about the following before the interviews:
Have student keep portfolios of their dated writing samples (a) Interviews should get the chance to practice their
and language abilities checklist and records. interview skills prior to the data collection.
Keep anecdotal records of students reading and writing (b) Ideally, the pilot settings resemble the actual condition
activities and experiences. as accurately as possible.
Have the students write in readers response journals. (c) During the interview, it’s useful to take note of the
Confer with students during the writing and reading interview situations.
processes and observe them during peer conference. (d) The success of an interview largely depends on the
C. Types of Qualitative Analysis interviewer- respondent interaction.
1. Conversation analysis CA (e) Interviews are time consuming.
The assumption is that every interaction is contextual and
has table and predictable nature. Various kinds of referencing:
[Link] analysis DA Norm-referencing is the placement of learners’ rank order,
DA is the analysis of “text and talk as social practices” and their assessment and ranking in relation to their peers.
is mainly concerned with power relation, gender inequalities etc. Criterion-referencing is a reaction against norm referencing
3. Test Language Analysis in which the learner is assessed purely in terms of his or her peers.
An analysis of test input or test taker responses for lexical The mastery criterion-referencing approach is one in which
richness, rhetoric, genre, discourse markers, grammatical complexity a single ‘minimum competence standard’ or ‘cut-off point’ is set to
etc. divide the achievement of the objective being recognized.
The continuum criterion-referencing approach is an
approach in which an individual ability referenced to a defined
continuum of all relevant degrees of ability in the area in question.
Testing Listening
1. Sound Discrimination
For beginners of language learning, it is important that they
able to distinguish the sounds of the language clearly to help
them understand the message they hear. In English for
example, there are several sounds that are phonetically similar
but phonemically different. Words are the same except for one
sound may cause confusion to the listener who cannot
discriminate these different sounds.
Example:
A teacher who instructs his/her student to bring a ‘mop’ the
next day may find a ‘map’ instead, if the sound [o] instead of a
[a] was heard by the student.
The same test types can be used for discriminating
consonant sounds like [p] and [f], [b] and [v], [s] and [z], 3. Statements & Dialogues
etc.
Examples: These items are designed to measure how well the students can
1. pill – fill understand short samples of speech and deal with a variety of
2. thought – taught signals on the lexical and grammatical levels of phonology.
3. best - vest
4. sip – sit
5. cool - call
Examples:
1. His cot is small. His cat is small.
2. She bought a fan. She bought a pan.
3. The child drew a lamp. The child drew a lamb. 4. Listening Comprehension
4. I placed the pin in my pocket. I placed the pen in my Test on learning comprehension also vary according to
pocket. learners proficiency. For the intermediate group, task may
5. The lady stood in front of the glass. The lady stood in front consist of responding to requests, answering questions,
of the class. following directions, repeating messages, etc. Common
test types for these tasks are question-answer, stimulus-
[Link] on Stress and Intonation response, and listen-repeat.
Most of the item type in this section are more appropriate for
elementary stages of learning English. They are preferable to
the discrimination items as they involve the testing of grammar
and lexis through phonology. Pictures, maps and diagrams can
be used effectively for testing such skills, thereby making the
testees’ performance less dependent on other skills.
Understanding talks and lectures
The ability to understand both informal talks and formal lectures
is an important skill for students studying subjects in the medium
of English at intermediate and advanced levels.
professional and technical literature in the foreign
language.
There are few comprehensive systematic programs which
have been constructed from a detailed analysis of the skills
required for efficient reading.
Much test material is still limited to short reading extracts
on which general comprehension questions are based. As
with listening comprehension, reading comprehension is
very closely related to the type of practice material used by
the teacher to develop reading skills.
What is urgently required in many classroom tests is
greater awareness of the actual processes involved in
reading and the production of appropriate exercise and test
materials to assist in the mastery of these processes.
Word Matching
Testees are required to draw a line under the word which is the same
as the word on the left.
Type II.
Sentence Matching
Testees are required recognize as quickly as possible sentences
which consist of the same words in the same order.
The items will concentrate on word and sentence comprehension, Type III.
using pictures to test this skill.
Pictures and Sentence Matching The following matching item shows how visuals can be used to test
the comprehension of definitions of certain words. Testees are
required to match the meaning of certain terms in a dictionary with
the appropriate shapes which those terms denote. The example
shows how the matching technique can be used at a more advanced
level and how it can lend itself to amore communicative testing of
reading. Above all, this particular item measures the ability of the
testee to understand the definitions usually found in the dictionary –
an essential skill required in learning and using foreign language.
Type I.
The true /false test is one of the most widely used tests of reading
comprehension. Not only the scoring is straightforward and quick, but
the scores are obtained by the testees can be very reliable indices of
reading comprehension provided that the items are well constructed
and that there are enough of them.
Disadvantages:
1. It can encourage guessing.
Type II. 2. The base score is 50 percent and thus the average test
The following item type is included to provide another example of difficulty generally in the region of the test may fail to
how reading comprehension matching tests can be based on a discriminate widely enough among the testees unless there
dictionary. Again, the item is intended for use at a fairly advanced are a lot of items.
level.
True or False Reading Tests
To penalize the testee for guessing and instructions on the lines of
the following may be included in the rubric:
Each correct answer will be awarded two marks. However, for each
wrong answer, one mark will be deducted from your score. It is better
therefore, not to guess blindly and to leave a blank if you do jot know
the correct answer. According to the passage, are the following
statements true or false, or is it impossible to draw any conclusion?
Completion Items
Cloze Procedure
A. TESTING SPEAKING
Testing the speaking and writing skills of a student is used to
evaluate listening and reading skills; spoken and written skills of
learners are directly and immediately manifested, thus a teacher
could tell off-hand if they are proficient or not in the language.
On the other hand, there are language specialists who believe that it
is more difficult to assess the student’s productive skills with
precision because of the simultaneous use of different language
abilities in speaking and writing; either of the four/five components
are recognized as part of the speaking/writing process.
1. TESTING PRONUNCIATION
• No fixed criteria for judging good or bad pronunciation.
Traditionally, correct pronunciation is measured by the accuracy
of imitating a native speakers’ way of pronouncing words (i.e. to
speak English correctly, one has to speak either the American
or British way).
• With the advent of the communicative approach in
teaching/learning language, the emphasis shifted from accuracy
to clarity/comprehensibility. As long as the pronunciation does
not impede the clarity of the message of the listener’s
Example activities of: understanding of it, the pronunciation is considered acceptable.
A. Skimming to identify the gist • Still, in testing specific sound production, one has to follow
Directions: Encircle the letter corresponding to the correct certain standards.
answer. • For beginners, the following types of tests for pronunciation are
The whole passage states that accounts about aliens on common: 1. recognition of similar sounds, 2. putting proper
earth are: stress in words and sentences, 3. loud reading, etc.
a. Entirely true
b. Absolutely false Example: Recognition of similar sounds/stress.
c. Awaiting confirmation
ACTIVITY
Direction: From a list of words in each group, read the word that has
B. Scanning to locate specific information the same vowel sound as the guide words.
Directions: answer the question Guide Words:
When and where did the Roswell incident happen? 1. beat steak great leave break
2. food mood blood foot stood
C. Making inferences 3. map mall play said fast
Direction: select the most probable meaning of the underlined 4. saw small plow wow tough
word based on the context within which it is used in the 5. let sit end leaf grid
sentence.
People have continued to report that they have seen UFOs Some difficulties in testing the speaking skills
and extraterrestrial creatures. • At all stages beyond the elementary levels of mimicry and
a. Horribly ugly repetition an ability to speak is an extremely difficult skill to
b. Out-of-this-world test.
c. Supernatural • Questions relating to the criteria for measuring the
d. Heavenly speaking skills and to the weighting given to such
components as correct pronunciation remain largely
unanswered.
• It is possible for people to produce practically all the correct
sounds but still be unable to communicate their ideas
appropriately and effectively. On the other hand, people
can make numerous errors in both phonology and syntax
and yet succeed in expressing themselves fairly clearly.
• Furthermore, success in communication often depends as
much on the listener as on the speaker: a particular listener
may have a better ability to decode the foreign speaker's
message or may share a common nexus of ideas with him required. (The testees read or hear the situation and them make
or her, thereby making communication easier. the appropriate responses, shown in the brackets.)
Conversational exchanges 1. You are trying to get to the public library but you are lost. Ask
a police officer the way.
• These drills are specially suitable for the language
2. Your friend has just returned from a holiday abroad. What do
• laboratory and can serve to focus attention on certain aspects of you say to him?
the spoken language, specially in those countries where English
is taught as a foreign language and the emphasis is primarily on 3. A waitress has just brought you the bill but has totalled it up
the reading skills. incorrectly. What do you say to her?
• However, several of the test items themselves are from 4. A friend of yours wants to see a film about a murder. You
communicative in any sense at all and do not allow for authentic have already arranged to see it another evening, but you know
interaction of any kind. she would be hurt if she knew. Make up an excuse.
• The item types range from items presenting the testees with • Type 3 The students hear a stimulus to which they must
situations in which they initite conversations to incomplete respond in any appropriate way. (his test often relies on
conversations with the part of one speaker omitted (i.e. a one- conventional greetings, apologies, acceptable ways of
sided dialogue). expressing polite disagreement, etc.)
• Tests containing such item types are on the whole reliable, but Do you mind if I use your pencil for a moment?
(Not at all/Certainly/Please Do/Go ahead/etc.)
they cannot be described as being valid tests of speaking.
What about a game of tennis?
(Yes I'd love a game/All right. I don't mind/etc.)
• Type 1 The testees are given a series of situations and are
1. Please don't go to a lot of trouble on my behalf.
required to construct sentences on the lines of a certain pattern 2. Oh dear, it's raining again. I hope it stops soon.
or group of patterns. Again, it is essential that two or three 3. We shan't be late. Shall we?
models be given to the testees so that they know exactly what is 4. Karen asked me to say she's sorry she can't come tonight.
• Type 4 This is similar to the previous type of item, but the stimuli • Advertisements, posters and strip cartoons may be used in
and responses form part of a longer dialogue and the situation is this way for class tests, provided that there are enough
thus developed. Because of its total predictability, however, this available to prevent the students from preparing one or two
type of item is sometimes referred to as a dialogue of a deaf! set pieces.
The man in the dialogue below continues regardless of what the
testees say. • Careful selection of the picture used for the examination
will help in controlling the basic vocabulary required and
You're on your way to the supermaret. A man comes up and may, to some extent determine the type of sentence
speaks to you. structure that predominates.
MAN: Excuse me, I wonder if you can help me at all. I'm looking • If the pictures depict a story or sequence of events, it is
for a chemist's. useful to give the testees one or two sentences as a starter
PAUSE FOR TESTEES REPLY thereby familiarizing them with the tense sequencing they
MAN: Thank you. Do you know what time it opens? should employ
PAUSE FOR TESTEE'S REPLY
MAN: Thanks a lot. Oh, er, by the way, is there a phone box Examiner: Last summer Lucy
near
here? spent a few days with her uncle and aunt in the country.
PAUSE FOR TESTEE'S REPLY When it was time for her to return home, her uncle and aunt
MAN:Oh dear, I'll need some coin. do you have any change for took her to the station. Lucy had made a lot of friends and
a she felt sad on leaving them. She got on the train and waved
• 5? goodbye to them... Now you continue to tell the story.
• PAUSE FOR TESTEE'S REPLY
• MAN: Well, thanks a lot. You've been most helpful.
• Finally, oral interviews do not simply happen The following is an example of an extremely simple role play suitable
spontaneously. for use at the elementary level.
Group discussion and role playing • It requires mastery not only of grammatical and rhetorical
devices but also of conceptual and judgmental elements.
• Through group discussion and role playing the teachers
can discover how students are thinking and using the The following analysis that attempts to group varied skills necessary
target language. for writing good prose into five general components
• Students are thus given an opportunity to use what can be • Language use – ability to write correct and appropriate
termed 'explanatory talk'. sentences
• In this latter type of activity, the members of the group are • Mechanical skills – ability to correctly those conventions
given a particular situation and instructed to make various peculiar to the written language
decisions.
• Stylistic skills - ability to manipulate sentences and A brief description of various grades of achievement expected to be
paragraphs, and use language effectively. attained by the class.
(a) write a letter , telling a friend about any interesting school • Example:
excursion on which you have been.
• Some of the reviewers were favorable to the new play.
(b) you have just been on a school to a nearby seaside town.
• a. and as many were unfavorable
3. Setting The Composition
• b. although others of the same amount were
• A good topic for a composition determines the register and
style to be used in the writing task by presenting the • unfavorable
students with a specific situation and context in which to
write. • *c. while an equal number were unfavorable
Register
• The use of the correct register denotes the ability to write a
specific purpose with a specific audience in mind.
• Confusion and embarrassment result from the use of
inappropriate register.
• Example:
• It has now been made out beyond any doubt what so ever that
the nicotine contained in tobacco smoke is poisonous. One
minute drop of pure nicotine plunged in to the blood stream of a
rat is sufficient to kind. It has also been proved that the nicotine
contained in tobacco smoke sends up the pulse rate and the
[Link] The Composition blood preassure. There is also strong evidence that the nicotine
content in fags is a primary cause of loose of weight and expansion (by modification, subordination, coordination,
hungriness. It is also likely that a few heavy etc.), word transformation in context (past to present
tenses, singular to plural nouns, nouns to pronouns, etc.).
Example:
EXAMPLE
• The following type (advance) register tests requires the student A. SENTENCE CONVERSION
to identify those words which are incongruous, replacing each Direction: Read the following statements aloud and change to
with a much more suitable word. questions answerable by yes/no.
1.) My friend Luisa, lives in Makati.
• It has now been made out beyond any doubt what so ever that
2.) She and her sisters are renting a condominium along
the nicotine contained in tobacco smoke is poisonous. One
Buendia.
minute drop of pure nicotine plunged in to the blood stream of a
3.) They used to go home every day to Bulacan after school or
rat is sufficient to kind. It has also been proved that the nicotine
work.
contained in tobacco smoke sends up the pulse rate and the
4.) After a few months, they realized it was impractical.
blood preassure. There is also strong evidence that the nicotine
5.) Now they have more time for study and leisure.
content in fags is a primary cause of loose of weight and
hungriness. It is also likely that a few heavy smokers will loose
Directions: Go back to sentence 1-5, change each into a
control of their finer muscles and be unable to play around with
question using the following question words:
objects with ease and precision. Such a loose of muscle activity
1.) Who
may widen the eyes and spoil vision. Moreover, smoking puts
2.) Where
back growth in adolescents and lowers athletic ability.
3.) What
• However, the most serious disease connected with smoking 4.) When
is cancer of the lungs: the direct connection between smoking 5.) How
and cancer has recently been established so assuredly that
cancer research folk and public health authorities throughout the B. ANSWERING QUESTIONS
world have begun intensive campaigns against smoking. In Directions: Answer the following question appropriately.
certain countries not only are cigarette advertisements banished 1.) What do you like most about your school?
from cinema and television screens but also makers and forced 2.) How will you describe your classmates in three words? Your
to print on each packet a warning concerning the dangers of teachers?
smoking. 3.) Why is college education important to you?
4.) What are your immediate goals after college?
• Controlled writing
Grammar tests are designed to measure student proficiency in
• There are several ways of controlling student's freedom of matters ranging from inflections (bottle-bottles, bake-baked) to
expression in their written work and as a consequence, syntax.
increasing the reliability of the scoring.
Syntax involves the relationship of words in a sentence,
• However useful such methods are as teaching devices, they will including matters such as word order, use of negative, question
only prove useful for testing purposes if each student is forms, and connectives.
completely familiar with the particular task performed: hence the
important of clear instructions followed by one example. 3. TESTING WORD CHOICE
• Type 1 The students are given a short reading extract and then One’s choice of words reflects his/her level of proficiency in the
required to write a similar paragraph using the notes they have language. The richer the vocabulary is, the more proficient the
given e.g learner is expected to be. In oral communication, vocabulary
proficiency may be tested through word substitutions, supplying
appropriate words in the missing parts of sentences, selecting
from a given list of words the most appropriate for the sentence,
etc.
• Grammar tests involve correct usage of verb and tenses, Speed alone will not constitute fluency, because some speakers tend
nouns and pronouns, adjective, and adverb forms, etc. in to speak fast, but they stammer or falter several times in their talk
oral communication, grammatical usage may be tested which hinders the smooth flow of communication.
through sentence conversion (from negative to affirmative,
statement to question, etc.), answering questions,
5. TESTING COMPREHENSION When there is not such listing it becomes necessary to
infer from textbooks and other teaching materials what
Comprehension is an easy part of communication. One’s oral structures are being taught.
communication ability depends on his/her ability to understand
the message received to which he/she needs to respond to. Specifications for a placement test will normally include all
of the structures identified in this way.
It can be tested focusing on the speaker’s ability to reply quickly
to a remark or a question clearly and completely. Sampling
This will reflect an attempt to give the test content validity
All these component part of speech can be rated simultaneously by selecting widely from the structures specified
in an interview using an evaluation sheet. The teacher devise It should take account of what are regarded for one reason
this sheet his/herself. It must reflect the five component being or another as the most important structures
It should not deliberately concentrate on the structures that
rated according to assigned numerical values.
happen to be easiest to test.
It is advisable not to fill in the score sheet during the interview as
it might affect the performance of the speaker. It is better to
Writing items
record the scored interview and rate the speaker using the score
sheet afterwards.
Whatever techniques are chosen for testing grammar, it is important
___ 5. Speaks very fast; message is very clear. for the text of the item to be written in grammatically correct and
___ 4. Speaks fast; message is fast. natural language.
___ 3. Speaks moderately fast; part of the message is unclear.
___ 2. Speaks slow; message is difficult to understand. Techniques for testing grammar
___ 1. Falters many times in speaking; message cannot be
understood. Common Types of Objective Items Used to Test Awareness of the
Grammatical Features of the Language
The diagnostic tests of grammar are very useful for the Item type 2 and 3 are preferable to 1 because the options do not
individual and the group. interrupt the flow of meaning in the sentence.
Unfortunately, type 1 confuses the reader because of the long
Writing Specifications parenthesis.
Item type 4 shows the correct( or incorrect ) form as part of the
For achievement tests where teaching objectives or the sentence.
syllabus list the grammatical structures to be taught,
Another item type, but it is not recommended since it
specification of content should be quite straightforward. requires the testees to spend time on unnecessary reading.
Type 5: A. Tom ought not to tell me your secret, but he did. When we arrived, the bank clerk……………………
B. Tom ought not to having told me your secret, but he did. Completion
C. Tom ought not to be telling me your secret, but he did. This Technique can be used to test a variety of structures.
D. Tom ought not to have told me your secret, but he did. Simple completion items used for testing grammar consists
of a sentence from which a grammatical element is
The following method is useful for testing short answers and removed
responses: He went……….school
Type 6: Tom ought not to have told anyone the secret. Steps in preparing simple-completion grammar tests.
A. ‘So ought you’ 1. Select grammar points that need to be tested
B. ‘Nether ought you’ 2. Provide an appropriate context
C. ‘Nether you oughtn’t.’ 3. Write good instructions
D. ‘So oughtn’t you.’ Three Basic Kinds of Simple Completion Grammar tests
Type 7: ‘Tom ought not to have told me’ The Inflection Form
A. Tom did not tell me, but he should. Testing the mastery of inflections provides for a productive
B. Perhaps Tom may not tell me. response. These vary from simple comparatives to verb
C. Tom told me but it was wrong of him. tense questions
D. It was necessary for Tom not to tell me. 1. He is the …………….(tall) person in class
2. He…………… (sleep) for nearly an hour
The free response form
It may be argued that an understanding of syntactic patterning is just
It illustrates how common terminology can be occasionally.
as necessary for the following item:
Example:
‘………. Was Robert late last week?’
add a question to tag these sentence:
‘Three times’.
Hamlet was indecisive, ………………………..?
A. How much C. How
Polonius knew a lot of aphorisms,………………………?
often
Write in the missing part of the two word verb.
B. How many D. How
long 1. What time did he get ……… this morning?
Items which appear in a test of grammar and structure Example of a free response with a minimum amount of contextual
should be made to sound as natural as possible. The following control:
mechanical test item: 1. You would get better sooner if………………
This book belongs to Peter, but that is…….....
A. Peter’s book C. The book of Peter Multiple Choice
B. the book to Peter D. The book of Peter’s In this test an incomplete sentence is followed by four
options for completing the sentence.
Can be written as follows: While she………………the house, her children were playing
This book belong to Peter, but that is …………. outside
A. Mary’s book C. The book of Mary A. has been cleaning
B. The book to Mary D. The book of Mary’s B. Cleaned
Note that the distractors should generally be correct both in
C. has cleaned
writing and in speech. The following item provided unsuccessful
when it was included in a test because many of the more able D. was cleaning
students selected option D. The reason being that they pronounce
used to quite correctly as use to. Cloze procedure
Cloze test are prose passages, usually a paragraph or
more in length, from which words have been deleted.
Gap Filling The students rely on the context in order to supply the
Ideally gap filling items should have just one correct response. missing words
For example: What was most disturbing………. that for the first It is easy to prepare and easy to score
time in his life Ali was on his way. (was) The cloze is simply a story or essay from which a number
He arrived late, ……… was a surprise. (which) of words have been deleted.
The council must do something to improve transport in the city. Sentence-completion vocabulary and grammar items are
………. they will lose the next election. (Otherwise) similar in a way to cloze tests
• laboratory and can serve to focus attention on certain aspects of 4. A friend of yours wants to see a film about a murder. You
the spoken language, specially in those countries where English have already arranged to see it another evening, but you know
is taught as a foreign language and the emphasis is primarily on she would be hurt if she knew. Make up an excuse.
the reading skills.
• Type 3 The students hear a stimulus to which they must
• However, several of the test items themselves are from respond in any appropriate way. (his test often relies on
communicative in any sense at all and do not allow for authentic conventional greetings, apologies, acceptable ways of
interaction of any kind. expressing polite disagreement, etc.)
• The item types range from items presenting the testees with Do you mind if I use your pencil for a moment?
situations in which they initite conversations to incomplete (Not at all/Certainly/Please Do/Go ahead/etc.)
conversations with the part of one speaker omitted (i.e. a one- What about a game of tennis?
sided dialogue). (Yes I'd love a game/All right. I don't mind/etc.)
1. Please don't go to a lot of trouble on my behalf.
• Tests containing such item types are on the whole reliable, but 2. Oh dear, it's raining again. I hope it stops soon.
they cannot be described as being valid tests of speaking. 3. We shan't be late. Shall we?
4. Karen asked me to say she's sorry she can't come tonight.
• Type 1 The testees are given a series of situations and are
• Type 4 This is similar to the previous type of item, but the stimuli
required to construct sentences on the lines of a certain pattern
and responses form part of a longer dialogue and the situation is
or group of patterns. Again, it is essential that two or three thus developed. Because of its total predictability, however, this
models be given to the testees so that they know exactly what is type of item is sometimes referred to as a dialogue of a deaf!
required. (The testees read or hear the situation and them make The man in the dialogue below continues regardless of what the
the appropriate responses, shown in the brackets.) testees say.
spent a few days with her uncle and aunt in the country.
When it was time for her to return home, her uncle and aunt
took her to the station. Lucy had made a lot of friends and
she felt sad on leaving them. She got on the train and waved
goodbye to them... Now you continue to tell the story.
• Others, however, argue that the examination nevertheless Some other techniques for oral examining
is artificial and unrealistic: students are placed not in
natural, real-life speech situations but in examination The short talk
situations.
• In certain examinations students are required to prepare a
short talk on a given topic. They may be allowed several
days or only a few minutes in which to prepare the talk and
in some cases , they may be provided with notes or Testing the Writing Skills
reference material.
3. The Writing Skills
• Vague subjects are best avoided; many topics are best
presented as questions: • The writing skills are complex and sometimes difficult to
• Should country spend huge sums of many on space teach
• exploration?
• It requires mastery not only of grammatical and rhetorical
• Do demonstrations serve any useful purpose?
devices but also of conceptual and judgmental elements.
• Do people ever really learn anything from the mistakes
• they make? The following analysis that attempts to group varied skills necessary
for writing good prose into five general components
Group discussion and role playing • Language use – ability to write correct and appropriate
sentences
• Through group discussion and role playing the teachers
can discover how students are thinking and using the • Mechanical skills – ability to correctly those conventions
target language. peculiar to the written language
• Students are thus given an opportunity to use what can be • Stylistic skills - ability to manipulate sentences and
termed 'explanatory talk'. paragraphs, and use language effectively.
• In this latter type of activity, the members of the group are • Judgement skills- ability to select, organize and order
given a particular situation and instructed to make various relevant information
decisions.
2. Testing Composition Writing
• Radio activity from a nuclear power station accident will
reach your area in a few hours: There is a small but very • Composition writing / free composition.
safe nuclear fallout shelter nearby, but there is room for
only six people out of a total of twelve. Which six people • Can be a useful testing tool.
from the following list do you think it would be most useful
• Provides students an opportunity to demonstrate the ability
to save in the interest of future generations? List them in
to organize language material using their own words and
order of priority.
ideas to communicate.
• (Note: M= Male F= Female)
The following are types of writing
• It has now been made out beyond any doubt what so ever that
the nicotine contained in tobacco smoke is poisonous. One
minute drop of pure nicotine plunged in to the blood stream of a
rat is sufficient to kind. It has also been proved that the nicotine
contained in tobacco smoke sends up the pulse rate and the
blood preassure. There is also strong evidence that the nicotine
content in fags is a primary cause of loose of weight and
hungriness. It is also likely that a few heavy
Example:
Register
• The use of the correct register denotes the ability to write a
specific purpose with a specific audience in mind.
• Confusion and embarrassment result from the use of
inappropriate register.
[Link] on Stress and Intonation d. Choir practice
e. Basketball practice
Features of stress, intonations, rhythm and juncture are f. Swimming practice
generally considered more important in oral communication
skills than the ability to discriminate between phonemes, These may be made even simpler for beginners. For the
tests of stress and intonation are on the whole less advanced level, tasks may consist of drawing inferences,
satisfactory than the phonemes discrimination tests. listening for gist or intent, listening for specific information,
following a story line or sequence of events, etc.
Most of the item type in this section are more appropriate for
elementary stages of learning English. They are preferable to
the discrimination items as they involve the testing of grammar
and lexis through phonology. Pictures, maps and diagrams can
be used effectively for testing such skills, thereby making the
testees’ performance less dependent on other skills.
These items are designed to measure how well the students can
understand short samples of speech and deal with a variety of
signals on the lexical and grammatical levels of phonology.
4. Listening Comprehension
Test on learning comprehension also vary according to
learner’s proficiency. For the intermediate group, task may
consist of responding to requests, answering questions,
following directions, repeating messages, etc. Common
test types for these tasks are question-answer, stimulus-
response, and listen-repeat.
Examples:
Directions: Do the action. (stimulus response)
Erase the writings on the board and place the eraser on the
table.
II. Direction: Answer the following questions.
(question-answers)
Initial stages of reading: Matching Tests The items will concentrate on word and sentence comprehension,
They test student’s ability to discriminate visually between words using pictures to test this skill.
which are spell in fairly similar ways. These test items will assist
in developing word recognition speed. Though not administered
as speed tests in the strict sense in the very early stage. Once
the students have gained familiarity and confidence in this type
of test, their performances should be timed so that they are
forced to read under some pressure. At first it is advisable to
confine the words used in the items to those who encountered
orally; later a number of words not encountered orally should be
introduced.
Word Matching
Testees are required to draw a line under the word which is the same
as the word on the left.
Type II.
Type III. The following item type is included to provide another example of
how reading comprehension matching tests can be based on a
dictionary. Again, the item is intended for use at a fairly advanced
level.
The following matching item shows how visuals can be used to test
the comprehension of definitions of certain words. Testees are
required to match the meaning of certain terms in a dictionary with
the appropriate shapes which those terms denote. The example
shows how the matching technique can be used at a more advanced
level and how it can lend itself to amore communicative testing of
reading. Above all, this particular item measures the ability of the
testee to understand the definitions usually found in the dictionary –
an essential skill required in learning and using foreign language.
Type I.
The true /false test is one of the most widely used tests of reading
comprehension. Not only the scoring is straightforward and quick, but
the scores are obtained by the testees can be very reliable indices of
reading comprehension provided that the items are well constructed
and that there are enough of them.
Disadvantages:
3. It can encourage guessing.
4. The base score is 50 percent and thus the average test
difficulty generally in the region of the test may fail to
discriminate widely enough among the testees unless there
are a lot of items.
Each correct answer will be awarded two marks. However, for each
wrong answer, one mark will be deducted from your score. It is better
therefore, not to guess blindly and to leave a blank if you do jot know
the correct answer. According to the passage, are the following
statements true or false, or is it impossible to draw any conclusion?
Type II.
True or False Reading Tests effectiveness of each of the items used. The sampling of the reading
passage is the utmost importance and must be related to the broader
aims of the language teaching situation.
Completion Items
What is Phil-IRI?
Cloze Procedure