Francis Et Al. (2020)
Francis Et Al. (2020)
Testing Effect
Andrea P. Francis
Albion College, United States of America
Mareike B. Wieth
Albion College, United States of America
Kevin L. Zabel
University of Wisconsin La Crosse, United States of America
Thomas H. Carr
Michigan State University, United States of America
Abstract
This quasi-experimental study investigated the role of prior psychology knowledge and in-class
retrieval activity in the testing effect. Undergraduate introductory psychology students (N ¼ 53)
from two classes at a small liberal arts college practiced retrieving information in class with
multiple-choice quizzing and concept mapping. Prior psychology knowledge was measured
using a 25-item multiple-choice pretest. Both students with high and low prior psychology
knowledge had higher scores on examination material that was practiced in class with
retrieval-based concept mapping compared to traditional multiple-choice quizzes and to no
organized in-class retrieval activity at all. Only students with high prior psychology knowledge
had higher scores on quizzed material compared to no organized in-class retrieval practice, and
these scores were lower than those on material that was practiced with in-class concept map-
ping. In comparison to administering multiple-choice quiz questions, a more useful in-class activity
might be to have students, especially those with less prior psychology knowledge, practice
retrieving material through free recall and connection building activities such as a concept map.
Corresponding author:
Andrea P. Francis, Albion College 611 E Porter Street, Albion, MI, Michigan 49224-1831, United States of America.
Email: [email protected]
2 Psychology Learning & Teaching 0(0)
Keywords
Concept maps, prior knowledge, testing effect
Introduction
Introductory psychology instructors must grapple with what in-class instructional activities
will benefit their students most. There is a growing body of evidence in cognitive and edu-
cational psychology showing that retrieving information from memory can be used as an
instructional tool instead of merely being a way to assess and document what has been
learned (e.g., Carpenter, Pashler, & Vul, 2006; McDaniel, Roediger, & McDermott, 2007;
Schwieren, Barenberg, & Dutke, 2017). However, it is unclear how this “testing effect” is
influenced by students’ prior knowledge about the concepts being learned. The present study
investigates the moderating role of prior knowledge in the testing effect, expands on the
application of retrieval-based practice learning in an introductory psychology classroom
setting, and demonstrates the effectiveness of a concept map retrieval practice tool that
can be used in the classroom.
that came to mind for each word in the list next to the word itself. Participants in the control
group did math problems for 1 minute after viewing a list of words. Recall was greatest for
the retrieval practice group, which suggested that elaboration or semantic cue creation did
not benefit participants as much as free recall and was taken as support for the episodic
context account. The authors suggested that during retrieval practice items become associ-
ated with contextual features associated with different temporal contexts. After repeated
retrieval, the items may become easier to recall because they are no longer to be found in
only a single episodic memory or context. However, in Lehman et al.’s (2014) experiment,
students did not practice elaborative retrieval, only elaboration with the words present.
Further, participants were asked to learn word lists, which is unlike the more complex
material encountered in the classroom. Therefore, it is important to consider how the testing
effect has been applied and studied in real world instructional settings.
elaborative retrieval hypothesis reviewed above, be more effective for related but not iden-
tical test items: concept mapping.
Retrieval Tools for the Testing Effect: Multiple-Choice Quizzes and Concept Maps
Although many studies have used multiple-choice questions as retrieval tools for practice
testing, there is evidence that recall-based activities, such as in the short-answer questions
used by McDaniel et al. (2007), can lead to greater benefits. One recall-based retrieval tool
that may be particularly advantageous for encouraging elaboration and the generalization
of material is concept mapping. A concept map is a graphic organization that overtly
represents not only a person’s knowledge but also the connections among the concepts of
a particular subject matter. Previous research has shown that a useful, perhaps necessary
component in using a concept map as an effective retrieval-based learning activity is free
recall. Karpicke and Blunt (2011) found no benefit over and above mere repeated study for
participants who were asked to create a concept map using their notes from the studied
material. However, in a follow-up investigation, Blunt and Karpicke (2014) found that when
free recall was added to concept mapping (material such as notes was not present while
students created the concept map), then the recall-based concept map enhanced memory as
much as recalling information in the form of a paragraph. Thus, recall is an important
component in effective retrieval-based learning activities, perhaps because of the effort it
requires.
Over and above effort, the elaborative retrieval hypothesis (Carpenter, 2009; 2011; Pyc &
Rawson, 2010) suggests that creating meaningful connections during retrieval practice
enhances retention of material. Understanding similarities and differences, or the connec-
tions, among concepts is a form of relational processing. Encouraging relational processing
rather than item-specific processing has been shown to enhance testing effects in laboratory
learning paradigms (Mulligan & Peterson, 2015; Peterson & Mulligan, 2013). Concept map-
ping encourages explicit relational processing by requiring students to make connections
between concepts. It is possible that relational connections are created and used during
multiple-choice quizzing, but given the constraints of the retrieval tool, such connections
would be created incidentally and covertly, rather than intentionally and overtly.
unpracticed case study 15 minutes or 24 hours later. Results showed there was no interac-
tion between expertise level and whether the questions had been practiced. Thus, prior
knowledge did not seem to influence benefits of retrieval practice using short-answer ques-
tions. Xiaofeng, Xiao-e, Yanru, and AiBao (2016) also found no difference in the benefits of
retrieval practice for high and low prior knowledge learners. Using the same paradigm as
Lehman and colleagues (2014), Xiaofeng and colleagues (2016) had psychology majors and
non-majors study a list of psychology-related words and subsequently either try to recall the
list or generate free associates to each of the list’s words. As with Lehman and colleagues
(2014), subsequent recall was greatest for the retrieval practice condition. Interestingly, for
the retrieval practice group there was no difference between performance among the majors
and non-majors. However, psychology majors did recall more than the non-majors in both
the elaboration and control conditions, suggesting that free-association-style elaboration
without retrieval practice may benefit those with greater prior knowledge more than
those with less prior knowledge.
Similarly, in exams asking novel, newly encountered questions, an elaboration-based
technique involving self-explanation of causal connections, called “elaborative interrog-
ation,” seems to be highly effective. However, the technique depends on prior knowledge.
Specifically, this study showed that the more students already knew about a topic domain
from their life experience, the more elaborative interrogation helped them add to that
knowledge (Woloshyn, Pressley, & Schneider, 1992). In addition to using experience with
a topic, prior knowledge of a discipline can also be assessed using a pretest. For example,
Thompson and Zamboanga (2004) found that psychology knowledge from a 25-item, five-
alternative-multiple-choice pretest predicted class achievement. The pretest included ques-
tions from a variety of topics covered in introductory psychology courses. Thompson and
Zamboanga (2004) found that although pretest scores did predict subsequent achievement
in the course, prior coursework in psychology was unrelated to course achievement. Thus,
experience was not enough to adequately measure prior knowledge.
Cogliano, Kardash, and Bernacki (2019) applied the pretest approach in a study of
retrieval practice. Specifically, these authors measured prior knowledge of topics by
giving students a multiple-choice pretest comprising five items from each of five chapters.
The items on the pretest were then used as retrieval practice items. Cogliano et al. (2019)
compared exam performance on multiple-choice items that were identical to those that had
appeared on both a multi-topic pretest and a subsequent in-class practice test (practiced
items) to performance on items never seen before that tapped into the same concepts (indic-
ative of transfer). Average exam scores for topics in which students had low prior knowledge
were significantly higher for the practice-tested items than the new related items. For high
prior-knowledge topics, the difference between practice-tested and new related items was
negligible, suggesting that transfer of learning from practiced items to new related items only
occurred with high prior knowledge topics.
Importantly, for Xiafeng et al. (2016), Carroll et al. (2007), and Cogliano et al. (2019), the
target material to be learned via retrieval practice was all identical to the material being
practiced, whether in attempted retrieval via free recall, as individual stimuli for elaboration
via free association, or as items in multiple-choice quizzes. As noted before, instructors often
care more about performance on non-identical, newly encountered exam material than on
material that has already been encountered before in an identical form. Therefore, a ques-
tion arises as to whether general prior psychology knowledge moderates the testing effect
when items on the exam are only conceptually related to practiced material (rather than
6 Psychology Learning & Teaching 0(0)
identical) and elaboration-based retrieval tools (such as concept maps) are used instead of
multiple-choice testing.
Concept mapping produces a concrete, visual representation of the connections among
the facts and ideas being learned, rather than leaving those connections implicit in a recall
protocol, or haphazardly represented or even absent in a collection of multiple-choice or
short-answer quiz questions, or individually connected to whatever might come to mind as
in Lehman et al.’s (2014) free-association technique. When the learner has more prior
knowledge of the subject matter (and hence is able to create more relationships and medi-
ators that connect and integrate concepts), both implicit and explicit connection building
ought to be useful. However, when a student has less prior knowledge, a retrieval tool that
explicitly directs the learner to create mediators and write down concepts and the connec-
tions among them might be expected to increase subsequent performance on novel test items
more than a retrieval tool that directs the learner to memorize facts, attend to fewer con-
nections, and hold those connections in working memory as in multiple-choice quizzing.
Thus, it is important to consider whether intentional elaborative retrieval practice would
benefit students above and beyond multiple-choice quizzing, and whether the explicit pro-
duction of connections required by concept mapping would increase retention for low
knowledge learners as well as high knowledge learners. Both the elaborative retrieval
hypothesis and the episodic context account would suggest that students will benefit from
retrieval practice. The elaborative retrieval hypothesis further suggests all students will
benefit more from intentional semantic cue creation (concept mapping) during retrieval
than incidental semantic cue creation (such as might occur in multiple-choice testing),
and that higher (relative to less) prior knowledge will allow students to benefit from both
intentional and incidental cue creation.
Method
Participants
Participants were 53 of the 61 students enrolled in two introductory psychology classes at a
small liberal arts college in the Midwest (Mage ¼ 18.136, SDage ¼ 0.525; female n ¼ 19, male
n ¼ 32, not reported ¼ 2; 46 first years, seven sophomores). Four students did not sign the
consent form and four others did not complete the course requirements. The two classes
were taught by the same instructor, used the same materials (e.g., lectures, retrieval activ-
ities, exams, and Myers’ Psychology (2013) textbook), and followed the same class format
(two sessions a week for 110 minutes each session). However, because class dynamics can be
quite different between otherwise identical classes, class was included as a factor in the
research design and analyses.
Design
A 2 (class: class 1, class 2) 2 (prior psychology knowledge: low, high) 3 (type of retrieval
practice: multiple-choice, concept mapping, no retrieval) mixed-measures design was used to
examine the role of class dynamic, prior psychology knowledge, and retrieval tool on exam
performance. Similar to Thompson and Zamboanga (2004), prior knowledge was assessed
with a 25-question multiple-choice pretest taken by all students. Scores ranged from 5 to 19.
A median split was used to identify a low knowledge group (24 students who got 10 or fewer
Francis et al. 7
correct) and a high knowledge group (27 students who got 11 or more correct). Four topics
were assigned to be practiced with concept maps by all students in both classes, four more
were assigned to be practiced with quizzes by all students, and two chapters did not receive
any practice. This was done in lieu of counterbalancing assignment of chapters to type of
retrieval practice across classes to prevent contamination of the manipulation by across-
class communication. All students received the same exam material.
Materials
Prior Knowledge Assessment. The pretest was comprised of 25 four-alternative multiple-choice
questions, with three on research methods, four on biological psychology, one on develop-
mental psychology, two on sensation and perception, two on learning, four on memory, four
on thinking and language, two on personality and social psychology, and three questions
about abnormal psychology. Questions on the pretest, the multiple-choice quizzes, and the
four criterion exams described below were either directly taken or modified from questions
in Brink’s (2013) Test Bank Volumes 1 and 2 for the corresponding chapter in the Myers
(2013) textbook.
Retrieval Practice Manipulation. A quiz, a concept map, or no retrieval activity was the first task
each class worked on during class and covered content from the previous class session. The
instructor informed students in advance of the specific topic to be mapped or quizzed in the
next class. To make sure students were using retrieval processes associated with the testing
effect, no outside materials could be used while creating the concept map or taking the quiz.
Quizzes were comprised of six, four-alternative multiple-choice questions, and covered
the topics of research design and analysis, vision (sensation and perception), biases and
heuristics, and personality theories. Students practiced four other topics with concept
maps: Piaget’s Theory of Development, The Modal Model of Memory, Psychological
Disorders, and Forms of Conditioning. Material from the biopsychology and social psy-
chology chapters was not practiced with a specific tool in class. The pretest tapped material
relevant to the three types of retrieval practice at approximately equal rates. Specifically, 10
pretest questions pertained to chapters containing the four topics that were quizzed, 10
pretest questions pertained to chapters containing the four topics that were concept
mapped, and five pretest questions pertained to chapters containing the two topics that
were not retrieval practiced.
In the class prior to completing their first recall-based concept map, the instructor com-
pleted a concept map with the students. Students were given a modified version of Ca~ nas
and Novak’s (2009) “Constructing your First Concept Map” instructions (www.ihmc.us).
The instructor and the students then created a concept map on the front chalkboard about
descriptive methods in psychology. In both classes, this map covered case studies, surveys,
and naturalistic observations as descriptive methods. This concept map was done together
with notes as a guide for students and was not considered a retrieval-practice activity in the
data analyses of this study.
For each concept map, students were told the number of first-level concepts that would
be required and their titles. For example, for the concept map on Piaget’s Theory of
Development, students were given the names of the four stages of development. Students
were told the concepts inside the circles could be facts, examples of the concept, or appli-
cations in their own life. The teacher emphasized the importance of elaborating on the
8 Psychology Learning & Teaching 0(0)
concepts and that each link could be used later to help the student remember the informa-
tion in a different context. Such elaboration was not emphasized during quiz completion.
The instructor monitored student progress during concept map and quiz completion and
asked students to turn over their map or quiz when they were finished. No time limit was
imposed for either activity and the time on task was not measured. The instructor went over
the quizzes and concept maps immediately after collecting them. Previous research has
shown that items that are incorrectly retrieved may reduce retention when assessments
are not accompanied by feedback (Pashler, Cepeda, Wixted, & Rohrer, 2005), and explicit
expert guidance can act as a way to direct attention to relevant material during problem
solving (Salden, Koedinger, Renkl, Aleven, and McLaren, 2010). Therefore, a discussion of
why certain answers were correct or incorrect occurred in the review of both concept maps
and quizzes. The graded quizzes and concept maps were returned within a week so students
could use them when preparing for their exams. Although it is possible students took more
time to construct concept maps than to take quizzes, the instructor noted the amount of
time taken to review and provide feedback on the retrieved material was approximately the
same (between 5 and 10 minutes).
In an attempt to ensure equal levels of motivation to study for the two different retrieval
activities, the instructor formally graded all quizzes and concept maps. Students were also
told a minimum number of correct connections that would be needed to receive full credit
on their concept maps. For example, to receive all six points for the concept map on Piaget’s
Theory of Development, students needed to have at least 12 correct connections or links
(excluding the four primary links). Students lost points for missing or incorrect connections.
An example of an incorrect connection would be a student who wrote that a child
gains abstract thought processes in the concrete operations stage instead of the formal
operations stage.
Criterion Assessment. Over the semester, four non-cumulative exams consisting of 25 four-
alternative multiple-choice questions and either five or six short-answer questions were
given. Because of the possible subjective nature of grading short-answer questions, only
multiple-choice items were used in the analysis.
Multiple-choice exam items consisted of questions directly related to material practiced
with quizzes and concept maps, which was from the same chapter as the topic covered on
the quiz or concept map but was not directly related to the topics, and questions not related
(from different chapters) to material retrieved in class. None of the exam questions had been
presented before in the courses. To avoid potential confusion about how to categorize items
that were in the same chapter (e.g., development) but not practiced during retrieval (e.g.,
Piaget’s theory was practiced in a concept map whereas teratogens were not practiced in
class), we chose not to include same-chapter but unpracticed items in the analysis and
instead compared questions directly related to practiced concepts with questions on topics
from different chapters. Table 1 includes an example quiz item and a corresponding exam
item as well as a concept map with a corresponding exam item. Again, none of the exam
items were the same as any question that had been seen before in the course.
After removing items from the same chapters as the retrieved material, but not directly
related to the topics covered during retrieval practice, there were 18 multiple-choice exam
items that were directly related to concept-mapped material, 28 items that were directly
related to quizzed material, and 16 items that were not retrieval practiced. Table 2 contains
descriptive statistics.
Francis et al. 9
Myra has such low self-esteem that she is often Because she already believes boys are naugh-
on the lookout for critical moments about her tier than girls, Mrs. Zumpano, a second-
appearance and personality. Myra’s behavior grade teacher watches boys more closely
best illustrates the dangers of: than she watches girls from any signs of
a. confirmation bias misbehavior. Mrs. Zumpano’s surveillance
b. trial and error strategy best illustrates:
c. prototypes a. conjunction fallacy
d. base-rate fallacy b. confirmation bias
c. sunk cost effect
d. hindsight bias
Example student produced concept map Corresponding example item from exam
Procedure
On the second class meeting of the semester, the professor described the study and asked if
students were willing to let their grade information be used as part of the study.
Then students completed the 25-question pretest. The first quiz was given to both classes
during week 2. A practice concept map was completed as a class at the beginning of the third
week of class and the first retrieval-practice concept map was created in class later that week.
The first exam took place at the beginning of week 4. Students subsequently completed the
10 Psychology Learning & Teaching 0(0)
Results
A regression analysis found that overall pretest scores significantly predicted criterion
assessment scores, b ¼ 1.214, t(49)¼ 4.874, p < .001. Pretest scores also explained a signifi-
cant proportion of variance in criterion assessment scores, F(1,49) ¼ 23.758, p < .001,
R2 ¼ .327.
Because the number of criterion items related to each type of retrieval practiced material
differed, proportions of items answered correctly were used for further analysis. A mixed
design analysis of variance was conducted to examine the role of class dynamic (class 1 and
class 2), prior knowledge (high and low), and retrieval practice (concept maps, multiple-
choice quizzes, and no retrieval practice) on exam performance. Retrieval practice was
entered as a within-subjects variable whereas both class and prior knowledge were entered
as between-subjects variables.
Figure 1. The effect of prior knowledge and retrieval tool on exam performance. For students with low
prior psychology knowledge, only concept mapping increased exam performance. For students with high
prior psychology knowledge, both concept mapping and multiple-choice quizzing increased exam perfor-
mance. Bars represent standard error.
material, t(26) ¼ 2.144, p ¼ .042, d ¼ .414, or the non-retrieved material, t(26) ¼ 3.177,
p ¼ .004, d ¼.623, and performance on the quizzed material was greater than performance
on the non-retrieved material, t(26) ¼ 2.354, p ¼ .026, d ¼ .437. The three-way interaction
between retrieval practice, prior knowledge, and class was not significant, F(2, 94) ¼ .604,
p ¼ .549.
Discussion
The current study focused on the role of prior knowledge and retrieval practice tool (mul-
tiple-choice quizzing, concept mapping, or no retrieval practice) in subsequent exam per-
formance across two undergraduate psychology courses. Immediate feedback on quiz or
concept map accuracy was provided as part of retrieval practice, and exam items were
closely related to (but not exactly the same as) the content targeted by the retrieval tools
(e.g., different questions about the same concepts, as illustrated in Table 1). We found no
evidence of any differences between classes in testing effect outcomes. The remaining dis-
cussion will therefore focus on the key components of prior knowledge and retrieval
practice tool.
Overall, data suggest that as pretest scores increased, so did scores on the criterion exams.
This supports Thompson and Zamboanga’s (2004) findings that a 25-item multiple-choice
pretest predicted course achievement in an introductory psychology course. However, in our
results this relationship was moderated by the type of retrieval tool used to enhance per-
formance in class.
Consistent with the elaborative retrieval hypothesis, results suggest students benefited
more from intentional elaborative rehearsal in the form of concept mapping than they did
from multiple-choice quizzing. Importantly, this main effect was moderated by prior
12 Psychology Learning & Teaching 0(0)
psychology knowledge. For students with more prior psychology knowledge, both concept
mapping and multiple-choice quizzing enhanced performance beyond no in-class retrieval at
all, with the impact of concept mapping being the greatest. These results go beyond previous
classroom studies in which multiple-choice quiz items have helped subsequent exam perfor-
mance (Batsell, et al., 2017; Carpenter et al., 2009; McDaniel et al., 2013; McDaniel et al.,
2011; McDaniel, Wildman, & Anderson, 2012, McDermott et al., 2014) by showing that
although quizzing helped, concept mapping helped more. However, for students with less
prior psychology knowledge, only concept mapping was associated with enhanced perfor-
mance above and beyond no in-class retrieval practice.
explicit intentional cue creation occurred during concept map retrieval practice, students
seemed to benefit from the testing effect regardless of prior psychology knowledge level.
the role of exposure to material. Future research studies are needed to further disentangle
the roles of topic difficulty, prior psychology knowledge, exposure to material, and retrieval
activity in the testing effect.
Consistent with the elaborative retrieval hypothesis (Carpenter, 2009; 2011), it is possible
that the testing effect occurred because students were able to draw and write out the con-
cepts and connections on paper rather than having to hold them in working memory, which
would be required when processing and answering multiple-choice quiz questions. That is,
making the conceptual organization visually explicit may be what enhances learning and
exam performance beyond traditional multiple-choice quizzing.
For multiple-choice quizzing, the lack of explicit connection making and the integration
of knowledge that results from it may also be why it is not beneficial for some learners. In
their review of the testing effect, Nguyen and McDaniel (2015) write that it is possible “that
quizzing may strengthen memory for some information at the expense of related
information” (p. 89). Peterson and Mulligan (2013) suggest a negative testing effect can
occur when individuals attend to item-specific processing (information that differentiates
items from one another) instead of relational processing (associations among a set of items)
(also see Mulligan & Peterson, 2015). If multiple-choice quizzing does tend to fragment
knowledge representation through item-specific processing, then its impact on exam perfor-
mance might well be unhelpful, especially if exam items are not identical or very similar to
those on the quizzes. Future research is needed to understand why explicit visually expressed
organization during recall enhanced performance for learners with low prior knowledge,
whereas the potential incidental connection building, episodic context practice, or other
kinds of processing that are done during multiple-choice quizzing did not.
Conclusions
Given the ill-defined and mercurial nature of a classroom environment, it is important to
consider the numerous factors that can impact student performance. In this study, we
specifically examined how in-class retrieval activities and prior psychology knowledge
played a role in exam performance. An in-class retrieval activity, concept mapping, that
enabled and encouraged students to create effective connections that integrated and con-
solidated new knowledge enhanced subsequent performance. In particular, concept map-
ping helped students with lower prior psychology knowledge overcome the obstacles that
stand in the way of mastering material. Thus, in comparison to creating and using multiple-
choice quiz questions, a useful in-class activity might be to have students, especially those
with less prior psychology knowledge, practice retrieving material through free recall and
connection building activities such as a concept map.
Funding
This research received no specific grant from any funding agency in the public, commercial, or not-for-
profit sectors.
Francis et al. 15
ORCID iD
Andrea P. Francis https://orcid.org/0000-0001-5007-7020
References
Agarwal, P. K., Bain, P. M. & Chamberlain, R. W. (2012). The value of applied research: Retrieval
practice improves classroom learning and recommendations from a teacher, a principal, and a
scientist. Educational Psychology Review, 24, 437–448.
Ambrose, S. A., Bridges, M. W., DiPietro, M., Lovett, M. C., & Norman, M. K. (2010). How learning
words: Seven research-based principles for smart teaching. Jossey-Bass.
Batsell, W. R., Perry, J. L., Hanley, E. & Hostetter, A. B. (2017). Ecological validity of the testing
effect: The use of daily quizzes in introductory psychology. Teaching of Psychology, 44, 18–23.
Blunt, J. R. & Karpicke, J. D. (2014). Learning with retrieval-based concept mapping. Journal of
Educational Psychology, 106, 849–858.
Brink J. (2013). Test bank volume 1 and 2 for David G. Myers Psychology tenth edition. Worth
Publishers.
Ca~nas, A. J., & Novak, J. D. (2009). Constructing your first concept map. Retrieved from: http://cmap.
ihmc.us/docs/constructingaconceptmap.php
Carpenter, S. K. (2009). Cue strength as a moderator of the testing effect: The benefits of elaborative
retrieval. Journal of Experimental Psychology: Learning, Memory, & Cognition, 35, 1563–1569.
Carpenter S. K. (2011). Semantic information activated during retrieval contributes to later retention:
Support for the mediator effectiveness hypothesis of the testing effect. Journal of Experimental
Psychology, 37, 1547–1552.
Carpenter, S. K., & DeLosh, E. L. (2006). Impoverished cue support enhances subsequent retention:
Support for the elaborative retrieval explanation of the testing effect. Memory & Cognition, 34,
268–276.
Carpenter, S. K., Pashler, H., & Cepeda, N. J. (2009). Using tests to enhance 8th grade students’
retention of U. S. history facts. Applied Cognitive Psychology, 23, 760–771.
Carpenter, S. K., Pashler, H., & Vul, E. (2006). What types of learning are enhanced by a cued recall
test? Psychonomic Bulletin & Review, 13, 826–830
Carpenter, S. K., & Yeung, K. L. (2017). The role of mediator strength in leaning from retrieval.
Journal of Memory and Language, 92, 128–141.
Carroll, M., Campbell-Ratcliffe, J., Murnane, H., & Perfect, T. (2007). Retrieval-induced forgetting in
educational contexts: Monitoring, expertise, text integration, and test format. European Journal of
Cognitive Psychology, 19(4–5), 580–606. https://doi.org/10.1080/09541440701326071
Cogliano, M. C., Kardash, C. A. M., & Bernacki, M. L. (2019). The effects of retrieval practice and
prior topic knowledge on test performance and confidence judgments. Contemporary Educational
Psychology, 56, 117–129. https://doi.or10.1016/j.cedpsych.2018.12.001
Glover, J. A. (1989). The “testing” phenomenom: Not gone but nearly forgotten. Journal of
Educational Psychology, 81, 392–399.
Karpicke, J. D., & Blunt, J. (2011). Retrieval practice produced more learning than elaborative study-
ing with concept mapping. Science, 331, 772–775.
Karpicke, J. D., Lehman, M., & Aue, W. R. (2014). Retrieval-based learning: An episodic context
account. In B. H. Ross (Ed.), The psychology of learning and motivation: Vol. 61. (p. 237–284).
Elsevier Academic Press.
Kornell, N., Hays, M. J., & Bjork, R. A. (2009). Unsuccessful retrieval attempts enhance
subsequent learning. Journal of Experimental Psychology: Learning, Memory, and Cognition, 35,
989–998.
Lehman, M., Smith, M. A., & Karpicke, J. D. (2014). Toward an episodic context account of retrieval-
based learning: Dissociating retrieval practice and elaboration. Journal of Experimental Psychology:
Learning, Memory, and Cognition, 40(6), 1787–1794. https://doi.org/10.1037/xlm0000012
16 Psychology Learning & Teaching 0(0)
McDaniel, M. A., Agarwal,P. K., Huelser, B. J., McDermott, K. B., & Roediger, H. L. (2011). Test-
enhanced learning in a middle school science classroom: The effects of quiz frequency and place-
ment. Journal of Educational Psychology, 103(2), 399–414.
McDaniel, M. A., Anderson, J. L., Derbish, M. H., & Morrisette, N. (2007). Testing the testing effect
in the classroom. European Journal of Cognitive Psychology, 19, 494–513.
McDaniel, M. A., Roediger, H. L., & McDermott, K. B. (2007). Generalizing test-enhanced learning
from the laboratory to the classroom. Psychonomic Bulletin & Review, 14, 200–206.
McDaniel M. A., Thomas R. C., Agarwal P. K., McDermott K. B., & Roediger H. L. III. (2013).
Quizzing in middle-school science: Successful transfer performance on classroom exams. Applied
Cognitive Psychology, 27, 360–372
McDaniel, M. A., Wildman, K. M., & Anderson, J. L. (2012). Using quizzes to enhance summative-
assessment performance in a web-based class: An experimental study. Journal of Applied Research
in Memory and Cognition, 1, 18–26.
McDermott, K. B., Agarwal, P. K., D’Antonio, L., Roediger, H. L., & McDaniel, M. A. (2014). Both
multiple-choice and short-answer quizzes enhance later exam performance in middle and high
school classes. Journal of Experimental Psychology, Applied, 20, 3–21.
Mulligan, N. W., & Peterson, D. J. (2015). Negative and positive testing effects in terms of item-
specific and relational information. Journal of Experimental Psychology: Learning, Memory, and
Cognition, 41, 859–871.
Myers, D. G. (2013). Psychology. Worth Publishers.
Nguyen, K., & McDaniel, M. A. (2015). Using quizzing to enhance student learning in the classroom:
The good, the bad, and the ugly. Teaching of Psychology, 42, 87–92.
Pashler, H., Cepeda, N. J., Wixted, J. T., & Rohrer, D. (2005). When does feedback facilitate learning
of words? Journal of Experimental Psychology: Learning, Memory, and Cognition, 31, 3–8.
Peterson, D. J., & Mulligan, N. W. (2013). The negative testing effect and multifactor account. Journal
of Experimental Psychology: Learning, Memory, and Cognition, 39, 1287–1293.
Pyc, M. A., & Rawson, K. A. (2009). Testing the retrieval effort hypothesis: Does greater difficulty
correctly recalling information lead to higher levels of memory? Journal of Memory and Language,
60, 437–447.[10.1016/j.jml.2009.01.004]
Pyc, M. A., & Rawson, K. A. (2010). Why testing improves memory: Mediator effectiveness hypoth-
esis. Science, 330, 335.
Salden, R. J. C. M., Koedinger, K. R., Renkl, A., Aleven, V., & McLaren, B. M. (2010). Accounting
for beneficial effects of worked examples in tutored problem solving. Educational Psychology
Review, 22(4), 379–392.
Schwieren, J., Barenberg, J., & Dutke, S. (2017). The testing effect in the psychology classroom: A
meta-analytic perspective. Psychology Learning & Teaching, 16, 179–196.
Thompson, R. A. & Zamboanga, B. L. (2004). Academic aptitude and prior knowledge as predictors
of student achievement in introduction to psychology. Journal of Educational Psychology, 96(4),
778–784.
Willingham, D. T. (2009). Why don’t students like school? A cognitive scientist answers questions about
how the mind words and what it means for the classroom. Jossey-Bass.
Woloshyn, V. E., Paivio, A., & Pressley, M. (1994). Use of elaborative interrogation to help students
acquire information consistent with prior knowledge and information inconsistent with prior
knowledge. Journal of Educational Psychology, 86(1), 79–89.
Wooldridge, C. L., Bugg, J. M., McDaniel, M. A., & Liu, Y. (2014). The testing effect with authentic
educational materials: A cautionary note. Journal of Applied Research in Memory and Cognition, 3,
214–221.
Xiaofeng, M., Xiao-e, Y., Yanru, L., & AiBao, Z. (2016). Prior knowledge level dissociates effects of
retrieval practice and elaboration. Learning and Individual Differences, 51, 210–214. https://doi.org/
10.1016/j.lindif.2016.09.012
Francis et al. 17
Author biographies
Andrea P. Francis is an Assistant Professor of Psychological Science at Albion College,
where she has taught since 2010. As an educational psychologist, her teaching interests
include educational psychology, child and adolescent development, research methods, and
introductory psychology. Andrea’s research, which has been published in both educational
and psychological journals, focuses on how individual differences in social experience influ-
ence the cognitive processes involved in learner creativity and criterion performance.
Mareike B. Wieth is a Professor of Psychological Science at Albion College, where she has
taught since 2005. She is trained as a cognitive psychologist and regularly teaches research in
cognitive psychology, introductory psychology, sensation and perception, and a course on
drugs, brain, and behavior. Mareike’s research focuses on the impact of various individual
differences on higher-order cognitive processes such as creativity, problem solving, and
decision making. She has been an expert contributor on NPR and the BBC and her research
on creativity has been featured in a variety of media and news outlets.