A practice effect may be a problem when studying the results of which of the following?

Flashcards are an application of the testing effect. Here, flashcard software Anki is used to review a mathematical formula through active recall. First, only the question is displayed. Then the answer is displayed too, for verification.

The testing effect (also known as retrieval practice, active recall, practice testing, or test-enhanced learning) suggests long-term memory is increased when some of the learning period is devoted to retrieving information from memory. It is different from more general practice effect, defined in the APA Dictionary of Psychology as "any change or improvement that results from practice or repetition of task items or activities."

Cognitive psychologists are working with educators to look at how to take advantage of tests—not as an assessment tool, but as a teaching tools as testing prior knowledge is more beneficial for learning when compared to reading or passively studying material, even more so when the test is more challenging for memory.

History[edit]

Before much experimental evidence had been collected, the utility of testing was already evident to some perceptive observers including Francis Bacon who discussed it as a learning strategy as early as 1620.

"Hence if you read a piece of text through twenty times, you will not learn it by heart so easily as if you read it ten times while attempting to recite it from time to time and consulting the text when your memory fails."

Towards the end of the 19th century, Harvard psychologist William James described the testing effect in the following section of his 1890 book "The Principles of Psychology"

"A curious peculiarity of our memory is that things are impressed better by active than by passive repetition. I mean that in learning (by heart, for example), when we almost know the piece, it pays better to wait and recollect by an effort from within, than to look at the book again. If we recover the words in the former way, we shall probably know them the next time; if in the latter way, we shall very likely need the book once more."

The first documented empirical studies on the testing effect were published in 1909 by Edwina E. Abbott which was followed up by research into the transfer and retrieval of prior learning. In his 1932 book Psychology of Study, C. A. Mace said:

"On the matter of sheer repetitive drill there is another principle of the highest importance: Active repetition is very much more effective than passive repetition. ... there are two ways of introducing further repetitions. We may re-read this list: this is passive repetition. We may recall it to mind without reference to the text before forgetting has begun: this is active repetition. It has been found that when acts of reading and acts of recall alternate, i.e., when every reading is followed by an attempt to recall the items, the efficiency of learning and retention is enormously enhanced."

Studies in retrieval practice started in 1987 by John. L Richards, who published his findings in a newspaper in New York.[citation needed] Much of the confusion around early studies could have been due to constrained approaches not accounting for context. In more recent research with contributions from Hal Pashler, Henry Roediger and many others, testing knowledge can produce better learning, transfer, and retrieval results when compared to other forms of study that often use recognition like re-reading or highlighting.

Retrieval practice[edit]

In recent research storage strength, (how well an item is learned), and retrieval strength (how well an item can be retrieved) have become separate measures for retrieval practice. Retrieval strength (also known as recall accuracy) is typically higher for restudied words when tested immediately after practice, whereas tested words were higher as time moves on. This suggests using tests are more beneficial for long term memory and retrieval which some authors believe is due to limited retrieval success during practice supporting the idea that tests are learning opportunities.

Functional magnetic resonance imaging suggests that retrieval practice strengthens subsequent retention of learning through a "dual action" affecting the anterior and posterior hippocampus regions of the brain. This could support findings that individual differences in personality traits or with working memory capacity, don't seem to have any negative impacts of the testing effect, with a greater impact for lower ability individuals.

Despite some doubting knowledge transfer across a topic when testing with some studies showing contradictory evidence suggesting recognition was better than recall, inferential thinking has been supported and the transfer of learning is at its strongest with application of theory to practice, inference questions, medical education, and problems involving medical diagnosis. The transfer can occur across domains, paradigms, and help retention for material not on a final test. Using retrieval practices also produces less forgetting than studying and restudying while helping to identify misconceptions and errors with effects lasting years.

Repeated testing[edit]

Repeated testing have shown statistical significance and results getting better than repeated studying which could be due to testing creating multiple retrieval routes for memory, allowing individuals to form lasting connections between items, or blocking information together which can help with memory retention and schema recall. Using spaced repetition has shown an increase on the testing effect with a greater impact with a delay in testing, but the delay could lead to forgetting or retrieval-induced forgetting.

Delaying the test after a session can have a greater impact so studying in the day should be tested in the evening with a delay, but studying in the evening should have an immediate test due the effect sleep has on memory. Despite divided attention being thought to decrease the testing effect, if it is from a different medium it could enhance the effect.

The rate of forgetting is not affected by the speed or degree of learning but by the type of practice involved.

Test difficulty[edit]

According to the retrieval effort hypothesis, "difficult but successful retrievals are better for memory than easier successful retrievals" which supports the idea of finding a desirable difficulty within the retrieval practice considering our memory biases. Learning a language was better when using unfamiliar words compared to familiar words, supporting higher difficulty resulting in greater learning. The difficulty relates to the likelihood of forgetting as the harder it is to remember, the more likely you are to remember and retain the information supporting the notion that more effort is required for longer lasting retention similar to the depth of processing at encoding. Therefore lack of effort from students studying could be a factor that reduces its efficiency.

Increased difficulty shows decreased initial performance but increased performance on harder tests in the future, so retention and transfer suffer less when training is difficult. Even unsuccessful retrieval can enhance learning, as creating the thought helps with retention due to the generation effect. Like with processing time, it is the qualitative nature of the information that determines retention.

Getting feedback helps with learning but finding a desirable difficulty for the test combined with feedback is more beneficial than studying or testing without feedback. The Read, Recite, Review method has been proposed as a method to combine retrieval practice with feedback.

Test format[edit]

The test format doesn't seem to impact the results as it is the process of retrieval that aids the learning but transfer-appropriate processing suggests that if the encoding of information is through a format similar to the retrieval format then the test results are likely to be higher, with a mismatch causing lower results. However, when short-answer tests or essays are used greater gains in results are seen when compared to multiple-choice test

Cued recall can make retrieval easier as it reduces the required retrieval strength from an individual which can help short term results, but can hinder long term retrieval overtime due to reduced retrieval demand during practice. Quicker learning can reduce the rate of forgetting for a short period of time, but the effect doesn't last as long as more effortful retrieval. Cueing can be seen when encoding new information overlaps with prior knowledge making retrieval easier or from a visual or auditory aid.

Prior knowledge seems to increase the impact of retrieval practice meaning pre-testing can be used to get greater results, and the post-testing can be used to facilitate learning and memory of newly studied information, known as the forward testing effect. Pre-test or practice test accuracy doesn't predict post test results as time affects forgetting but pre-testing contributes to learning and prior knowledge and has shown to help in language learning.

Practice methods[edit]

When compared to concept mapping alone, retrieval practice is more beneficial, despite students not seeing retrieval practice as a useful learning tool. When combined, learner performance was increased, suggesting concept mapping is a tool that should be combined with retrieval practice alongside other non-verbal responses. Retrieval helps with mental organization which can work well with concept mapping. Multimedia testing can be used alongside flashcards as a method of retrieval practice but removing cards too early can result in lower long term retention. Individuals may not correctly interpret the outcome of practice cards contributing to dropped cards which impact future retrieval attempts therefore resulting in lower results due to increased forgetting.

It is advised that students, people in care units and teaching professionals use distributed retrieval practice with feedback to aid their studies. Interleaved practice, self-explanation, and elaborative interrogation can be useful but need more research. Summarization can be useful for individuals trained how to use to get the most from it. Keyword mnemonics and imagery for text have been somewhat helpful but the effects are often short lived. However, if each of these methods are integrated with retrieval elements the testing effect is more likely to occur.

Test benefits[edit]

A list of benefits of retrieval practice.

  • Aids later retention
  • Identifies knowledge gaps
  • Aids future related learning
  • Prevents interference from prior material in future learning
  • Aids transfer of knowledge to new contexts
  • Aids knowledge organization
  • Aids retrieval of untested information
  • Improves metacognitive monitoring
  • Provides feedback to instructors
  • Frequent testing encourages study intentions

Quizzes[edit]

A meta-analysis found the following links between frequent low-stakes quizzes in real classes and improved student academic performance:

  • There was an association between the use of quizzes and academic performance.
  • This association was stronger in psychology classes
  • This association was stronger in all classes when quiz performance could improve class grades.
  • Students doing well on quizzes tended to lead to students doing well on final exams
  • Regular quizzing increased the chances of students passing classes

Transfer of learning[edit]

Learning using retrieval practice appears to be one of the most effective methods for promoting transfer of learning. In particular the following three techniques have been identified as particularly beneficial for transfer especially when combined with feedback: i) Implementing broad rather than narrow retrieval exercises ii) Encouraging meaningful explanations of concepts or topics iii) Using a variety of complexity and formats with questions such as retrieval questions that require inference.

Considerations[edit]

Complex materials[edit]

Some researchers have applied aspects of cognitive load theory to suggest the testing effect may disappear with increasing task difficulty due to increased element interactivity. This has been addressed in the literature with studies that show complex learning is benefitted by retrieval practice.

Future research[edit]

It has been suggested that as most studies on the impact of retrieval practice were conducted in WEIRD countries this could cause a bias which should be explored in further studies.

Which of the following refers to the consistency of scores on a specific instrument across time or across items?

Reliability is consistency across time (test-retest reliability), across items (internal consistency), and across researchers (interrater reliability). Validity is the extent to which the scores actually represent the variable they are intended to.
Which of the following is a problem related to norm-referenced testing? Both norm-referenced assessment and informal methods of assessment could have problems with cultural bias.

Which of the following refers to the consistency of scores on a specific instrument?

Test Reliability. Refers to the consistency of scores on a specific instruction across time or across items.

Which of the following is considered a good reliability score?

80 or greater is considered to indicate good reliability. Again, high test-retest correlations make sense when the construct being measured is assumed to be consistent over time, which is the case for intelligence, self-esteem, and the Big Five personality dimensions.