29.04.01 · psychology / learning-memory

Learning and memory: conditioning, cognitive maps, encoding, storage, retrieval, and forgetting

shipped3 tiersLean: nonepending prereqs

Anchor (Master): primary sources: Pavlov 1927, Skinner 1938, Bandura 1961, Tolman 1948, Baddeley & Hitch 1974, Ebbinghaus 1885, Loftus & Palmer 1974, Sperling 1960; secondary: Anderson, Baddeley, Eichenbaum, Schacter

Overview Beginner

Every moment you are awake, your brain is changing. The things you see, hear, and do leave traces that alter how you will think and act in the future. That process of change is learning. The traces themselves, and your ability to access them later, are memory.

Learning and memory are not single processes. They are families of processes that overlap, interact, and sometimes compete. A child who touches a hot stove learns in one way (a single painful experience creates an immediate, lasting aversion). The same child learning to read learns in a different way (thousands of repetitions, guided by feedback, building gradually toward fluency). Both count as learning, but the mechanisms differ.

This unit covers four major families of learning and the memory systems that support them. First, classical conditioning: the automatic pairing of stimuli that makes you flinch at a sound, salivate at a smell, or feel anxious in a particular room. Second, operant conditioning: learning from consequences, where behaviours that produce rewards increase and behaviours that produce punishments decrease. Third, observational learning: learning by watching others, without direct reinforcement. Fourth, cognitive learning: internal mental representations that guide behaviour even without explicit reinforcement.

On the memory side, the unit covers how information gets into the brain (encoding), how it is maintained (storage), and how it is accessed later (retrieval). It covers the major distinctions between working memory (the short-term, capacity-limited system you use to hold a phone number in mind) and long-term memory (the vast store that holds your life experiences, knowledge, and skills). It covers why we forget, how memories can be distorted, and what happens when memory goes wrong.

One theme runs throughout: the processes described here are not neutral discoveries value-free. Several of the most important experiments in the history of psychology involved serious ethical violations, and the field is still reckoning with them. The science is real and genuinely useful. The cost at which it was obtained is also real.

Classical conditioning Beginner

In the 1890s, the Russian physiologist Ivan Pavlov was studying digestion in dogs. He noticed something unexpected. The dogs began salivating not just when food was placed in their mouths, but before the food arrived — when they saw the lab assistant who usually fed them, or when they heard the footsteps approaching. The salivary response, which should have been an automatic reaction to food, was being triggered by stimuli that predicted food.

Pavlov shifted his research to study this phenomenon systematically ^{[source pending]}. He established a careful experimental procedure. A dog was placed in a harness. A tone (or a bell, or a metronome tick) was sounded. Shortly after, food powder was placed in the dog's mouth. The food powder automatically produced salivation — this was the unconditioned response (UCR) to the unconditioned stimulus (UCS). The tone, initially, produced no salivation — it was a neutral stimulus.

After repeated pairings of the tone and the food (sometimes as few as a dozen trials), the dog began salivating to the tone alone. The tone had become a conditioned stimulus (CS), and the salivation to the tone was the conditioned response (CR). The dog had learned an association: tone predicts food.

This is the basic structure of classical conditioning. An initially neutral stimulus (the CS) is repeatedly paired with an unconditioned stimulus (the UCS) that automatically produces an unconditioned response (the UCR). Through this pairing, the CS comes to elicit a conditioned response (the CR) that is similar to the UCR.

Figure: The three phases of classical conditioning. Before conditioning, the neutral stimulus produces no response. During conditioning, the neutral stimulus is paired with the unconditioned stimulus. After conditioning, the previously neutral stimulus now produces the conditioned response on its own.

Several processes extend the basic phenomenon:

Acquisition. The initial learning phase, during which the CS-UCS association is being established. The strength of the CR typically increases rapidly over the first few trials and then plateaus. The timing matters: conditioning is strongest when the CS precedes the UCS by a short interval (about half a second for many reflexes). If the CS follows the UCS (backward conditioning), little or no conditioning occurs.

Extinction. If the CS is presented repeatedly without the UCS, the CR gradually weakens and disappears. A dog that hears the tone many times without being fed stops salivating to the tone. But extinction is not unlearning. If the dog is taken out of the experimental context and returned later, the CR may reappear (spontaneous recovery), though typically weaker than before. Extinction involves learning a new association (tone no longer predicts food) that inhibits, but does not erase, the original association.

Generalisation and discrimination. After conditioning to a specific tone, a dog will salivate to similar tones — this is stimulus generalisation. The more similar the new tone is to the original CS, the stronger the response. But the dog can also learn discrimination: if one tone is consistently followed by food and a different tone is not, the dog will salivate to the first but not the second.

Watson and Little Albert

John B. Watson, the founder of behaviourism in America, saw in classical conditioning a way to explain human emotion. In 1920, Watson and his graduate student Rosalie Rayner conducted what became one of the most cited and most controversial experiments in psychology ^{[source pending]}.

"Little Albert" was a healthy nine-month-old infant. Watson and Rayner first established that Albert was not afraid of a white rat, a rabbit, a dog, a monkey, masks, or burning newspapers. He showed curiosity, not fear. They then began the conditioning procedure. As Albert reached for the white rat, Watson struck a steel bar with a hammer just behind Albert's head, producing a loud, frightening noise. After several pairings of the white rat with the loud noise, Albert began to cry and withdraw when the rat appeared alone.

The fear generalized. Albert became distressed not only by the white rat but by a rabbit, a fur coat, and a Santa Claus mask — stimuli that shared the furry quality of the original CS. Watson and Rayner had, by their own account, conditioned a phobia in an infant.

The scientific contribution was real. Watson demonstrated that emotional responses — fear, in this case — could be learned through classical conditioning, not just inherited or maturationally determined. This finding had practical implications: if fear could be learned, it could presumably be unlearned, a principle that would later underpin behavioural treatments for phobias and anxiety disorders.

The ethical violations were severe, and they are not incidental to the experiment — they are central to evaluating it. Albert was eleven months old at the time of the final test. He could not consent. Watson and Rayner deliberately induced fear and distress in an infant, and they made no attempt to extinguish the conditioned fear before Albert left the study. Albert's mother removed him from the hospital before Watson could conduct the planned deconditioning procedures. The child's real name was Douglas Merritte, and he died at age six of hydrocephalus. Whether the conditioned fear persisted for the rest of his short life is unknown.

The experiment would not pass an institutional review board today. It violated the principles of informed consent, minimization of harm, and the researcher's obligation to leave participants no worse off than they were found. The fact that the study produced genuine scientific knowledge does not erase these violations. Psychology, like any science, must weigh what it learns against what it costs to learn it.

Operant conditioning Beginner

Classical conditioning explains how organisms learn to associate stimuli. But much of the behaviour that matters — studying, working, eating, exercising — is not simply a matter of reflexive responses to cues. It is shaped by consequences. Actions that produce pleasant outcomes tend to be repeated; actions that produce unpleasant outcomes tend to be avoided. This principle, called the law of effect, was first formulated by Edward Thorndike in 1898 and later developed into a comprehensive framework by B.F. Skinner ^{[source pending]}.

Skinner distinguished two kinds of behaviour. Respondent behaviour is the automatic, elicited kind that Pavlov studied — salivation, blinking, flinching. Operant behaviour is emitted by the organism and operates on the environment to produce consequences. A pigeon pecking a disk, a rat pressing a lever, a student studying for an exam — these are operant behaviours. They are not triggered by a specific stimulus. They are shaped by what follows them.

The consequences that follow an operant behaviour determine whether it will occur again. Reinforcement increases the likelihood of a behaviour. Punishment decreases it. Both can be either positive (adding something) or negative (removing something).

This yields four possibilities:

Positive reinforcement. A behaviour is followed by the presentation of a desirable stimulus. A rat presses a lever and receives food. A child completes homework and receives praise. The behaviour increases.

Negative reinforcement. A behaviour is followed by the removal of an aversive stimulus. A rat presses a lever and turns off an electric shock. You take an aspirin and your headache goes away. The behaviour increases. Note: negative reinforcement is not punishment. It increases behaviour by removing something unpleasant.

Positive punishment. A behaviour is followed by the presentation of an aversive stimulus. A child touches a hot stove and is burned. A driver speeds and receives a ticket. The behaviour decreases.

Negative punishment. A behaviour is followed by the removal of a desirable stimulus. A teenager breaks curfew and loses car privileges. A child hits a sibling and has a toy taken away. The behaviour decreases.

Reinforcement schedules

Skinner discovered that the pattern and timing of reinforcement profoundly affects how quickly a behaviour is learned, how strongly it persists, and how it responds to extinction. He studied several schedules of reinforcement:

Continuous reinforcement. Every instance of the desired behaviour is reinforced. This produces rapid learning but also rapid extinction when reinforcement stops. If a vending machine never gives you your snack, you stop putting money in after just a few tries.

Fixed ratio (FR). Reinforcement is delivered after a fixed number of responses. A factory worker paid per unit produced is on an FR schedule. The worker pauses briefly after each reward and then resumes a high, steady rate of responding.

Variable ratio (VR). Reinforcement is delivered after a variable number of responses, averaging to a set number. Slot machines operate on a VR schedule. The unpredictability produces high, steady rates of responding that are extremely resistant to extinction. Gambling addiction is partly maintained by VR schedules.

Fixed interval (FI). Reinforcement is available after a fixed amount of time, but only for the first response after that time has elapsed. Checking your mailbox (which is filled once a day) is roughly an FI schedule. The pattern is distinctive: low responding just after reinforcement, with a gradual increase as the next reinforcement time approaches (a scalloped pattern).

Variable interval (VI). Reinforcement is available after a variable amount of time. Checking email or social media is roughly a VI schedule. This produces moderate, steady responding that is also resistant to extinction.

REINFORCEMENT SCHEDULES — Response Patterns
                                            
Schedule      Rate          Extinction       Example
─────────────────────────────────────────────────────
Continuous    High, steady  Very fast        Vending machine
Fixed ratio   High, steady  Fast             Piece-rate pay
Variable ratio Very high    Very slow        Slot machines
Fixed interval Low, scalloped Moderate       Mail check
Variable int.  Moderate     Slow             Email checking

Skinner's box and its implications

Skinner developed the operant conditioning chamber — the "Skinner box" — as a controlled environment for studying operant behaviour. A rat in a Skinner box can press a lever (or a pigeon can peck a disk), and the apparatus delivers reinforcement or punishment according to a programmed schedule. The box eliminates extraneous variables and allows precise measurement of response rates.

The Skinner box was a genuine methodological advance. It produced quantitative data on learning that could be replicated across laboratories. The principles derived from it — reinforcement schedules, shaping, chaining — have been applied to education, animal training, therapy, and organizational management.

Skinner went further, arguing that the principles of operant conditioning could explain all of human behaviour. In Beyond Freedom and Dignity (1971), he argued that free will is an illusion, that what we call "choices" are behaviours shaped by environmental contingencies, and that society should be redesigned using behavioural principles. This claim — that humans are essentially larger versions of the rats in his boxes — was the target of intense criticism. The criticism was not anti-scientific. It was based on the observation that reducing human cognition, language, creativity, and moral reasoning to stimulus-response chains leaves out most of what makes humans interesting.

The tension is genuine. Operant conditioning is one of the most empirically validated frameworks in psychology. Applied behaviour analysis, derived directly from Skinner's work, is an effective treatment for autism spectrum disorder and other conditions. Token economies, behaviour modification programs, and many educational techniques rely on reinforcement principles. At the same time, the claim that these principles are sufficient to explain all of human behaviour is not supported by the evidence and has been repeatedly challenged by cognitive, social, and cultural approaches to psychology.

Observational learning Beginner

Not all learning requires direct experience. You can learn that a stove is hot by being burned, but you can also learn it by watching someone else get burned. You can learn to drive a car through trial and error, but most people learn by watching others drive and then practicing under guidance. Observational learning — learning by observing the behaviour of others and its consequences — was most systematically studied by Albert Bandura ^{[source pending]}.

Bandura's most famous experiment is the Bobo doll study (1961). Children ages three to six were brought individually to a playroom. In one condition, an adult model entered the room and began aggressively attacking an inflatable Bobo doll — punching it, hitting it with a mallet, throwing it, and kicking it, while making aggressive statements. In another condition, the adult model played quietly with other toys and ignored the Bobo doll. In a control condition, no adult model was present.

After observing the model (or not, in the control condition), each child was taken to another room with attractive toys but was told they could not play with them — a mild frustration designed to increase the likelihood of imitative behaviour. The child was then taken to a third room that contained the Bobo doll and various other toys, including a mallet.

The results were striking. Children who had observed the aggressive model were significantly more likely to behave aggressively toward the Bobo doll, and they imitated many of the specific aggressive actions they had seen the model perform. Children in the non-aggressive and control conditions showed much less aggressive behaviour.

The implications were important. The children had learned new behaviours without any direct reinforcement — they had simply watched. They had not been rewarded for aggression; they had not been punished for non-aggression. The learning was purely observational. This finding challenged behaviourist accounts that required direct reinforcement for learning to occur.

Bandura identified four processes in observational learning:

Attention. The learner must attend to the model's behaviour. Attention is influenced by the model's characteristics (attractive, high-status, similar to the observer), the behaviour's distinctiveness, and the observer's arousal level.

Retention. The learner must remember the observed behaviour. This involves encoding the behaviour into memory (often as a verbal description or a visual image) and storing it for later use.

Reproduction. The learner must be physically capable of reproducing the behaviour. A child can observe a professional basketball player's jump shot but may lack the coordination and strength to replicate it.

Motivation. The learner must have a reason to perform the behaviour. This is where reinforcement comes back in — not as a requirement for learning, but as a factor in whether the learned behaviour is actually performed. Bandura distinguished between acquisition (learning the behaviour) and performance (doing it). A child might learn an aggressive behaviour from observing a model but not perform it if aggression has been punished in the past.

Cognitive learning Beginner

Behaviourism, in its strict form, treats the organism as a black box: stimulus goes in, response comes out, and what happens inside is irrelevant to the science. This view was challenged in the 1930s and 1940s by researchers who found that animals — and humans — develop internal representations of their environment that guide behaviour, even in the absence of direct reinforcement.

Edward Tolman demonstrated this with a series of experiments on rats in mazes ^{[source pending]}. In one experiment, three groups of rats ran a maze over several days. Group 1 received food at the end of the maze every time. Group 2 never received food. Group 3 received no food for the first ten days and then received food on day eleven.

The results were revealing. Group 1 improved steadily, making fewer errors over time. Group 2 showed little improvement. But Group 3 showed a dramatic improvement on day eleven — their error rate dropped to the level of Group 1 almost immediately, as though they had been learning the maze all along but had no reason to demonstrate it.

Tolman argued that the rats in Group 3 had formed a cognitive map — an internal representation of the maze's layout — during the first ten days of unreinforced exploration. The food reward did not cause the learning; it caused the rats to use what they had already learned. This was latent learning: learning that occurs without reinforcement and is not expressed until there is a reason to express it.

Cognitive maps are not limited to rats. When you navigate your city, you rely on a mental representation of streets, landmarks, and routes. When you give someone directions, you are reading off your cognitive map. Taxi drivers in London, who must pass a demanding examination called "The Knowledge" (memorizing over 25,000 streets and thousands of landmarks), show enlarged hippocampi — the brain region most associated with spatial memory. The cognitive map is not a metaphor. It is a neural reality.

Tolman's work, along with the research on observational learning, contributed to the cognitive revolution in psychology — the shift from behaviourism to information-processing models that treat the mind as an active processor of information, not merely a responder to stimuli.

Visual: learning theories compared Beginner

Figure: A comparison of the four major learning frameworks. Classical conditioning pairs stimuli automatically. Operant conditioning shapes behaviour through consequences. Observational learning transmits behaviours through observation. Cognitive learning involves internal mental representations that operate even without direct reinforcement.

LEARNING THEORIES — Comparison
─────────────────────────────────────────────────────────────────
Theory          Key figure   Mechanism         Example
─────────────────────────────────────────────────────────────────
Classical       Pavlov       Stimulus pairing  Dog salivates to tone
conditioning                 (CS-UCS)

Operant         Skinner      Consequences      Pigeon pecks for food
conditioning                 (reinforcement)

Observational   Bandura      Modeling +         Child imitates
learning                     vicarious         aggressive adult
                              reinforcement

Cognitive       Tolman       Internal           Rat learns maze
learning                     representations   without reward
                              (cognitive maps)
─────────────────────────────────────────────────────────────────

Check your understanding Beginner

Exercise (easy, multiple choice).

Which of the following is an example of negative reinforcement?

A. A student gets a detention for being late. B. A driver slows down after getting a speeding ticket. C. A person takes antacid to relieve heartburn, and continues to do so in the future. D. A child loses TV privileges for not doing homework.

Hint

Negative reinforcement involves the removal of an aversive stimulus, which increases the behaviour that removed it.

Answer

Option C. Taking antacid removes the aversive stimulus (heartburn), and the behaviour of taking antacid increases in the future. This is negative reinforcement: the removal of something unpleasant strengthens a behaviour. Options A and B describe positive punishment (adding something aversive to decrease behaviour). Option D describes negative punishment (removing something desirable to decrease behaviour).

Formal definition Intermediate

The psychology of learning and memory uses a specialized vocabulary that can be defined precisely. These definitions are not merely labels; they identify empirically distinguishable processes.

Classical (Pavlovian) conditioning. A learning procedure in which a biologically potent stimulus (the UCS, which elicits a reflexive UCR) is repeatedly paired with a previously neutral stimulus (the CS). After sufficient pairings, the CS elicits a conditioned response (CR) that resembles the UCR. The critical relation is temporal: the CS must reliably predict the UCS. The strength of conditioning is measured by the amplitude and probability of the CR on CS-alone trials.

Operant (instrumental) conditioning. A learning procedure in which the consequence of a voluntary behaviour (an operant) modifies the future probability of that behaviour. If the consequence is a reinforcer (positive or negative), the behaviour increases. If the consequence is a punisher (positive or negative), the behaviour decreases. The critical relation is contingent: the consequence must follow the behaviour reliably. Skinner's three-term contingency (discriminative stimulus $\to$ operant response $\to$ reinforcing consequence) is the basic unit of analysis.

Observational (social) learning. Learning that occurs through observing the behaviour of a model and the consequences that follow. It does not require direct reinforcement of the learner. Bandura's four subprocesses — attention, retention, motor reproduction, and motivation — are necessary conditions. Vicarious reinforcement (observing the model being rewarded) and vicarious punishment (observing the model being punished) influence whether a learned behaviour is performed, not whether it is acquired.

Latent learning. Learning that occurs in the absence of reinforcement and is not expressed in performance until a relevant incentive is introduced. Tolman's cognitive-map experiments are the canonical demonstration. Latent learning implies that organisms encode more information about their environment than they currently need, and that reinforcement controls the expression of learning rather than the acquisition of learning.

Memory encoding. The process by which perceived information is transformed into a construct that can be stored and later retrieved. Encoding is not a passive recording but an active process of selecting, organizing, and integrating information with existing knowledge. The levels of processing framework (Craik & Lockhart, 1972) proposes that deeper, more meaningful processing produces stronger, more durable memories than shallow, superficial processing.

Memory storage. The process of maintaining encoded information over time. Storage is not a single mechanism. The multi-store model (Atkinson & Shiffrin, 1968) distinguished three stores: sensory memory (high capacity, duration under one second), short-term memory (limited capacity of roughly 7 $\pm$ 2 items, duration under 30 seconds without rehearsal), and long-term memory (effectively unlimited capacity and duration).

Memory retrieval. The process of accessing stored information. Retrieval is influenced by the similarity between the conditions at encoding and the conditions at retrieval (encoding specificity principle, Tulving, 1983). Retrieval is also reconstructive: the act of recalling an event involves reassembling it from stored fragments, not playing back a recording.

Memory systems Intermediate

Encoding, storage, and retrieval

The three phases of memory — encoding, storage, and retrieval — form a chain. A failure at any stage produces forgetting. Information that is not encoded cannot be stored. Information that is encoded but not stored properly degrades. Information that is stored but cannot be retrieved is functionally lost, even if the trace is intact.

Encoding is influenced by attention (you cannot encode what you do not attend to), depth of processing (semantic encoding — thinking about meaning — produces stronger traces than structural or phonemic encoding), and elaboration (connecting new information to existing knowledge). The self-reference effect — the finding that information processed in relation to the self is particularly well remembered — is one of the most robust effects in memory research.

Storage involves consolidation — the process by which initially labile memory traces become stable. Consolidation was first proposed on the basis of retrograde amnesia: patients with brain damage often lose memories from the period just before the injury, while older memories remain intact, as though recent memories had not yet been consolidated. The standard model holds that consolidation occurs during the first few hours after encoding and is disrupted by subsequent learning, head trauma, or electroconvulsive shock. More recent research suggests that consolidation is not a single event but an ongoing process, with memories being reconsolidated each time they are retrieved.

Retrieval is cue-dependent. The probability of successful retrieval depends on the overlap between the cues available at retrieval and the cues present at encoding (the encoding specificity principle). This is why context-dependent memory occurs: you remember more in the same environment where you learned the material. It is also why state-dependent memory occurs: information learned under the influence of a drug is often better recalled in the same drug state.

Working memory: Baddeley's model

The concept of working memory replaced the older concept of short-term memory. Short-term memory was conceived as a passive buffer — a temporary store with limited capacity. Working memory, as formulated by Baddeley and Hitch (1974) ^{[source pending]}, is an active system that both stores and manipulates information.

Baddeley's original model had three components:

The phonological loop. Stores verbal and auditory information for about two seconds unless actively rehearsed through subvocal repetition. It is what you use when you repeat a phone number to keep it in mind. The loop has limited capacity — roughly the amount of information you can say in about two seconds, which is why longer numbers require chunking.

The visuospatial sketchpad. Stores and manipulates visual and spatial information. It is what you use when you mentally rotate a shape, visualize a route, or imagine the layout of your kitchen. It operates in parallel with the phonological loop, which is why you can simultaneously hold a visual image and a verbal string in mind (though both draw on a shared central resource).

The central executive. An attentional controller that coordinates the phonological loop and visuospatial sketchpad, manages the flow of information, and oversees complex cognitive tasks. The central executive is not a memory store but an attentional system — it decides what to attend to, what to rehearse, and how to allocate limited cognitive resources.

A fourth component was added later:

The episodic buffer. A temporary store that integrates information from the phonological loop, the visuospatial sketchpad, and long-term memory into a coherent episode. It provides the interface between working memory and long-term memory, explaining how we can hold in mind integrated scenes (a person speaking words with a particular facial expression in a specific setting) that draw on multiple codes.

Working memory capacity varies across individuals and is a strong predictor of cognitive performance. Individuals with higher working memory capacity tend to perform better on reading comprehension, reasoning, and complex problem-solving tasks. This does not mean that working memory is intelligence, but it is a major bottleneck in cognitive processing.

Long-term memory types

Long-term memory is not a single system. It comprises several subsystems that differ in what they store, how they operate, and what brain structures support them.

Explicit (declarative) memory is memory for facts and events that can be consciously recalled and stated. It has two subdivisions:

Episodic memory. Memory for specific events in your personal past, located in a particular time and place. Your memory of what you had for breakfast this morning, where you were on your last birthday, or what happened at your high school graduation are episodic memories. Episodic memory involves autonoetic consciousness — the ability to mentally travel back in time and re-experience the past. It depends on the hippocampus and medial temporal lobe structures.

Semantic memory. Memory for general knowledge that is not tied to a specific episode. You know that Paris is the capital of France, that water freezes at 0 degrees Celsius, and that 7 times 8 is 56, but you probably do not remember the specific occasion on which you learned each of these facts. Semantic memory may begin as episodic (you learned a fact at a particular time and place) and lose its episodic context through repeated use.

Implicit (nondeclarative) memory is memory that influences behaviour without conscious awareness of the memory. It includes:

Procedural memory. Memory for skills and habits — how to ride a bicycle, how to type, how to play a musical instrument. Procedural memories are expressed through performance rather than recollection. You can demonstrate that you know how to ride a bicycle, but you cannot easily articulate the specific muscle movements involved. Procedural memory depends on the basal ganglia and cerebellum, not the hippocampus.

Priming. Exposure to a stimulus influences the response to a subsequent stimulus, without conscious awareness of the connection. If you read the word "yellow," you will be faster to identify the word "banana" shortly afterward, because the two concepts are associated in semantic memory. Priming effects are robust, automatic, and can persist for days or weeks.

Classical conditioning. The conditioned associations described earlier (CS-CR pairings) are themselves a form of implicit memory. The association between the tone and the food, once established, operates automatically and outside conscious control.

LONG-TERM MEMORY — Classification
                                    Long-term memory
                                    /               \
                            Explicit              Implicit
                           (declarative)        (nondeclarative)
                           /          \            |          \
                     Episodic      Semantic   Procedural    Priming
                     (events)      (facts)    (skills)    (activation)

Key result: forgetting curves and interference Intermediate

Hermann Ebbinghaus, in 1885, conducted the first rigorous experimental study of memory ^{[source pending]}. Using himself as the sole subject, he memorized lists of nonsense syllables (CVC trigrams like DAX, BOK, MIP) and measured how many he could recall after various delays. The nonsense syllables were chosen to minimize the influence of prior knowledge, giving Ebbinghaus a relatively pure measure of memory for new associations.

The result was the forgetting curve: a negatively accelerated function showing rapid initial forgetting followed by progressively slower forgetting. Ebbinghaus found that about 50% of the learned material was forgotten within the first hour. After 24 hours, roughly 66% was forgotten. After a month, about 80% was forgotten. But the curve never reached zero — some residual memory persisted indefinitely.

The shape of the forgetting curve has been replicated many times, though the exact rate of forgetting varies with the material, the learner, and the encoding conditions. The general principle — rapid initial loss, followed by slowing decline — is one of the most robust findings in the psychology of memory.

Two main theories account for forgetting:

Decay theory. Memories fade with the passage of time, whether or not they are used. The metaphor is physiological: memory traces, like muscle tone, weaken without exercise. Decay theory has some support (the physical basis of memory does involve synaptic connections that can weaken), but it is incomplete. Time itself is not a causal variable; something must happen during the elapsed time to produce the forgetting.

Interference theory. Forgetting occurs because other information competes with or disrupts the target memory. Proactive interference occurs when previously learned information interferes with the recall of newly learned information (old interferes with new). Retroactive interference occurs when newly learned information interferes with the recall of previously learned information (new interferes with old).

Interference theory predicts that the more similar two sets of information are, the more they will interfere with each other. This prediction is well supported. If you study Spanish and then study Italian, the two languages will interfere with each other more than if you study Spanish and then study chemistry. The similarity of the material produces competition at retrieval.

Retrieval failure is a third explanation. The information is stored but cannot be accessed because the available cues are insufficient. The tip-of-the-tongue phenomenon — knowing that you know something but being unable to retrieve it — is a common example of retrieval failure. The cue (the question or context) is not specific enough to locate the memory trace among many similar traces.

The practical implication of interference theory and the forgetting curve is that learning is more durable when it is spaced out over time (distributed practice) rather than concentrated in a single session (massed practice). The spacing effect — better long-term retention when study sessions are separated by intervals — is one of the most replicated findings in cognitive psychology and one of the most ignored in educational practice.

Reconstructive memory and false memories Intermediate

Memory is not a recording device. It does not passively capture experience and play it back on demand. Memory is reconstructive: each act of remembering involves reassembling information from stored fragments, filling in gaps with inferences, and integrating the result with current knowledge and expectations. This reconstruction is usually accurate enough for everyday purposes, but it is not a faithful reproduction of the original event.

Elizabeth Loftus and John Palmer demonstrated the reconstructive nature of memory in a landmark 1974 study ^{[source pending]}. Participants watched a film of a car accident. Some were asked "About how fast were the cars going when they smashed into each other?" Others were asked the same question with the verb hit, collided, bumped, or contacted. The verb influenced speed estimates: "smashed" produced higher estimates than "contacted."

One week later, participants were asked whether they had seen any broken glass in the film. There was no broken glass. But participants who had been asked the "smashed" question were more likely to report having seen broken glass. The misleading question had altered their memory of the event, not just their response to the question.

This is the misinformation effect: exposure to misleading information after an event can alter the memory of the event. The misinformation can be introduced through leading questions, suggestive interviewing, media reports, or conversations with other witnesses. The effect is robust, and it has profound implications for the reliability of eyewitness testimony.

Eyewitness testimony

Eyewitness testimony is among the most persuasive forms of evidence in criminal trials. Jurors find confident eyewitnesses highly credible. Yet decades of research have shown that eyewitness memory is far less reliable than most people — including most jurors and judges — believe.

Factors that reduce eyewitness accuracy include:

Weapon focus. When a weapon is present during a crime, witnesses tend to focus on the weapon and encode less information about the perpetrator's face and other details.

Cross-race identification. People are generally less accurate at recognizing faces of a race different from their own. This effect, called the other-race effect or own-race bias, is well documented and has been accepted as evidence in some court cases.

Post-event information. Witnesses who discuss the event with other witnesses, are exposed to media reports, or are asked leading questions may incorporate false information into their memory of the event.

Confidence-accuracy mismatch. The correlation between a witness's confidence in their identification and the accuracy of that identification is weaker than most people assume. A confident witness is not necessarily an accurate one, and confidence can be inflated by confirming feedback ("Good, you identified the suspect").

The recovered memory controversy

The reconstructive nature of memory became the centre of a bitter controversy in the 1980s and 1990s. Some patients in therapy reported recovering memories of childhood sexual abuse that they had previously forgotten. In some cases, these memories formed the basis of criminal charges.

The controversy had two poles. On one side, clinicians and advocates argued that traumatic memories can be repressed (pushed out of conscious awareness by a protective mechanism) and later recovered, often through therapeutic techniques such as guided imagery, hypnosis, or dream analysis. These recovered memories, they argued, were genuine and deserved legal recognition.

On the other side, cognitive psychologists — most prominently Loftus — argued that the therapeutic techniques used to recover memories could also create false memories. The misinformation effect, source monitoring failures, and the suggestibility of memory under hypnosis meant that patients could develop vivid, emotionally compelling memories of events that never happened.

The evidence supports both positions. Traumatic memories can be forgotten and later recalled accurately. But false memories can also be implanted through suggestive therapeutic techniques, and the phenomenological experience of a genuine recovered memory is indistinguishable from the experience of a false one. The person reporting the memory cannot tell the difference, and neither can a therapist.

The stakes are enormous. False accusations of abuse destroy lives. Genuine abuse that is dismissed as a false memory also destroys lives. The science does not provide a reliable method for distinguishing recovered true memories from false ones in individual cases. This is an area where the limits of scientific knowledge must be honestly acknowledged.

Exercise Intermediate

Exercise 1 (medium, application).

A teacher wants to use operant conditioning to increase the frequency with which students submit homework on time. The teacher decides to give a small reward (a sticker) for every on-time submission. After two weeks, the teacher switches to giving the reward only after every third on-time submission. Using the vocabulary of reinforcement schedules, identify both schedules and explain why the teacher's switch is a sound strategy.

Hint

Consider what happens to the behaviour when the reinforcement schedule changes from a denser to a thinner schedule. Think about resistance to extinction.

Answer

The first two weeks used a continuous reinforcement (CRF) schedule — every instance of the target behaviour was reinforced. CRF is effective for initial acquisition because the contingency between behaviour and consequence is maximally clear. The switch to rewarding every third submission is a fixed ratio 3 (FR-3) schedule.

The strategy is sound because thinning the schedule from CRF to FR-3 increases resistance to extinction. Behaviours maintained on intermittent schedules are more persistent than those maintained on continuous reinforcement. The transition should be gradual; an abrupt shift might cause a temporary decrease in responding. The teacher could further thin the schedule to a variable ratio if the goal is long-term maintenance without external reward.

Exercise 2 (medium, analysis).

A participant in a memory experiment studies a list of 30 word pairs (e.g., "dog — chair"). Later, the participant is shown the first word of each pair and asked to recall the second. On 10 trials, the participant cannot recall the correct word but insists that they know it — it is on the tip of their tongue. Using the encoding-storage-retrieval framework, explain what is happening and identify which stage is the bottleneck.

Hint

The tip-of-the-tongue state implies that the information was encoded and stored successfully. What is failing?

Answer

The tip-of-the-tongue (TOT) state is a retrieval failure. The participant's insistence that they know the answer, combined with their inability to produce it, indicates that encoding and storage were successful — the information is in memory. The bottleneck is at retrieval: the cue (the first word of the pair) is activating some features of the target memory but is not sufficient to fully access the stored trace. The TOT state demonstrates that storage and retrieval are distinct processes. Information can be stored but temporarily inaccessible. Often, providing a partial cue (such as the first letter of the target word) resolves the TOT state, supporting the retrieval-failure interpretation.

Exercise 3 (hard, critical evaluation).

A therapist uses guided imagery to help a client "recover" a childhood memory of abuse. The client develops a vivid, emotionally intense memory of the event. The therapist takes this as confirmation that the event occurred. In 4-6 sentences, evaluate the therapist's reasoning. Identify at least one error.

Hint

Vividness and emotional intensity are features of both genuine and false memories. What does the reconstructive memory literature say about the relationship between subjective conviction and accuracy?

Answer

The therapist's reasoning contains at least one error: equating the vividness and emotional intensity of the recovered memory with its accuracy. Research on the misinformation effect and false memory implantation shows that fabricated memories can be experienced as vividly and emotionally as genuine ones. Loftus's "lost in the mall" paradigm demonstrated that approximately 25% of participants could be led to develop a false memory of being lost in a shopping mall as a child, complete with sensory details and emotional content. The therapeutic technique of guided imagery is a form of imaginative rehearsal that can, in susceptible individuals, create a memory of an imagined event that is subjectively indistinguishable from a memory of a real event. The therapist's confidence in the memory's accuracy is therefore unwarranted on the basis of the client's subjective experience alone. This does not mean the memory is false; it means that subjective conviction is not a valid criterion for distinguishing true from false memories.

Memory and trauma Master

The relationship between memory and trauma is one of the most complex and contested areas in psychology. Traumatic experiences — events involving actual or threatened death, serious injury, or sexual violence — are processed differently from ordinary experiences, and these differences have implications both for clinical treatment and for legal proceedings.

Traumatic memory formation. Under conditions of extreme stress, the body releases stress hormones (adrenaline, cortisol) that modulate memory consolidation. Moderate levels of arousal enhance consolidation — this is the basis of flashbulb memories, the vivid, detailed memories people form of emotionally significant events (where they were when they heard about the 9/11 attacks, for example). But very high levels of arousal can impair consolidation, leading to fragmented, disorganized, or incomplete memories of the traumatic event.

This inverted-U relationship between arousal and memory (the Yerkes-Dodson law, applied to memory) means that traumatic memories are often both more intense and less coherent than ordinary memories. A trauma survivor may have vivid sensory fragments (the sound of screeching tires, the smell of smoke) without a coherent narrative of the event. This fragmentation is not evidence that the memory is false; it is a predictable consequence of how extreme arousal affects the hippocampus and amygdala.

Dissociation and amnesia. Some trauma survivors report periods of dissociation during the traumatic event — a sense of detachment from their body, of observing the event from outside themselves, or of time slowing down. Dissociation during trauma is associated with poorer encoding and subsequent amnesia for aspects of the event. The relationship between dissociation, amnesia, and trauma is well established in the clinical literature, though the mechanisms are not fully understood.

Repressed versus recovered memory. The concept of repression — the unconscious blocking of traumatic memories from conscious awareness — was introduced by Freud and remains influential in some clinical traditions. The empirical evidence for repression as a mechanism is contested. Some researchers argue that forgetting of traumatic events can be explained by ordinary memory mechanisms (infantile amnesia for events before age 3-4, dissociation during encoding, retrieval failure due to lack of cues) without invoking a special repression mechanism. Others argue that the clinical evidence — patients who recover verified traumatic memories after years of forgetting — supports the existence of repression.

The controversy is unlikely to be resolved by a single study, because the relevant data (the original traumatic event, the forgetting period, and the recovery) are inherently retrospective and difficult to verify independently. What the evidence does support is that some traumatic memories are forgotten and later recalled, and that some recovered memories are false. The base rate of each category is unknown.

Trauma-focused therapies. Several evidence-based treatments for trauma-related disorders involve memory processing. Prolonged exposure therapy requires the patient to repeatedly recount the traumatic memory in a safe therapeutic context, which activates the memory and allows new, non-threatening associations to form alongside the fear associations. Cognitive processing therapy involves identifying and challenging maladaptive beliefs that developed as a result of the trauma. Eye movement desensitization and reprocessing (EMDR) involves recalling the traumatic memory while performing a bilateral stimulation task (typically following the therapist's finger movements with the eyes); the mechanism of action is debated, but the treatment has empirical support.

These therapies work with the memory of the trauma, not around it. The principle is that traumatic memories, which were encoded under conditions of extreme arousal and fear, need to be reprocessed in a safe context in order to lose their capacity to trigger intense emotional and physiological responses. The goal is not to erase the memory but to change its relationship to the present.

Learning and memory across cultures Master

The study of learning and memory has been dominated by Western researchers, Western institutions, and Western research participants. In 2010, Henrich, Heine, and Norenzayan published an influential paper arguing that behavioural scientists rely disproportionately on samples from Western, educated, industrialized, rich, democratic ("WEIRD") societies, and that these samples are not representative of the human species. The critique applies directly to the study of learning and memory.

Rote learning and discovery learning

The dominant educational philosophy in Western psychology has favoured discovery learning — the view that learners construct knowledge best through active exploration, problem-solving, and hands-on experience, rather than through passive reception of information. This view is associated with Piaget, Bruner, and constructivist educational theory. It has shaped Western educational practice from kindergarten through university.

In many East Asian educational systems, by contrast, rote learning — the memorization and repetition of information — has historically been more valued. Students in China, Japan, and South Korea spend substantial time memorizing characters, formulas, poems, historical facts, and canonical texts. This approach is often criticized by Western educators as shallow, passive, and counterproductive to deep understanding.

The criticism is partly justified and partly culturally blinkered. Rote memorization, in isolation, is not sufficient for deep learning. But rote memorization is not necessarily the opposite of understanding; it can be the foundation on which understanding is built. A student who has memorized multiplication tables can allocate working memory resources to higher-level mathematical reasoning, while a student who must calculate 7 times 8 each time is spending cognitive resources on a procedure that should be automatic. The same principle applies to vocabulary in a foreign language, factual knowledge in history, and procedural routines in science.

The Chinese concept of wenli (understanding through practice) and the Japanese concept of shu-ha-ri (a progression from strict adherence to creative departure) both incorporate memorization and repetition as necessary early stages in the development of expertise. The Western assumption that rote learning and understanding are opposed may reflect a cultural preference rather than a universal cognitive truth.

Empirical evidence supports a middle position. Direct instruction — a structured approach in which the teacher explicitly explains concepts, demonstrates procedures, provides guided practice, and gives feedback — produces better learning outcomes than pure discovery learning in many domains, particularly for novice learners. Pure discovery learning, in which students are expected to figure things out on their own with minimal guidance, can be ineffective or even counterproductive, because novices lack the knowledge structures to make sense of the information they encounter. But pure rote learning, without opportunities for application, transfer, and integration, also produces fragile knowledge that does not generalize.

Collective memory and oral tradition

The concept of memory in Western psychology is predominantly individual: one person's brain encodes, stores, and retrieves information. But collective memory — the shared representations of the past held by a group, community, or nation — is also a form of memory, with different properties and different mechanisms.

Collective memory was first systematically studied by the sociologist Maurice Halbwachs (1925), who argued that individual memory is always situated within social frameworks. You remember your past not in isolation but through the social groups to which you belong — your family, your community, your nation. The groups provide the narratives, categories, and landmarks that structure your recall. Halbwachs' insight was that forgetting is not only an individual phenomenon but a social one: when a group disperses or its shared narratives are disrupted, the individuals within it lose access to memories that were maintained by the social framework.

Oral tradition — the transmission of knowledge, history, and culture through spoken rather than written means — is a sophisticated memory technology that has been developed independently by cultures around the world. In oral cultures, specialized techniques for preserving and transmitting information over generations include:

Rhythmic and metrical structures. Poetry, songs, and chants are easier to remember than prose because the rhythm and rhyme provide redundant cues that constrain possible continuations. The Iliad and the Odyssey were composed and transmitted orally for centuries before being written down, using elaborate metrical formulae (repeated phrases like "rosy-fingered dawn") that served as memory anchors.

Narrative organization. Information embedded in stories is better remembered than information presented as a list. The "method of loci" — the memory technique of associating items with locations along a familiar route — exploits the same spatial-narrative architecture that oral cultures use. Australian Aboriginal songlines are perhaps the most elaborate example: navigational routes across the continent encoded in songs that describe the landscape, its features, and the ancestral beings who created them. A songline is simultaneously a map, a history, a legal code, and a ritual text.

Distributed memory. In many oral cultures, no single individual holds the complete body of knowledge. Knowledge is distributed across members of the community, with different individuals responsible for different domains. This distribution provides redundancy and error correction: if one person's version of a story deviates from the community's version, the discrepancy can be identified and corrected through collective recitation.

The study of memory in oral traditions challenges the Western assumption that memory is primarily an individual, internal process. Memory can be distributed across people, places, and practices, and the technologies that support it — rhythm, narrative, space, social structure — are as sophisticated as any written system.

Memory in non-Western contexts

Research on memory in non-Western cultures has revealed both universals and cultural variation.

Universals. The basic structure of memory — encoding, storage, retrieval, the distinction between working and long-term memory, the forgetting curve, interference effects — appears to be universal. The same brain structures (hippocampus, prefrontal cortex, basal ganglia) support memory across cultures. The spacing effect and the testing effect (better retention from retrieval practice than from re-reading) have been replicated across diverse populations.

Variation. The content of memory, the strategies used to enhance it, and the values attached to different kinds of memory vary across cultures. Cultures that emphasize interdependence (many East Asian, African, and Latin American cultures) tend to produce memories that are more relational and contextual — focused on social relationships and group events — while cultures that emphasize independence (many North American and Western European cultures) tend to produce memories that are more self-focused and event-specific. These differences reflect the social priorities of the culture, not differences in memory capacity.

The age of earliest autobiographical memory also varies across cultures. In Western samples, the average age of earliest memory is around 3.5 years. In East Asian samples, it is typically later, around 4.5 years. In Maori samples from New Zealand, it is earlier, around 2.5 years. These differences are attributed to cultural practices around memory sharing: cultures that frequently discuss past events with young children, and that encourage children to elaborate on their experiences, tend to produce earlier first memories.

Connections Master

Neuroscience [29.02.NN] (pending) connects through the neural substrates of learning and memory. The hippocampus, amygdala, basal ganglia, prefrontal cortex, and cerebellum each contribute to different aspects of learning and memory. Long-term potentiation (LTP) — the persistent strengthening of synaptic connections following repeated stimulation — is the leading cellular model of how memories are stored. The connection runs in both directions: behavioural findings constrain neural models, and neural findings constrain behavioural theories.
Sensation and perception [29.03.NN] (pending) connect through the encoding phase of memory. What is encoded depends on what is perceived, and what is perceived depends on attention, expectation, and prior knowledge. Perceptual learning — the improvement in perceptual discrimination with practice — is both a form of learning and a modifier of future encoding.
Cognition and intelligence [29.05.NN] (pending) connects through working memory's role in reasoning and problem-solving. Working memory capacity is a strong predictor of fluid intelligence, and the mechanisms of working memory (maintenance, updating, inhibition) are central to many cognitive tasks. The debate over whether working memory and intelligence are distinct constructs or different labels for overlapping capacities is a live research question.
Psychological disorders [29.09.NN] (pending) connects through memory's role in anxiety disorders (conditioned fear responses that persist despite extinction), depression (biased memory for negative events), post-traumatic stress disorder (intrusive traumatic memories), and substance use disorders (conditioned cravings triggered by environmental cues).
Cross-cultural psychology [29.12.NN] (pending) connects through the cultural variation in memory practices, educational approaches, and the social frameworks that structure autobiographical memory. The WEIRD critique applies directly: much of what is presented as "general" knowledge about memory is based on Western samples and may not generalize.
Philosophy of mind 20.06.01 connects through the mind-body problem. The reconstructive nature of memory challenges naive realist views of perception and recollection. If memory is reconstructive rather than reproductive, then the relationship between experience and its mental representation is more complex than a simple copy. The implications for personal identity — whether you are the same person you were ten years ago, given that your memories of that time have been reconstructed many times — connect to philosophical debates about persistence and selfhood.
Ethics [20.02.NN] (pending) connects through the ethics of psychological research, from the Little Albert study to the recovered-memory controversies to the use of behavioural techniques in advertising, education, and politics. The power to shape learning and memory is a power that can be used for good (treating phobias, enhancing education) or for manipulation.

Historical & philosophical context Master

The systematic study of learning began with the work of Ebbinghaus on memory (1885) and Thorndike on animal intelligence (1898), but the philosophical roots extend much further. Aristotle's De Memoria (On Memory and Reminiscence) proposed that memory involves the formation of mental images that resemble past perceptions and that recollection is a process of searching for these images through associative chains. The method of loci — associating items to be remembered with locations in a familiar building — was widely used in ancient Greece and Rome and is described in the Rhetorica ad Herennium (c. 86 BCE).

The modern study of learning was shaped by two forces: the desire to make psychology an experimental science, and the philosophical position of empiricism — the view that all knowledge comes from experience. John Locke's metaphor of the mind as a tabula rasa (blank slate) at birth, written upon by experience, was a direct precursor of behaviourism's focus on learning as the mechanism by which the mind is formed.

Behaviourism emerged in the early twentieth century as a reaction against introspective psychology, which behaviourists regarded as unscientific because it relied on subjective self-reports that could not be independently verified. Watson's 1913 manifesto, "Psychology as the Behaviorist Views It," declared that psychology should study only observable behaviour, not mental states. Skinner extended this position to its most radical form, arguing that private events (thoughts, feelings) are themselves behaviours, subject to the same reinforcement principles as public behaviours.

The cognitive revolution of the 1950s and 1960s challenged behaviourism's rejection of mental processes. Several developments contributed: Chomsky's critique of Skinner's Verbal Behavior (1959), which argued that language acquisition cannot be explained by operant conditioning alone; Tolman's demonstrations of latent learning and cognitive maps; the development of the digital computer as a metaphor for the mind; and the failure of behaviourism to account for complex phenomena such as language, problem-solving, and decision-making.

The cognitive revolution did not reject the findings of behaviourism. Classical conditioning, operant conditioning, and the basic principles of reinforcement are well-established phenomena that continue to generate productive research. What changed was the theoretical framework: the black box was opened, and mental processes — encoding, storage, retrieval, representation, computation — were admitted as legitimate objects of scientific study.

The neuroscience revolution of the late twentieth and early twenty-first centuries added a third layer. The development of neuroimaging techniques (fMRI, PET, EEG) made it possible to observe the brain's activity during learning and memory tasks, providing converging evidence for the cognitive models and, in some cases, challenging them. The discovery of place cells in the hippocampus (O'Keefe & Dostrovsky, 1971) — neurons that fire when an animal is in a specific location — provided a neural substrate for Tolman's cognitive maps. The identification of long-term potentiation as a synaptic mechanism for memory (Bliss & Lomo, 1973) provided a cellular-level account of how learning changes the brain.

The tension between levels of explanation — behavioural, cognitive, and neural — is a defining feature of contemporary psychology. The same phenomenon (classical conditioning, for example) can be described at the behavioural level (CS-UCS pairing produces a CR), the cognitive level (the organism forms an expectation that the CS predicts the UCS), and the neural level (the CS-UCS pairing strengthens synaptic connections in the amygdala). Each level captures something real; none is complete on its own.

The ethical dimension of learning research has evolved significantly. The Little Albert study (1920) was conducted without informed consent and without a plan to reverse the harm caused. The Milgram obedience experiments (1961) and the Stanford prison experiment (1971), while not directly about learning, further eroded trust in researchers' self-regulation. The Belmont Report (1979) and the establishment of institutional review boards (IRBs) established the ethical framework that governs psychological research today: respect for persons (informed consent), beneficence (minimizing harm, maximizing benefit), and justice (fair distribution of research burdens and benefits). The field's willingness to confront its own ethical history — including the reassessment of the Little Albert study, the identification of Douglas Merritte, and the ongoing debate about the legacy of behaviourism — is a sign of disciplinary maturity, even if the reckoning is incomplete.

Bibliography Master

Foundational experiments:

Pavlov, I.P. — Conditioned Reflexes: An Investigation of the Physiological Activity of the Cerebral Cortex (Oxford University Press, 1927; trans. Anrep).
Watson, J.B. & Rayner, R. — "Conditioned emotional reactions", Journal of Experimental Psychology 3, 1–14 (1920).
Thorndike, E.L. — "Animal intelligence: an experimental study of the associative processes in animals", Psychological Review Monograph Supplements 2, 1–109 (1898).
Skinner, B.F. — The Behavior of Organisms: An Experimental Analysis (Appleton-Century, 1938).
Bandura, A., Ross, D. & Ross, S.A. — "Transmission of aggression through imitation of film-mediated aggressive models", Journal of Abnormal and Social Psychology 63, 575–582 (1961).
Tolman, E.C. — "Cognitive maps in rats and men", Psychological Review 55, 189–208 (1948).

Memory foundations:

Ebbinghaus, H. — Memory: A Contribution to Experimental Psychology (1885; Dover reprint, 1964; trans. Ruger & Bussenius).
Atkinson, R.C. & Shiffrin, R.M. — "Human memory: a proposed system and its control processes", in Spence (ed.), The Psychology of Learning and Motivation Vol. 8 (Academic Press, 1968), pp. 47–89.
Baddeley, A.D. & Hitch, G. — "Working memory", in Bower (ed.), The Psychology of Learning and Motivation Vol. 8 (Academic Press, 1974), pp. 47–89.
Craik, F.I.M. & Lockhart, R.S. — "Levels of processing: a framework for memory research", Journal of Verbal Learning and Verbal Behavior 11, 671–684 (1972).
Tulving, E. — Elements of Episodic Memory (Oxford University Press, 1983).

Reconstructive memory and false memories:

Loftus, E.F. & Palmer, J.C. — "Reconstruction of automobile destruction: an example of the interaction between language and memory", Journal of Verbal Learning and Verbal Behavior 13, 585–589 (1974).
Loftus, E.F. & Pickrell, J.E. — "The formation of false memories", Psychiatric Annals 25, 720–725 (1995).
Schacter, D.L. — The Seven Sins of Memory: How the Brain Forgets and Remembers (Houghton Mifflin, 2001).

Neuroscience of memory:

O'Keefe, J. & Dostrovsky, J. — "The hippocampus as a spatial map: preliminary evidence from unit activity in the freely-moving rat", Brain Research 34, 171–175 (1971).
Bliss, T.V.P. & Lomo, T. — "Long-lasting potentiation of synaptic transmission in the dentate area of the anaesthetized rabbit following stimulation of the perforant path", Journal of Physiology 232, 331–356 (1973).
Squire, L.R. — "Memory and the hippocampus: a synthesis from findings with rats, monkeys, and humans", Psychological Review 99, 195–231 (1992).

Cross-cultural and WEIRD critique:

Henrich, J., Heine, S.J. & Norenzayan, A. — "The weirdest people in the world?", Behavioral and Brain Sciences 33, 61–83 (2010).
Wang, Q. — "Culture and the development of self-knowledge", Current Directions in Psychological Science 15, 182–187 (2006).
Halbwachs, M. — Les cadres sociaux de la memoire (Alcan, 1925); English trans. On Collective Memory (University of Chicago Press, 1992, ed. Coser).

Educational applications:

Anderson, J.R. — Cognitive Psychology and Its Implications (Worth, 8th ed., 2015).
Roediger, H.L. & Karpicke, J.D. — "Test-enhanced learning: taking memory tests improves long-term retention", Psychological Science 17, 249–255 (2006).
Dunlosky, J. et al. — "Improving students' learning with effective learning techniques", Psychological Science in the Public Interest 14, 4–58 (2013).

Ethics of psychological research:

Beck, H.P., Levinson, S. & Irons, G. — "Finding Little Albert: a journey to John B. Watson's infant laboratory", American Psychologist 64, 605–614 (2009).
National Commission for the Protection of Human Subjects — The Belmont Report (1979).

Prerequisites

29.01.01 pending

Tier anchors

beginner: Gray, Psychology (Worth); Myers, Psychology (Worth); any introductory psychology textbook chapters on learning and memory
intermediate: Anderson, Cognitive Psychology and Its Implications; Baddeley, Working Memory; Eichenbaum, The Cognitive Neuroscience of Memory
master: primary sources: Pavlov 1927, Skinner 1938, Bandura 1961, Tolman 1948, Baddeley & Hitch 1974, Ebbinghaus 1885, Loftus & Palmer 1974, Sperling 1960; secondary: Anderson, Baddeley, Eichenbaum, Schacter

References

Pavlov, I.P. — Conditioned Reflexes (Oxford University Press, 1927) · Lectures I-IV
Skinner, B.F. — The Behavior of Organisms (Appleton-Century, 1938) · Ch. 1-5, 7
Bandura, A., Ross, D., & Ross, S.A. — Transmission of aggression through imitation of film-mediated aggressive models, Journal of Abnormal and Social Psychology 63, 575-582 (1961) · full text
Tolman, E.C. — Cognitive maps in rats and men, Psychological Review 55, 189-208 (1948) · full text
Baddeley, A.D. & Hitch, G. — Working memory, in Bower (ed.), The Psychology of Learning and Motivation Vol. 8 (Academic Press, 1974), pp. 47-89 · full chapter
Ebbinghaus, H. — Memory: A Contribution to Experimental Psychology (1885; Dover reprint, 1964) · Ch. 1-7
Loftus, E.F. & Palmer, J.C. — Reconstruction of automobile destruction, Journal of Verbal Learning and Verbal Behavior 13, 585-589 (1974) · full text
Anderson, J.R. — Cognitive Psychology and Its Implications (Worth, 8th ed., 2015) · Ch. 6-9
Schacter, D.L. — The Seven Sins of Memory (Houghton Mifflin, 2001) · full text

Estimated time

beginner: 35m
intermediate: 60m
master: 90m