Extinction (psychology)

Extinction is a behavioral phenomenon observed in both operantly conditioned and classically conditioned behavior, which manifests itself by fading of non-reinforced conditioned response over time. When operant behavior that has been previously reinforced no longer produces reinforcing consequences the behavior gradually stops occurring. In classical conditioning, when a conditioned stimulus is presented alone, so that it no longer predicts the coming of the unconditioned stimulus, conditioned responding gradually stops. For example, after Pavlov's dog was conditioned to salivate at the sound of a metronome, it eventually stopped salivating to the metronome after the metronome had been sounded repeatedly but no food came. Many anxiety disorders such as post traumatic stress disorder are believed to reflect, at least in part, a failure to extinguish conditioned fear.

Theories
The dominant account of extinction involves associative models. However, there is debate over whether extinction involves simply "unlearning" the unconditional stimulus (US) – Conditional stimulus (CS) association (e.g., the Rescorla–Wagner account) or, alternatively, a "new learning" of an inhibitory association that masks the original excitatory association (e.g., Konorski, Pearce and Hall account). A third account concerns non-associative mechanisms such as habituation, modulation and response fatigue. Myers & Davis review fear extinction in rodents and suggested that multiple mechanisms may be at work depending on the timing and circumstances in which the extinction occurs.

Given the competing views and difficult observations for the various accounts researchers have turned to investigations at the cellular level (most often in rodents) to tease apart the specific brain mechanisms of extinction, in particular the role of the brain structures (amygdala, hippocampus, the prefrontal cortex), and specific neurotransmitter systems (e.g., GABA, NMDA). A recent study in rodents by Amano, Unal and Paré published in Nature Neuroscience found that extinction of a conditioned fear response is correlated with synaptic inhibition in the fear output neurons of the central amygdala that project to the periaqueductal gray that controls freezing behavior. They infer that inhibition derives from the ventromedial prefrontal cortex and suggest promising targets at the cellular level for new treatments of anxiety.

Classical conditioning
Learning extinction can also occur in a classical conditioning paradigm. In this model, a neutral cue or context can come to elicit a conditioned response when it is paired with an unconditioned stimulus. An unconditioned stimulus is one that naturally and automatically triggers a certain behavioral response. A certain stimulus or environment can become a conditioned cue or a conditioned context, respectively, when paired with an unconditioned stimulus. An example of this process is a fear conditioning paradigm using a mouse. In this instance, a tone paired with a mild footshock can become a conditioned cue, eliciting a fear response when presented alone in the future. In the same way, the context in which a footshock is received such as a chamber with certain dimensions and a certain odor can elicit the same fear response when the mouse is placed back in that chamber in the absence of the footshock.

In this paradigm, extinction occurs when the animal is re-exposed to the conditioned cue or conditioned context in the absence of the unconditioned stimulus. As the animal learns that the cue or context no longer predicts the coming of the unconditioned stimulus, conditioned responding gradually decreases, or extinguishes.

Operant conditioning
In the operant conditioning paradigm, extinction refers to the process of no longer providing the reinforcement that has been maintaining a behavior. Operant extinction differs from forgetting in that the latter refers to a decrease in the strength of a behavior over time when it has not been emitted. For example, a child who climbs under his desk, a response which has been reinforced by attention, is subsequently ignored until the attention-seeking behavior no longer occurs. In his autobiography, B.F. Skinner noted how he accidentally discovered the extinction of an operant response due to the malfunction of his laboratory equipment: "My first extinction curve showed up by accident. A rat was pressing the lever in an experiment on satiation when the pellet dispenser jammed. I was not there at the time, and when I returned I found a beautiful curve. The rat had gone on pressing although no pellets were received. ... The change was more orderly than the extinction of a salivary reflex in Pavlov's setting, and I was terribly excited. It was a Friday afternoon and there was no one in the laboratory who I could tell. All that weekend I crossed streets with particular care and avoided all unnecessary risks to protect my discovery from loss through my accidental death."

When the extinction of a response has occurred, the discriminative stimulus is then known as an extinction stimulus (SΔ or S-delta). When an S-delta is present, the reinforcing consequence which characteristically follows a behavior does not occur. This is the opposite of a discriminative stimulus which is a signal that reinforcement will occur. For instance, in an operant chamber, if food pellets are only delivered when a response is emitted in the presence of a green light, the green light is a discriminative stimulus. If when a red light is present food will not be delivered, then the red light is an extinction stimulus (food here is used as an example of a reinforcer). However, some make the distinction between extinction stimuli and "S-Delta" due to the behavior not having a reinforcement history, i.e. in an array of three items (phone, pen, paper) "Which one is the phone" the "pen" and "paper" will not produce a response in the teacher but is not technically extinction on the first trial due to selecting "pen" or "paper" missing a reinforcement history. This still would be considered as S-Delta.

Successful Extinction Procedures
In order for extinction to work effectively, it must be done consistently. Extinction is considered successful when responding in the presence of an extinction stimulus (a red light or a teacher not giving a bad student attention, for instance) is zero. When a behavior reappears again after it has gone through extinction, it is called spontaneous recovery. It (extinction) is the result of challenging behavior(s) no longer occurring without the need for reinforcement. If there is a relapse and reinforcements are given, the problem behavior will return. Extinction can be a long process; therefore, it requires that the facilitator of the procedure be completely invested from beginning to end in order for the outcome to be successful. The fewer challenging behaviors observed after extinction will most likely produce a less significant spontaneous recovery. While working towards extinction there are different distributions or schedules of when to administer reinforcements. Some people may use an intermittent reinforcement schedule that include: fixed ratio, variable ratio, fixed interval and variable interval. Another option is to use a continuous reinforcement. Schedules can be both fixed and variable and also the number of reinforcements given during each interval can vary.

Extinction procedures in the classroom
A positive classroom environment wields better results in learning growth. Therefore, in order for children to be successful in the classroom, their environment should be free of problem behaviors that can cause distractions. The classroom should be a place that offers consistency, structure, and stability, where the student feels empowered, supported and safe. When problem behaviors occur, learning opportunities decrease. Problem behaviors in the classroom that would benefit from extinction may include off-task behaviors, blurting, yelling, interrupting and use of inappropriate language. The use of extinction has been used primarily when the problem behaviors interfered with successful classroom outcomes. While other methods have been used in conjunction with extinction, positive outcomes are not likely when extinction is not used in behavior interventions.

Burst
While extinction, when implemented consistently over time, results in the eventual decrease of the undesired behavior, in the short term the subject might exhibit what is called an extinction burst. An extinction burst will often occur when the extinction procedure has just begun. This usually consists of a sudden and temporary increase in the response's frequency, followed by the eventual decline and extinction of the behavior targeted for elimination. Novel behavior, or emotional responses or aggressive behavior, may also occur.

For example, a pigeon has been reinforced to peck an electronic button. During its training history, every time the pigeon pecked the button, it will have received a small amount of bird seed as a reinforcer. Thus, whenever the bird is hungry, it will peck the button to receive food. However, if the button were to be turned off, the hungry pigeon will first try pecking the button just as it has in the past. When no food is forthcoming, the bird will likely try repeatedly. After a period of frantic activity, in which their pecking behavior yields no result, the pigeon's pecking will decrease in frequency.

Although not explained by reinforcement theory, the extinction burst can be understood using control theory. In perceptual control theory, the degree of output involved in any action is proportional to the discrepancy between the reference value (desired rate of reward in the operant paradigm) and the current input. Thus, when reward is removed, the discrepancy increases, and the output is increased. In the long term, 'reorganisation', the learning algorithm of control theory, would adapt the control system such that output is reduced.

The evolutionary advantage of this extinction burst is clear. In a natural environment, an animal that persists in a learned behavior, despite not resulting in immediate reinforcement, might still have a chance of producing reinforcing consequences if the animal tries again. This animal would be at an advantage over another animal that gives up too easily.

Despite the name, however, not every explosive reaction to adverse stimuli subsides to extinction. Indeed, a small minority of individuals persist in their reaction indefinitely.

Extinction-induced variability
Extinction-induced variability serves an adaptive role similar to the extinction burst. When extinction begins, subjects can exhibit variations in response topography (the movements involved in the response). Response topography is always somewhat variable due to differences in environment or idiosyncratic causes but normally a subject's history of reinforcement keeps slight variations stable by maintaining successful variations over less successful variations. Extinction can increase these variations significantly as the subject attempts to acquire the reinforcement that previous behaviors produced. If a person attempts to open a door by turning the knob, but is unsuccessful, they may next try jiggling the knob, pushing on the frame, knocking on the door or other behaviors to get the door to open. Extinction-induced variability can be used in shaping to reduce problematic behaviors by reinforcing desirable behaviors produced by extinction-induced variability.

Autism
Children with Autism Spectrum Disorder (ASD) are known to have restricted or repetitive behaviors that can cause problems when trying to function in day-to-day activities. Extinction is used as an intervention to help with problem behaviors. Some problem behaviors may include but are not limited to, self-injurious behaviors, aggression, tantrums, problems with sleep, and making choices. Ignoring certain self-injurious behaviors can lead to the extinction of said behaviors in children with ASD. Escape Extinction (EE) is commonly used in instances when having to make choices causes problem behavior. An example could be having to choose between mint or strawberry flavored toothpaste when brushing your teeth. Those would be the only two options available. When implementing EE, the interventionist will use physical and verbal prompting to help the subject make a choice.

Glutamate
Glutamate is a neurotransmitter that has been extensively implicated in the neural basis of learning. D-Cycloserine (DCS) is a partial agonist for the glutamate receptor NMDA at the glycine site, and has been trialed as an adjunct to conventional exposure-based treatments based on the principle of cue extinction.

A role for glutamate has also been identified in the extinction of a cocaine-associated environmental stimuli through testing in rats. Specifically, the metabotropic glutamate 5 receptor (mGlu5) is important for the extinction of a cocaine-associated context and a cocaine-associated cue.

Dopamine
Dopamine is another neurotransmitter implicated in learning extinction across both appetitive and aversive domains. Dopamine signaling has been implicated in the extinction of conditioned fear    and the extinction of drug-related learning

Circuitry
The brain region most extensively implicated in learning extinction is the infralimbic cortex (IL) of the medial prefrontal cortex (mPFC) The IL is important for the extinction of reward- and fear-associated behaviors, while the amygdala has been strongly implicated in the extinction of conditioned fear. The posterior cingulate cortex (PCC) and temporoparietal junction (TPJ) have also been identified as regions that may be associated with impaired extinction in adolescents.

Across development
There is a strong body of evidence to suggest that extinction alters across development. That is, learning extinction may differ during infancy, childhood, adolescence and adulthood. During infancy and childhood, learning extinction is especially persistent, which some have interpreted as erasure of the original CS-US association,  but this remains contentious. In contrast, during adolescence and adulthood extinction is less persistent, which is interpreted as new learning of a CS-no US association that exists in tandem and opposition to the original CS-US memory.