Thus, over the course of learning, behavior incrementally converges on statistically optimal behavioral strategies given the context. In the striatum, which representations to gate into working memory and which to suppress may be learned through modulation of synaptic plasticity by dopaminergic RPE signals computed selleck screening library in the midbrain. For example, these signals may modulate the activity of
separate populations of “Go” and “NoGo” neurons that express D1 and D2 dopamine receptors respectively (Shen et al., 2008; O’Reilly and Frank, 2006). Applied to the cognitive control of memory, RPE could hypothetically operate in a similar manner, reinforcing or punishing selection/maintenance of a particular retrieval strategy given the context. Becker and Lim (2003) proposed a model of semantic clustering in free recall that provides an example of DNA Damage inhibitor how RPE might drive adjustments in control of memory (also see Gorski and Laird, 2011). This model sought to simulate semantic clustering strategies during recall. Clustering was
implemented by maintaining a semantic context in “PFC” working memory units where it influenced serial retrieval by the MTL/hippocampus. After each item was retrieved, it was assessed for its familiarity. Items associated with too much or too little familiarity were judged as errors (i.e., repetitions or intrusions, respectively). Either of these errors produced a negative RPE that punished the maintenance of a particular semantic context (i.e., retrieval strategy) in PFC. When enough such errors accumulated, the category maintained in PFC shifted. This model simulates classical semantic clustering, as well as reductions in recall due to a “frontal” challenge, namely dividing attention (Moscovitch, 1994). Importantly, the model highlights that recall itself can be affected not only by demands on maintaining a strategy but also detecting when a strategy has become obsolete and a shift unless is in order. Consistent with this insight, frontal patients have been shown to use fewer numbers of semantic categories for clustering than controls, even when controlling for
deficits in the degree to which they retrieve semantically related items consecutively (Jetter et al., 1986; Hildebrandt et al., 1998). Hence, this model illustrates that RPE could be an important signal used by the brain to adjust memory retrieval strategies. Within the declarative memory domain, there is some behavioral evidence that participants adjust their retrieval strategies based on feedback about outcomes. Han and Dobbins (2009) manipulated explicit feedback to differentially reinforce “old” responses in a recognition memory task and found that participants become more or less likely to endorse memory probes as “old.” This shift in behavior occurred gradually over the course of learning and persisted even in blocks after the feedback was removed.