TY - JOUR T1 - Rapid Learning of Odor–Value Association in the Olfactory Striatum JF - The Journal of Neuroscience JO - J. Neurosci. SP - 4335 LP - 4347 DO - 10.1523/JNEUROSCI.2604-19.2020 VL - 40 IS - 22 AU - Daniel J. Millman AU - Venkatesh N. Murthy Y1 - 2020/05/27 UR - http://www.jneurosci.org/content/40/22/4335.abstract N2 - Rodents can successfully learn multiple novel stimulus–response associations after only a few repetitions when the contingencies predict reward. The circuits modified during such reinforcement learning to support decision-making are not known, but the olfactory tubercle (OT) and posterior piriform cortex (pPC) are candidates for decoding reward category from olfactory sensory input and relaying this information to cognitive and motor areas. Through single-cell recordings in behaving male and female C57BL/6 mice, we show here that an explicit representation for reward category emerges in the OT within minutes of learning a novel odor–reward association, whereas the pPC lacks an explicit representation even after weeks of overtraining. The explicit reward category representation in OT is visible in the first sniff (50–100 ms) of an odor on each trial, and precedes the motor action. Together, these results suggest that the coding of stimulus information required for reward prediction does not occur within olfactory cortex, but rather in circuits involving the olfactory striatum.SIGNIFICANCE STATEMENT Rodents are olfactory specialists and can use odors to learn contingencies quickly and well. We have found that mice can readily learn to place multiple odors into rewarded and unrewarded categories. Once they have learned the rule, they can do such categorization in a matter of minutes (<10 trials). We found that neural activity in olfactory cortex largely reflects sensory coding, with very little explicit information about categories. By contrast, neural activity in a brain region in the ventral striatum is rapidly modified in a matter of minutes to reflect reward category. Our experiments set up a paradigm for studying rapid sensorimotor reinforcement in a circuit that is right at the interface of sensory input and reward areas. ER -