Abstract
Memory stability is essential for animal survival when environment and behavioral state change over short or long time spans. The stability of a memory can be expressed by its duration, its perseverance when conditions change as well as its specificity to the learned stimulus. Using optogenetic and pharmacological manipulations in male mice, we show that the presence of noradrenaline in the olfactory bulb during acquisition renders olfactory memories more stable. We show that while inhibition of noradrenaline transmission during an odor–reward acquisition has no acute effects, it alters perseverance, duration, and specificity of the memory. We use a computational approach to propose a proof of concept model showing that a single, simple network effect of noradrenaline on olfactory bulb dynamics can underlie these seemingly different behavioral effects. Our results show that acute changes in network dynamics can have long-term effects that extend beyond the network that was manipulated.
SIGNIFICANCE STATEMENT Olfaction guides the behavior of animals. For successful survival, animals have to remember previously learned information and at the same time be able to acquire new memories. We show here that noradrenaline in the olfactory bulb, the first cortical relay of the olfactory information, is important for creating stable and specific olfactory memories. Memory stability, as expressed in perseverance, duration and specificity of the memory, is enhanced when noradrenergic inputs to the olfactory bulb are unaltered. We show that, computationally, our diverse behavioral results can be ascribed to noradrenaline-driven changes in neural dynamics. These results shed light on how very temporary changes in neuromodulation can have a variety of long-lasting effects on neural processing and behavior.
Introduction
Stability to maintain skills and stimulus associations while adjusting to new circumstances is fundamental for animal behavior. Neural systems need to be both flexible to adapt rapidly to new information and stable to maintain learned behaviors to ensure survival. Neuromodulatory systems in particular are thought to have evolved to convey flexibility to neural systems by allowing them to process stimuli in very different manners depending on behavioral demands.
For example, acetylcholine may sharpen stimulus-evoked oscillations in neural networks to enhance attention to a stimulus on a momentary basis (Sarter et al., 2006; Parikh and Sarter, 2008), dopamine (DA) may increase plasticity when a stimulus is unexpected and in need of reinforcement (Gentry et al., 2019), or norepinephrine (NE) may increase the signal-to-noise ratio in specific networks during moments of stress to enhance recognition and processing of a stimulus (McBurney-Lin et al., 2019). NE has long been associated with olfactory learning (Devore and Linster, 2012; Linster and Escanilla, 2019) and has been shown to strongly modulate the processing of olfactory stimuli as early as in the olfactory bulb (OB). NE inputs to the OB change processing of low-amplitude odors, increases associative learning, and increases the signal-to-noise ratio (Doucette et al., 2007; Escanilla et al., 2010, 2012; Devore and Linster, 2012; Linster and Escanilla, 2019). We here show that in addition to these acute effects, temporary manipulations of bulbar NE modulate the long-term stability of olfactory memories beyond the timeframe of the manipulations. We measure the stability of a memory by its perseverance during contextual changes, its duration, and its specificity for the encoded stimulus. We use optogenetic inhibition of locus coeruleus (LC) fibers locally in the olfactory bulbs during acquisition of an odor–reward association to temporarily decrease NE activity. Mice exhibit a decrease of stability of the odor–reward association when NE activity is decreased, evidenced by less perseverance when odor–reward associations are reversed and by shorter duration of the acquired memory. The testing for memory stability was performed without manipulations of local NE and any effects of decreased NE activity were because of differences in acquisition. To ensure that our optogenetic manipulations were effective and to determine which bulbar NE receptors were mediating the observed effects, we then repeated the memory duration experiment using local infusions of NE antagonists. Perseverance and duration are two aspects of memory stability. Next, we show another aspect of memory stability, the specificity of the memory for the learned odor, which is also dependent on bulbar NE influx. A proof of concept model for odor learning driven by a realistic well vetted computational model of olfactory processing and its modulation by NE shows that our diverse behavioral results can be mediated by NE-driven changes in neural dynamics. The novelty of our results is showing that very temporary changes in neuromodulation can have a variety of long-lasting effects resulting from the same neural mechanism and that changes in very early sensory networks can have far-reaching effects.
Materials and Methods
Optogenetic experiments (experiments 1 and 2)
Animals
Twelve adult male C57BL6/J mice (Charles River Laboratories) that were 2 months of age at the beginning of the experiments were used for this experiment. Mice were housed in standard laboratory cages and were kept on a 12 h light/dark cycle (at a constant temperature of 22°C) with food and water ad libitum except during behavioral tests during which they were food deprived (∼20% reduction of daily consummation, leading to a 10% reduction in body weight). Mice were housed by group of five, and individually after surgery. All experimental procedures were validated by Lyon 1 and the French Ethical Committee (protocol #DR2013-48).
Odorants
Seven pair of odorants were used in these experiments. Odorants were diluted in mineral oil to achieve an approximate gas-phase partial pressure of 10 Pa (Cleland et al., 2002; Kermen et al., 2011; Table 1).
Viral vector injection and optical fiber implantation
Before surgery, mice were anesthetized with a cocktail injection of 50 mg/kg ketamine and 7.5 mg/kg xylazine administered intraperitoneally and secured in a stereotaxic instrument (Narishige). Three hundred nanoliters of hSyn-eNpHR3.0-EYFP lentivirus (9.22 × 106 IU/ml, expressing halorhodopsin and the yellow fluorescent protein; NpHR mice, n = 7) and 300 nl of control (Ctrl) hSyn-EYFP lentivirus [1.1 × 106 IU/ml, expressing only enhanced yellow fluorescent protein (EYFP); Ctrl mice, n = 5] were injected bilaterally into the locus coeruleus at the following coordinates with respect to the bregma: anteroposterior (AP), ±5.4 mm; mediolateral (ML), ±0.9; dorsoventral (DV), –4 mm at a rate of 150 nl/min. In all mice, bilateral optical fibers [core diameter, 200 nm; 0.22 numerical aperture (NA); Doric Lenses] were implanted into the OBs (from bregma: AP, +4.6 mm; ML, ±0.75 mm; DV, −2 mm; Fig. 1A). Behavioral experiments were performed 8 weeks after surgery. The pLenti-hSyn-eNpHR3.0-EYFP lentivirus was a gift from Karl Deisseroth (Stanford University: Stanford, CA) (Gradinaru et al., 2010) and was obtained through Addgene (plasmid #26775). Elaboration of the control pLenti-hSyn-EYFP lentivirus has been previously described (Kermen et al., 2016). The expression of light-sensitive chloride pumps and the delivery of light to axonal projection targets has been successfully used to inhibit activity at presynaptic terminals (Spellman et al., 2015; Stuber et al., 2011; Tye et al., 2011; Raimondo et al., 2012).
Behavioral procedure
Apparatus
Behavioral training took place on a computer-assisted two-hole board apparatus described previously (Mandairon et al., 2009). The hole board is equipped with capacitive sensors that monitor the events of nose poking (visits) in the holes. The holes were odorized by placing a cotton swab impregnated with 60 µl of 10 Pa odorant under bedding in a small dish placed into the hole. A food reward was buried into the bedding of one of the holes, with the location of the odor reward randomly determined for each trial.
Shaping
The mice were first trained to retrieve a reward (a small bit of sweetened cereal; Kellogg's) by digging through the bedding. The mouse was put in the start area of the two-hole board apparatus and allowed to dig for 1 min. During the first few trials, the reward was placed on top of the bedding in one of the holes. After the mice successfully retrieved the reward several times, it was successively buried deeper and deeper in the bedding. Shaping was considered to be complete when a mouse could successfully retrieve a reward buried deep in the bedding for at least 16 of 20 trials. Odor set 1 (Table 1) was used for shaping.
Acquisition
Each session consisted of 1 min trials during which the mouse was allowed to retrieve the food reward from the hole. If a mouse failed to find the reward after 60 s, the trial was ended and the mouse replaced on the starting position behind a cover while the next trial was set up. The inhibition of NA fibers in the OB was performed by bilateral continuous light stimulation (crystal laser: 561 nm, 10–15 mW) automatically triggered by the entry of the mouse's nose within a 5 cm zone around the odorized hole (light-triggering zone; VideoTrack, Viewpoint) and stopped automatically when the nose of the mouse exited the zone.
Reversal test
Mice were first trained for 10 trials of 1 min (see Acquisition) on an odor–reward association immediately followed by 15 trials with reversed odor–reward contingency (experiment 1). Optical stimulation was used for the initial 10 acquisition trials only. Mice were tested using odor set 6 with odor 1 (O1) associated with the reward in the first 10 trials and O2 associated with the reward for the last 10 trials. Each mouse was tested once.
Long-term memory test
Mice were trained with optical stimulation during the 20 acquisition trials and tested without optical stimulation for 5 trials 2 and 24 h later (experiment 3). Each mouse was tested twice in this experiment, once with odor set 3 and once with odor set 4, with O1 for each odor set associated with reward. A control experiment tested the role of NE inputs to the OB for recall: mice were trained during 20 trials without optical fiber stimulation and tested at 2 and 24 h with optical stimulation. Each mouse was tested once in this experiment, using odor set 5, with O1 associated with reward. In a control experiment, mice were first trained during 20 trials without optical stimulation and tested 2 and 24 h later with optical stimulation (odor set 2; Table 1).
Experimental design and statistical analysis
In experiments 1 and 2, 12 male mice (5 controls and 7 NpHR mice) were tested. Mice were tested twice with separate odor sets in experiment 2. All statistical analyses were performed using SPSS. Analysis was performed on the latency (delay) to dig in the rewarded dish as the dependent variable. The latency to dig in the rewarded dish is a good indicator for the strength of the acquired memory, with short latencies signaling a strong memory and fast decision-making, and longer latencies signaling slower decision-making and weaker learning (Mandairon et al., 2018). Repeated-measures ANOVA was used to assess how the blockade of NE projections modulates acquisition or recall. In each case, the experimental group (control or NpHR) was used as the between-subjects factor, and trial block (TB) or trial number as the within-subjects factor (α = 0.05). Pairwise comparisons between trial blocks to assess acquisition and recall were performed using Wilks' λ with α = 0.05. Successful reversal learning was assessed by comparing the delays during the last acquisition trial block (trials 6–10) to those during the last reversal trial block (trials 21–25). Long-term memory was assessed by comparing delays during the last acquisition trial block (trials 16–20) to those at 2 and 24 h. All raw data used for statistical analysis are available in Extended Data Fig. 1-1.
Cellular analysis
Mice were killed using pentobarbital (0.2 ml/30 g) and intracardiac perfusion of 50 ml of fixative (PFA 4%), pH 7.4. The brains were removed, postfixed overnight, cryoprotected in sucrose (20%), frozen rapidly, and then stored at −20°C before sectioning with a cryostat. Immunohistochemistries of NET (NE-Transporter to label NE fibers) and EYFP (to label transduced fibers) were performed in four to six sections (40 µm thick) of the OB distributed along its anteroposterior axis, using anti-NorEpinephrin Transporter (mouse; 1/1000; MAb Technologies) and anti-GFP (chicken; 1/1000; AnaSpec/Tebu). Appropriate secondary antibodies were used (goat anti-mouse Alexa Fluor 546 vector (1/250); and goat anti-chicken Alexa Fluor 488 (1/250); Thermo Fisher Scientific). Sections were then cover-slipped in Vectashield (Vector Laboratories). All fluorescent analyses were performed blind with regard to the identity of the animal. Images were taken in the granule cell (GC) layer of the OB with a Zeiss microscope equipped with an apotome, using 40× oil-immersion objective. Z-stacks were acquired with a 0.2 µm interval between images. Eight to thirteen pictures per animal were analyzed. Length of NET-positive and NET/GFP-positive fibers were analyzed with 3D viewer in ImageJ and NET/GFP-positive fibers among NET-positive fibers assessed (Fig. 1B). No animal was excluded from the analysis. The amount of overlap between GFP and NET expressing fibers was similar in Ctrl (mean ± SEM, 24.5 ± 3.85; n = 5) and NpHR (mean ± SEM, 16.36 ± 4.90; n = 7) mice (F(1,10) = 1.827; p = 0.206).
Pharmacological experiments
Animals
Twelve (experiment 3) and 9 (experiment 4) adult male C57BL6/J mice (Charles River Laboratories) that were 2 months of age at the beginning of the experiments were used for pharmacological experiments. Mice were housed in standard laboratory cages and were kept on a 12 h light/dark cycle with food and water available ad libitum except during behavioral test where they were food deprived to no less than 85% of their free feed weight. All experimental procedures were conducted under a protocol approved by the Cornell University Institutional Animal Care and Use Committee.
Odorants
Odorants are listed in Tables 1 and 2. Before each behavioral session, 60 μl of odor was loaded into 5 ml of sand and then covered with an additional 5 ml of sand.
Cannulation surgery
After behavioral shaping and before experiments, mice underwent surgery to implant bilateral cannulae in the OB for drug delivery according to established methods. Mice were anesthetized with gas anesthesia (isoflurane, 2–4%), injected intraperitoneally with 0.05 mg/kg atropine, and guide cannulae (22 gauge; Plastics One) were inserted 5 mm anterior and 1.5 mm ventral from bregma and affixed to the skull with dental cement (Guérin et al., 2008; Tong et al., 2018). After surgery, mice were given pain killers and saline injections and were allowed to recover for 7 d.
Drug infusions
Twenty minutes before behavioral testing, mice were bilaterally infused with NE antagonists or saline. Two microliters of solution was infused at 1 μl/min, and the infusion needle was left in place for 5 min after infusion. For long-term memory testing, mice were infused with saline, the α1-blocker prazosin (hydrochloride, 1 mm), the α2-blocker yohimbine (hydrochloride, 2 mm), the β-antagonist alprenolol (hydrochloride, 12 mm) or a cocktail of all three, all purchased from Sigma-Aldrich and diluted in 0.9% saline. For specificity testing, mice were infused with the nonspecific α-antagonist phentolamine (12 mm) or saline, with dosages determined from our previous behavioral experiments (Mandairon et al., 2008; Escanilla et al., 2010, 2012).
Experimental procedure
Behavioral testing took place in a modified mouse cage with a start and a testing chamber separated by an opaque removable Plexiglas door. Mice were put into the start chamber with the door closed and Petri dishes with a sand–odor mix were placed into the test chamber with a sugar pellet in the rewarded odor dish. The divider was opened, and mice were allowed to dig in the dishes to retrieve the sugar reward. The time delay to dig in the correct dish was recorded by hand and later double checked on the video trace. Mice were shaped to dig until they consistently retrieved the reward for 18 of 20 trials. For long-term memory tests, each mouse was trained for 20 trials using two scented dishes, one rewarded and one not, and tested on five trials 24 h later. Odor sets 1 and 2 were used for shaping and odor sets 3–7 were used for the experimental trials. Each mouse was tested on each drug condition (saline, α1-blocker, α2-blocker, β-blocker, all blockers) with a different odor set; the order of drug conditions was pseudorandomized and counterbalanced among mice. For specificity testing, each mouse was trained on an odor–reward association with a straight chain aliphatic odorant (C) for 4, 8, or 12 trials paired with an unscented dish, immediately followed by unrewarded test trials with the conditioned odor (C), two similar odors differing by one or two carbons from the conditioned odor (C + 1 and C + 2) and one unrelated odor (X; Table 2). Each mouse was trained and tested under each drug condition and number of training trials with a different odor set, with order of drug conditions, number of trials, and number of odor sets randomized and counterbalanced.
Experimental design and statistical analysis
Experiment 3 used 12 male mice who were each tested on each of five drug conditions. To test for the role of NE receptors in acquisition and 24 h recall, we used repeated-measures ANOVAs with latency to dig in the correct dish as the dependent variable, drug group as the between-subjects factor, and trial block or trial number as the within-subjects factor (α = 0.05), followed by pairwise comparisons between trial blocks to assess acquisition and recall using Wilks' λ. Memory at the 2 and 24 h tests was assessed by comparing delays to find the rewarded odor during the last acquisition trial block (trials 16–20) to the 2 and 24 h trial block. Experiment 4 used nine male mice who were tested on each drug/number of trial combination (2 × 3 design). For specificity testing, data were analyzed using a repeated-measures analysis with drug group (saline or phentalomine) and number of training trials (4, 8, or 12) as between-subjects factors and digging times in response to unrewarded test odors (C, C + 1, C + 2, and X) as within-subjects factor, followed by pairwise comparisons between digging responses in the conditioned and novel odors for each group (Wilks' λ) to assess memory specificity for the conditioned odor. All data analysis was performed in SPSS. All raw data used for statistical analysis is available in Extended Data Fig. 1-1.
Note on behavioral experiments
All behavioral experiments used a similar paradigm in which mice have to find a buried reward in an odorized dish. The details of the procedure (hole board vs modified home cage) and the number of trials differed between experiments. In experiment 1, we aimed to have acquisition complete without much overtraining to allow for reversal, if possible, and therefore chose 10 acquisition trials followed by 15 reversal trials. In experiments 2 and 3, we aimed to have enough trials for long-term 24 h memory to exist without overtraining the animals to be able to see an effect of NE modulation; we chose to train mice for 20 trials (Tong et al., 2018). In experiment 4, mice were trained on the odor–reward association with a scentless distracter odor because in this experiment we used a method to test how mice generalize between odorants (Linster and Hasselmo, 1999; Cleland et al., 2002), and we used a range of training trials (4, 8, and 12) to show that mice remember the odor more specifically when trained longer (Cleland et al., 2011; Cho and Linster, 2020). In each case the number of training trials was chosen based on experience to maximize the chance to see an effect and to avoid floor and ceiling effects.
Computational modeling
Computational modeling of the olfactory bulb followed the outline presented in the study by Linster and Kelsch (2019), with detailed equations and parameter sets described below, and the associated parameters described in Table 3. The modeled OB network incorporates the following five neuron types: olfactory sensory neurons (OSNs), mitral cells (MCs), external tufted cells (ETs), periglomerular cells (PGs), and GCs. Each group is composed of 100 neurons organized in functional columns with connectivity parameters that are specified in Table 3. MCs make synapses with 25% of GCs (pMC-GC = 0.25), and GCs make inhibitory local synapses onto MCs only. NE modulation to the OB was modeled according to the principles we discovered previously in brain slice and computational experiments (Nai et al., 2009, 2010; Linster et al., 2011); here, we simulated a high dosage of NE resulting in a dominance of α1-receptor effects on GC and MCs, which is also in agreement with the results from the pharmacological experiments presented here. Briefly, NE α1 modulation increases MC excitability with no change in membrane voltage or spontaneous activity, and increases GC activation with an increase in voltage and spontaneous activity (Nai et al., 2009, 2010; Linster et al., 2011). Learning an odor–reward association was modeled by projecting MC outputs in response to a conditioned stimulus (CS) to a response neuron (RN) that also received “reward” information [unconditioned stimulus (US); see Fig. 4A]. Excitatory synapses between MCs and the RN underwent activity-dependent synaptic plasticity when reward was present as well as a slow exponential decay toward baseline when reward was not present. This exponential decay had a time constant of 10 d, which resulted in memory durations similar to those observed experimentally for control mice and control simulations (NE). Reward association learning was simulated as follows. For each simulated “trial block” (five trials of 30 s each), the conditioned odor was paired with reward (activation of RN by US), which resulted in changes in synaptic weights between MCs and RN. After each trial block, we then set the reward to zero and presented the C, two overlapping odors with varying degree of overlap (C + 1, C + 2; 78% response; 34% correlation with C), and an unrelated odor (X; −0.42 correlation with C) for one simulated trial, computed the resulting activation of the RN (Devore et al., 2014), and saved the synaptic weight amplitudes. This was repeated over the course of four trial blocks (20 trials total) to test to what degree the specificity of the association evolved as a function of learning and depends on the presence of NE. Memory duration in the model was assessed by presenting the conditioned odor at intervals of 1 h simulated during a 24 h forgetting time. We ran 10 different instances of the model, each initialized with a different seed for the random number generator.
Analysis
To assess memory duration, we statistically compared RN response amplitudes (spiking probabilities) during preacquisition testing to that during each segment of the 24 h forgetting period. To assess memory specificity, we statistically compared RN response magnitudes to conditioned and test odors after each trial block during acquisition.
Network architecture
The modeled OB network incorporates the following five neuron types: OSNs, MCs, ETs, PGs, and GCs. Each group is composed of 100 neurons organized in functional columns. MCs make synapses with 25% of GCs (pMC-GC = 0.25), and GCs make inhibitory local synapses only (McIntyre and Cleland, 2016). To assess associations between odors (CS) and rewards (US), an RN was added that received excitatory synaptic inputs with very low initial weights from all MCs and underwent activity-dependent synaptic plasticity when US and CS were present at the same time. During postacquisition, when no US was present, these synapses underwent a slow exponential decay back to baseline values. The response magnitude of the RN was measured as instantaneous spiking probabilities.
Neurons and synapses
Our model is composed of single compartment leaky integrate-and-fire neurons, with the exception of MC which are modeled as two compartments. Changes in membrane voltage v(t) over time in each compartment are described by Equation 1: (1) where τ is the membrane time constant and Vext(t) is the voltage change resulting from external inputs (synaptic or sensory).
Each one of the voltage changes because of external inputs Vext is a result of the synaptic strength of the connection from neuron j to neuron i (wij), and the respective synaptic conductance in cell i at time t (gi(t)). EN,ij is the Nernst potential of the synaptic current, and vi(t) is the membrane potential of the postsynaptic neuron i, as described in Equation 2: (2)
The communication between neurons happens via discrete spikes. The spiking output F(5) of a given neuron, I, is a function of its membrane potential v and the minimal threshold and saturation threshold of the output function, θmin and θmax, where Fi(5) = 0 if v ≤ θmin and Fi(5) = 1 if v ≥ θmax and Fi(5) increases linearly between θmin and θmax
Fi(5) defines their instantaneous firing probability. The time course of the conductance change is calculated as follows: (3) where gmax is a constant with no unit representing the maximum conductance of a given channel and is equal to 1 (synaptic strength is scaled by the synaptic weight w), while τ1 and τ2 are the rising and falling times of this conductance. After firing, the spike of each spiking neuron is reset to the spiking threshold (Vrest).
In the simulations presented here, simulated exposure to an odorant induced activity-dependent plasticity of synapses from MC to the RN. Synaptic strengths were first calculated from the parameters given in Table 1. During simulated trial blocks, synapses between MCs and the RN underwent synaptic potentiation, as follows: where wij is the synaptic strength between the presynaptic MC and the postsynaptic RN, α (α = 0.01) is the rate of potentiation, and xj and ξ are the total numbers of spikes emitted by the presynaptic and postsynaptic cells during the preceding sniff cycles between t1 and t2. The synaptic weights also undergo postsynaptic normalization after the weight changes have been computed, with the sum of synaptic weights from MC to RN staying constant; this creates competition between synaptic weights over the course of training and leads to increasing specificity of synaptic weights to the odor used for conditioning. Postacquisition synaptic weights decay exponentially with a long time constant (10 h) to simulate forgetting in the absence of US–CS pairings. This time constant was adjusted to result in a long-term memory of at least 24 h for simulations in which NE was present during acquisition.
Implementation
All simulations were implemented using the C programming language in a Linux environment (Ubuntu 14.04 LTS x64) on an Intel desktop computer, with the Euler integration method used for the differential equations with a time step of 1 ms.
Data availability
The code/software described in the article is freely available online at https://senselab.med.yale.edu/modeldb/enterCode?model=266801. The code is available in Extended Data Fig. 2-1.
Results
We first show that memory stability depends on bulbar LC inputs by optogenetically decreasing NE fiber activity in the OB while testing mice on a reversal task (experiment 1) and a long-term memory task (experiment 2). We then use a pharmacological approach to show that our optical manipulation was effective and to narrow down NE receptor types underlying our observations (experiment 3). Last, we show that memory stability as expressed by the specificity of this memory is modulated by bulbar NE (experiment 4) and use a computational approach to propose a single mechanism that could underlie all of these observations.
NE inhibition in the olfactory bulb decreases odor memory stability expressed as perseverance in response to change
Experiment 1 tested how stable a memory acquired after a training period of 10 trials is by using a reversal training paradigm. NpHR and Ctrl mice were trained under light stimulation (with optical stimulation ON when the nose of the mouse was within a 5 cm radius of each odor dish) in a 10-trial simultaneous go-no-go task (Escanilla et al., 2008; Chaudhury et al., 2009; Moreno et al., 2012; Mandairon et al., 2018) in which they had to associate odorant 1 with a reward while odorant 2 was not reinforced (odor set 6; Table 1). These 10 trials were immediately followed by 15 trials of reversal learning with no light stimulation in which odorant 1 was not reinforced and odorant 2 was rewarded. The latency to find the rewarded odorant was recorded as a measure for how well the odor–reward association was learned. A repeated-measures ANOVA showed an overall effect of trial block (Ftrialblock(4,3) = 15.958; p = 0.023) and a significant interaction with experimental group (Ftrialblock*group(4,3) = 5.835; p = 0.049), showing that how mice behaved over the course of trials depended on treatment. Both experimental groups rapidly acquired the odor–reward association as evidenced by significant decreases in delay to find the rewarded odor between the first and last trial blocks (1–5 vs 6–10; Ctrl: p = 0.002; and NpHR: p = 0.042; by Wilks' λ; Fig. 1C). During reversal learning, Ctrl mice showed a high degree of perseverance to the previously learned association even after 15 trials, evidenced by a significant increase in delay between the last acquisition and the last reversal trial blocks (6–10 vs 21–25; p = 0.034), whereas NpHR mice quickly learned the new odor–reward association, as evidenced by similar delays during the last acquisition and the last reversal trial block (6–10 vs 21–25; p = 0.268; Fig. 1C). Thus, mice with decreased NE during acquisition were more flexible during reversal.
Figure 1-1
Data depicted in Figure 1. Download Figure 1-1, XLSX file.
These results show that the stability of the acquired memory is less persistent for NpHR compared with Ctrl animals; while the effects of inhibiting bulbar NE are not evident during the initial acquisition of an odor–reward association, they manifest during reversal training, although NE is not modulated at that time.
NE inhibition in the olfactory bulb decreased odor memory stability expressed as memory duration
Experiment 2 tested how the presence of NE during learning affects the duration of the odor memory. We trained the same mice for 20 acquisition trials on the same task (odor sets 3 and 4; Table 1). We tested their recall ability 2 and 24 h later (Fig. 1Di). NE transmission in the OB was light inhibited for NpHR mice during the acquisition but not the recall trials. For acquisition trials, there was no significant interaction between the trial block and the experimental group (Ftrialblock*group(3,20) = 0.915; p = 0.081) showing that NpHR and Ctrl groups behaved similarly during acquisition. Both groups had significantly lower delays at the 2 h test than at the beginning of acquisition (trials 1–5 vs 2 h; p = 0.041 and 0.010) showing that the odor–reward association was remembered 2 h later. However, 24 h after training, NpHR mice exhibited delays as long as during initial training (trials 1–5 vs 24 h; p = 0.754), whereas Ctrl mice did not (p = 0.023), showing that while Ctrl mice remembered the association 24 h later, NpHR mice did not. Latencies to find the reward at the 24 h test were significantly increased compared with the 2 h test in NpHR (2 vs 24 h; p = 0.001) but not control mice (p = 0.649; Fig. 1Di). We directly compared performance during the first five acquisition trials and the five trials at the 24 h test: while control mice had significantly lower delays during the 24 h trials compared with the initial acquisition (F(1,8) = 7.9; p = 0.023), NpHR mice have similar delays in both cases (F(1,11) = 3.037; p = 0.109), showing that they relearned the task (Fig. 1Dii). A separate control experiment showed that NE inhibition during 2 or 24 h recall after acquisition without light inhibition did not affect the delays to find the correct odorant. Mice were first trained for 20 trials without light stimulation and underwent recall trials at 2 and 24 h with light stimulation. We found a significant effect of trial block (Ftrialblock(3,7) = 6.000; p = 0.017), but no interaction with group (Ftrialblock,group(3,7) = 0.824; p = 0.696) during acquisition showing that while both groups learned there was no difference in their acquisition curves. Neither group displayed significantly longer delays at 24 h than during the last acquisition trials (respectively, p = 0.669 and p = 0.759), showing that both groups remember the odor–reward association after 24 h and that the inhibition of NE release has no effect on recall.
These results show that while inhibition of bulbar NE release does not noticeably slow down acquisition of an odor–reward association, it does impair the duration of memory, confirming our hypothesis that the presence of NE during acquisition renders a memory more stable.
In experiment 3, to test the effectiveness of our manipulation and to identify NE receptors underlying the observed effects, we used a pharmacological approach. Mice (n = 12) were implanted with bilateral cannulae in the OB for intracerebral drug infusions and were tested using the same paradigm. In each training session, mice were trained for 20 trials with α1, α2, β, all, or no NE receptors blocked (Mandairon et al., 2008) and were tested 24 h later with a single five-trial block and no drug infusions (Fig. 2). Because of the longer time course of drug application, we omitted the 2 h test here. Results showed that all mice acquired the task similarly, but that mice with α1 receptors blocked during acquisition showed significantly longer delays to find the rewarded odor 24 h after training (Fig. 2). We found a significant effect of trial block (Ftrialblock(4,28) = 16.078; p < 0.001) as well as an interaction between trial block and drug treatment (Ftrialblock*drug(16,86.176) = 1.961; p = 0.025). Mice acquired the odor–reward association equally across drug treatments (comparison between delays to find rewarded odor during trials 1–5 and trials 16–20; p < 0.05 for all drug conditions; Table 4, exact values). At the 24 recall test, mice in the saline, α2, and β drug conditions showed significant recall (comparison between trials 1–5 and 24 h; p < 0.05; Table 4, exact values). In contrast, mice with α1 receptors or all receptors blocked did not exhibit significant recall after 24 h (p > 0.05; Table 4, exact values). Hence, α1 receptor blockade locally in the OB had effects similar to those of the light inhibition of NE fiber activity locally in the OB, showing that NE effects on long-term memory are mediated at least partially by α1 receptor activation, and that our light inhibition of NE release was effective.
Figure 2-1
Data depicted in Figure 2. Download Figure 2-1, XLSX file.
NE inhibition in the olfactory bulb decreases odor memory stability as expressed in specificity for the learned odor
Experiment 4 tested to what degree activation of NE receptors determines the specificity of an odor–reward association. We used a generalization task in which mice learn to associate an odor with reward and are later tested on novel odors to assess how specific the formed memory is (Cleland et al., 2002, 2009; Cleland and Narla, 2003; Mandairon et al., 2006; Chaudhury et al., 2009). Mice (n = 9) were implanted with bilateral cannulae in their OBs and were trained to associate an odorant with a reward during 4, 8, or 12 trials. After the completion of training trials, mice were tested in four consecutive counterbalanced unrewarded trials with the C, C + 1 and C + 2, and X. How long mice search for the reward in a novel odor is a measure for how much they confuse the novel odor with the conditioned odor and, hence, assesses memory specificity (Linster and Hasselmo, 1999). Mice were trained after the infusion of saline (Fig. 3A) or the nonspecific α receptor blocker phentalomine into both OBs (Fig. 3B).
Figure 3-1
Data depicted in Figure 3. Download Figure 3-1, XLSX file.
Saline-treated control mice showed higher specificity for the conditioned odor when trained longer: the perceptually most similar odor (C + 1) was discriminated after 8 or 12 training trials, while the less similar (C + 2) and the unrelated odor (X) were discriminated after as few as four trials (Table 5, detailed p values; Fig. 3A). In contrast, mice with NE receptors blocked the needed 12 training trials to discriminate the less similar odor (C + 2) and did not discriminate the most similar odor even after 12 acquisition trials (Table 5, detailed p values; Fig. 3B). Overall, we found a strong effect of test odor (Fodor(3,29) = 14.063; p < 0.0001) as well as a significant interaction among test odor, drug treatment, and the number of conditioning trials (Fodor*drug*#trials(3,58) = 3.023; p = 0.046), with p values for individual data points specified in Table 5. These data show that more training increases memory specificity, as does the presence of NE, supporting our hypothesis that NE can increase learning and therefore lead to effectively more stable memories (Fig. 3, Table 5).
Results from experiment 4 show that specificity for an odor memory increases when NE is unimpaired in the OB and that less training is needed to create highly specific odor memories. When NE is impaired, more training is needed and memory specificity generally decreases. Both the number of trials and the presence of NE modulated the stability of the formed memory, as expressed by its specificity to the learned odor.
Summary of behavioral results
Together, our behavioral data support the idea that the presence of NE allows for odor–reward associations to be more stable, leading to stronger perseverance, longer duration, and more specificity. We manipulate NE in the OB only and observe no direct acute effects of these manipulations. To understand how these seemingly different effects can arise, we propose a simple computational model of NE modulation and plasticity in the OB, which shows that in principle a simple effect of NE can lead to the described long-term behavioral effects.
Olfactory bulb NE enhances synchrony and thereby learning, leading to increased memory stability, duration, and specificity
We have shown multiple long-term behavioral effects of acute NE modulation in the olfactory bulb: more stable memories, longer-lasting memories, and more specific memories. To investigate the underlying neural mechanisms, we use an existing computational model of NE modulation in the OB (Escanilla et al., 2010; Linster et al., 2011; de Almeida et al., 2015; Linster and Escanilla, 2019) to which we added a behavioral readout (RN). We have systematically analyzed the cellular effects of NE in the OB before; here we present a proof of concept model for the prediction that the presence of NE, via α1 receptors, modulates bulbar dynamics and, with it, plasticity, mediating stronger, longer-lasting, and more specific odor–reward associations. Our simplified model (Fig. 4A) implements NE modulation as measured in brain slice experiments (Nai et al., 2009, 2010; Linster et al., 2011; Linster, 2019) and in vivo (Manella et al., 2017), and determined computationally (Escanilla et al., 2010; Linster et al., 2011; de Almeida et al., 2015). Briefly, NE inputs to the OB model acting on α1 receptors enhanced MC excitability and GC spontaneous activity levels, resulting in stronger MC–GC interactions and more pronounced MC spike synchronization (Fig. 4B; quantified in Escanilla et al., 2010; de Almeida et al., 2015). When odor–reward association learning is simulated, synaptic weights between OB mitral cells and the RN increase in an activity-dependent manner. During learning, the RN is activated by a simulated reward signal, and the OB by the conditioned odor. Figure 4C shows the evolution of the maximal synaptic weight in the model during acquisition (conditioned odor plus reward; TB1–TB4) and during the 24 h postacquisition time: when NE is simulated (compared with control mice), weights increase faster and to a higher level than when NE is not simulated (NpHR mice). The maximal synaptic weights acquired with NE in the OB after two trial blocks (10 training trials) in the model are the same than those acquired after four trial blocks (20 training trials) without NE. Synaptic weights between MCs and RN increase faster when NE is implemented because MC spikes are more synchronous, drive the RN, and the plasticity rule better. After acquisition, model parameters are set to No NE, and synaptic weights slowly decline back to baseline. As can be seen in the graph in Figure 4C after a 2 h simulated decay time, both sets of weights are above baseline. However, after a 24 h simulated delay, weights acquired without NE modulation are not different from baseline—forgetting is complete—whereas weights acquired with NE modulation are higher than baseline. RN responses to the conditioned odor increase during acquisition in response to increasing synaptic weights and plateaus after two trial blocks for both NE and No NE simulations, showing that with the parameters chosen here, no strong effect of missing NE during acquisition is observed in RN. RN responses continue to plateau during forgetting in the NE simulations, but not No NE simulations, and after 24 h of simulated forgetting time, the RN response is above baseline when acquired with NE, but not when acquired without NE. These simulations show that in principle a simple effect on dynamics and synchrony in the OB can have no obvious acute effects but lead to substantial long-term effects in the next layers. Obviously, we could choose parameters to have many different outcomes and our model of RN and forgetting are simplified; however, this is a parsimonious proof of concept showing how local and temporary changes in neural dynamics can have multiple profound long-term effects.
Figure 4-1
Data depicted in Figure 4. Download Figure 4-1, XLSX file.
We tested memory odor specificity in the model by presenting odors not used during acquisition, which differed in similarity with the conditioned odor (C + 1; 78% overlap in OSN activation with C, C + 2; 34% overlap in OSN activation and X; −0.42 overlap in OSN activation) and computing the RN response to these novel odors at different time points of the simulation (pretraining; after 1, 2, 3, or four training blocks; and post-training; Fig. 5A). The graph in Figure 5A shows the relative response magnitude of RN to novel odors with respect to the C. Pretraining, synaptic weights are at baseline and RN responds to all odor similarly (Fig. 5B). As training increases, RN responses become more specific: when trained with NE, RN response to all novel odors is much lower than the conditioned odor response after as few as two trial blocks (Fig. 5Ai). In contrast, the specificity of the response decreases more slowly when acquired without NE (Fig. 5Aii), with the response to C + 1 being significantly from C different after four trial blocks only. Overall, these results are similar to behavioral results showing that specificity increases with training and that this increase can be sped up by the presence of NE during learning. Note that these results are from the same simulations discussed above using the same parameters. Learning was simply halted after every training block and odors C, C + 1, C + 2, and X presented to the network and the RN response computed. The evolution of specificity is clear when we observe how the distribution of synaptic weights to RN changes with learning (Fig. 5Bi,Bii). In each case, weight distribution is flat and random before learning. We are depicting a subset of weights organized to be centered at the most responsive glomerulus (to conditioned odor) with less responsive glomeruli flanking on each side. This depiction is for visualization purposes only, there is no such spatial organization in the model. After two training blocks, the distribution of synaptic weights is relatively flat for the No NE case and steeper for the NE case. Training with No NE after four trial blocks results in a similar weight distribution to training with NE after two trial blocks, mirroring the results from Figure 3. These results show that as synaptic weights increase, their distribution is rendered more narrow, resulting in more specific recall relative to the conditioned odor. When NE is not simulated, this process is slowed down. This is a direct result of mitral cell action potentials being more synchronous when NE is simulated as shown in Figure 5, Bi and Bii.
Figure 5-1
Data depicted in Figure 5. Download Figure 5-1, XLSX file.
Overall, our simulations show that a simple network effect resulting in increased synchronization and higher signal-to-noise ratio during learning can have multiple long-term effects on odor–reward association. We created a very simplified model implementing cellular effects of NE meant to provide proof of concept rather than a detailed model of how odor–reward associations would happen.
Discussion
Our experimental results show that the stability of an olfactory memory, measured by its perseverance, duration, and specificity, is decreased when NE release and activity is disturbed in the OB during acquisition. A computational model incorporating known cellular and network effects of NE in the OB (for review, see Linster and Escanilla, 2019) suggests that NE modulation of OB dynamics are mediating these effects. Interestingly we do not observe acute effects of lacking NE activity during learning, but see significant effects of this temporary manipulation long after the manipulation has stopped, which can underlie observations with respect to NE and stress in the OB (Manella et al., 2013) or NE and integration of adult-born neurons into the network (Moreno et al., 2012). We show that when NE is present in the OB during acquisition of an odor–reward association, synaptic weights mediating this association grow rapidly and specifically to the odor used for conditioning. At the end of acquisition, a strong and odor-specific association has been created because of strong synaptic plasticity in response to highly synchronous neural activity. This strong association is more difficult to reverse, takes longer to extinguish, and is more specific to the conditioned odor than an association acquired with NE activity in the olfactory decreased. NE has been shown to have a variety of effects on odor processing in the olfactory bulb, such as modulating odor detection thresholds (Escanilla et al., 2010, 2012; Linster and Escanilla, 2019), changing odor discrimination learning (Doucette et al., 2007; Mandairon et al., 2008), modulating habituation to an odor (Guérin et al., 2008; Manella et al., 2013), and affecting neural activity in response to an odor on a long timescale (Shea et al., 2008). With one exception (Shea et al., 2008), these effects have been measured when NE processes were actively manipulated in the OB. Here, we take advantage of optical techniques to manipulate NE release in the OB during acquisition only. Overall, our behavioral manipulations show that more learning leads to longer and more specific odor memories and that the presence of NE can speed up this process: less learning is needed for similar degrees of stability when NE is present.
Studies in hippocampus have shown that stimulation of LC fibers can result in release of DA as well as NE (Kempadoo et al., 2016; Takeuchi et al., 2016). Our optogenetic inhibition of release from LC fibers can thus result in a decrease of dopaminergic modulation in addition to the expected decrease in NE modulation. This is in contrast to the classic belief that DA is strictly internal to the OB (McLean and Shipley, 1988; Shipley and Ennis, 1996). Dopamine receptors are known to be predominantly located in the input layer of the OB and are known to be regulated by overall sensory activity (Brunjes et al., 1985), adding to olfactory bulb homeostasis. Behaviorally, we and others showed that the activation of D2 receptors locally in the OB can decrease odor detection and discrimination at low odor concentrations. Electrophysiological results show a predominant effect on glomerular layer computations with D1 receptor activation increasing excitation in that layer (Liu, 2020) and D2 receptor activation decreasing odor responses. It is therefore possible that limiting dopaminergic release from LC fibers could contribute to our observations, with the caveat that LC projections terminate predominantly in the deeper layers of OB and are less present in the glomerular layer where DA would be most effective. Behavioral results show that locally activating DA receptors in the OB decreases odor detection Escanilla et al., 2009, whereas local activation of NE receptors increases odor detection. Given that pharmacological blockade of NE receptors replicate the effects of LC fiber inhibition, we are confident that our observations can be largely attributed to the effects of reducing NE with a possible small contribution of DA.
Stress during learning can affect memory duration and stability. For example, when rats were acutely stressed during a simple odor-encoding task, memory for the encoded odor was enhanced (Manella et al., 2013). Interestingly, this enhancement could be mimicked by direct infusion of NE into the first sensory processing network, the OB, and the effect of stress was blocked by application of NE antagonists during the stress phase (Manella et al., 2013). Reversal learning, commonly used to study cognitive flexibility, has been shown to be facilitated by some types of long-term stress (Dong et al., 2013; Thai et al., 2013), suggesting a facilitatory role of NE, as we observe in the present experiments (Zitnik et al., 2016). Directly enhancing LC neural activity can also facilitate reversal learning that is dependent on activity levels (Snyder et al., 2012).
Here we show the long-term effects of a temporary decrease of NE in an early sensory structure influencing or driving downstream plasticity processes. While it is well known that NE modulates plasticity in many ways, in our experiment NE is not manipulated in those networks assumed to undergo plasticity. Rather, temporary changes in dynamics during acquisition create long-term effects downstream. In the computational model, rapid within bulbar plasticity, assumed to be modulated by NE, is not included. Activity-dependent plasticity between mitral and granule cells, for example, would further modulate bulbar dynamics and be additive to the mechanisms proposed here. However, such direct plasticity could be expected to speed up acquisition, which is not a phenomenon that we observe behaviorally in our tasks (possibly because of ceiling effects). We here present the most parsimonious mechanism, without claiming that many processes interact with each other and many of these can be modulated by NE. We simply show proof of concept for the idea that local and temporary changes in a sensory network can have long-lasting effects expressed downstream. Once changes in response to odorants have been established in postbulbar processes (RN in our simplified model), manipulating bulbar dynamics has less of an effect because dynamics, while crucial to plasticity processes, have less of an effect on readout, which can be thought of as a more rate-dependent process.
A previous study showed that NE activity in the amygdala can modulate the stability of fear memories in networks receiving inputs from the amygdala (Haubrich et al., 2020); these results are in good agreement with our present experiments. The present results go beyond what was previously shown to be a role for NE modulation in the OB with respect to signal-to noise modulation and odor discrimination (for review, see Linster and Escanilla, 2019) by showing that these acute processes can have long-lasting effects beyond the time when NE modulates the OB network. Thus, changes in network dynamics as early as the OB can directly influence odor processing and plasticity in higher-order structures leading to more stable memory consolidation. NE-mediated effects of acute stress can outlast the period of stress; here we show that these effects do not always rely on direct modulation of plasticity in the network in question. Our model relies on known and previously modeled cellular effects of NE to create predictions about the long-term behavioral effects we have observed here. As shown before, different aspects of olfactory memory such as duration and specificity covary (Freedman et al., 2013; Hackett et al., 2015) and can be ascribed to a common mechanism rather than evolving independently.
Footnotes
This work was supported by the CNRS, Institut National de la Santé et de la Recherche Médicale, Lyon 1 University (N.M.) and by Collaborative Research in Computational Neuroscience (CRCNS) Grant DC008702 (C.L.). We thank K. Deisseroth for the gift of the halorhodopsin construct; C. Benetollo from the Neurogenetic and Optogenetic Platform of the CRNL for lentiviral production; and G. Froment, D. Nègre, and C. Costa from the lentivector production facility/SFR BioSciences de Lyon (UMS3444/US8).
The authors declare no competing financial interests.
- Correspondence should be addressed to Christiane Linster at CL243{at}cornell.edu