Abstract
The ability to predict reward promotes animal survival. Both dopamine neurons in the ventral tegmental area and serotonin neurons in the dorsal raphe nucleus (DRN) participate in reward processing. Although the learning effects on dopamine neurons have been extensively characterized, it remains largely unknown how the response of serotonin neurons evolves during learning. Moreover, although stress is known to strongly influence reward-related behavior, we know very little about how stress modulates neuronal reward responses. By monitoring Ca2+ signals during the entire process of Pavlovian conditioning, we here show that learning differentially shapes the response patterns of serotonin neurons and dopamine neurons in mice of either sex. Serotonin neurons gradually develop a slow ramp-up response to the reward-predicting cue, and ultimately remain responsive to the reward, whereas dopamine neurons increase their response to the cue but reduce their response to the reward. For both neuron types, the responses to the cue and the reward depend on reward value, are reversible when the reward is omitted, and are rapidly reinstated by restoring the reward. We also found that stressors including head restraint and fearful context substantially reduce the response strength of both neuron types, to both the cue and the reward. These results reveal the dynamic nature of the reward responses, support the hypothesis that DRN serotonin neurons signal the current likelihood of receiving a net benefit, and suggest that the inhibitory effect of stress on the reward responses of serotonin neurons and dopamine neurons may contribute to stress-induced anhedonia.
SIGNIFICANCE STATEMENT Both serotonin neurons in the dorsal raphe and dopamine neurons in the ventral tegmental area are intimately involved in reward processing. Using long-term fiber photometry of Ca2+ signals from freely behaving mice, we here show that learning produces a ramp-up activation pattern in serotonin neurons that differs from that in dopamine neurons, indicating complementary roles for these two neuron types in reward processing. Moreover, stress treatment substantially reduces the reward responses of both serotonin neurons and dopamine neurons, suggesting a possible physiological basis for stress-induced anhedonia.
Introduction
The ability to predict a reward and to adjust reward responses according to behavioral contexts can enable animals to gain rewards at relatively low costs. Reward processing involves dopamine neurons in the ventral tegmental area (VTA) and serotonin neurons in the dorsal raphe nucleus (DRN; Schultz et al., 1997; Montague et al., 2004; Nakamura, 2013; Hu, 2016; Luo et al., 2016). VTA dopamine neurons encode reward prediction error (Schultz et al., 1997; Montague et al., 2004; Cohen et al., 2012). DRN serotonin [5-hydroxytryptamine (5-HT)] neurons are activated by various primary rewards before learning and by reward-predicting cues after learning (Nakamura et al., 2008; Bromberg-Martin et al., 2010; Miyazaki et al., 2011; Liu et al., 2014; Li et al., 2016; Matias et al., 2017). Unlike dopamine neurons, there is at present no unified theory describing how serotonin neurons contribute to regulating reward-related behaviors (Hayashi et al., 2015; Luo et al., 2016). Previous studies have predominantly focused on serotonin neuronal activity before and after training (Nakamura et al., 2008; Bromberg-Martin et al., 2010; Miyazaki et al., 2011; Li et al., 2016; Matias et al., 2017). It remains unclear how the responses of serotonin neurons evolve during learning. It is also unknown how important reward parameters such as reward values and reward availability may differentially affect serotonin neurons and dopamine neurons. Addressing these questions may help elucidate the potentially different roles of serotonin neurons and dopamine neurons in reward-related behaviors.
Very little is known about how stress can influence the reward responses of serotonin neurons and dopamine neurons. Numerous studies have shown at the behavioral level that acute stressors may change reward sensitivity for humans and animals (Bogdan and Pizzagalli, 2006; Born et al., 2010; Porcelli et al., 2012; Berghorst et al., 2013; Kumar et al., 2014). Stress is often associated with anhedonia (Kendler et al., 1999; Gold and Chrousos, 2002; Hammen, 2005). Moreover, limited ability to adapt to stressful contexts causes defects in emotion control, and may lead to psychological disorders such as depression, anxiety, and post-traumatic stress disorders (Schneiderman et al., 2005; de Kloet et al., 2005; Chrousos, 2009). Although it is important for animals to properly adjust reward responses according to stress levels, how stress modulates reward-associated neuronal activity remains unclear. Stress coping involves dopamine neurons and serotonin neurons (Anisman and Zacharko, 1990; Amat et al., 2005; Bekris et al., 2005; Porcelli et al., 2012; Hollon et al., 2015). Both VTA dopamine neurons and DRN serotonin neurons receive strong inputs from the lateral habenula that respond to stressors (Watabe-Uchida et al., 2012; Pollak Dorocic et al., 2014; Weissbourd et al., 2014; Wang et al., 2017). However, it remains unclear how acute stress modulates the responses of DRN serotonin neurons and VTA dopamine neurons to rewards and to reward-predicting cues.
Here, we used fiber photometry of Ca2+ signals to examine how learning and acute stressors modulate the reward responses of DRN serotonin neurons and VTA dopamine neurons (Chen et al., 2013; Gunaydin et al., 2014; Li et al., 2016). We first tracked the activity patterns of serotonin neurons in freely behaving mice throughout the course of appetitive Pavlovian conditioning. We then compared the reward response pattern of DRN serotonin neurons to that of VTA dopamine neurons. We further examined the effects of reward values, omission, and reinstatement on both neuron populations. We finally studied the effects of acute stressors on the reward responses of DRN serotonin neurons and VTA dopamine neurons. These experiments revealed that learning differentially shapes the response patterns of both neuron types. Moreover, acute stress reduces the extent of activation associated with both rewards and reward-predicting cues. Our results indicate that serotonin neurons and dopamine neurons play complementary roles in reward processing. Moreover, acute stress may negatively affect reward processing by suppressing the responses of both serotonin neurons and dopamine neurons.
Materials and Methods
Subjects
Animal care and use conformed to the institutional guidelines of the National Institute of Biological Sciences, Beijing, as well as the governmental regulations of China. We used adult (8–14 weeks old at the time of surgery) SERT-Cre mice [strain name B6.Cg-Tg(Slc6a4-Cre)ET33Gsat; MMRRC; RRID: MGI_3691580] and DAT-Cre mice [strain name B6.SJL-Slc6a3tm1.1(cre)Bkmn; The Jackson Laboratory; RRID: MGI_3689567] of either sex. Mice were maintained with a 12/12 photoperiod (lights on at 8:00 A.M.) and were housed in groups of five for 6–8 weeks. After surgery, mice were housed individually with a reversed photoperiod (lights off at 8:00 A.M.) for at least 10 d before conducting any further experiments.
Surgery and virus injection
Details for the construction of pAAV-DIO-GCaMP6m plasmid and packing of the AAV vectors have been described previously (Li et al., 2016). To inject the AAV vectors to specific brain regions, mice were anesthetized with pentobarbital (i.p., 80 mg/kg) and mounted to a stereotaxic apparatus. After making a small craniotomy, we slowly injected viral vectors (500 nL at 40 nL/min) into the DRN (coordinates: −5.2/0/2.5 mm; AP/ML/DV) with a 15° caudal-to-rostral angle or the VTA (coordinates: −3.2/0.55/4.6 mm; AP/ML/DV) using a glass pipette controlled by a microsyringe pump (Nanoliter 2000 Injector, WPI). The glass pipette was left in place for 10 additional minutes and then slowly withdrawn. Following AAV-DIO-GCaMP6m virus injection, an optical fiber [230 μm outer diameter (o.d.), 0.37 NA; Shanghai Fiblaser] was placed in a ceramic ferrule and was inserted toward the DRN or VTA. The ceramic ferrule was supported with a skull-penetrating M1 screw and dental acrylic.
We implanted intraoral cheek fistula using a previously described procedure (Hintiryan et al., 2006). Four days after AAV injection, we made an incision in the mouse cheek that was lateral and rostral to the first molar and made another incision in the scalp. We inserted soft a Silastic tube [30 mm in length, 0.30 mm inner diameter (i.d.) and 0.46 mm o.d.; Dow Corning] 2 mm into the oral cavity through the cheek incision site, and then guided the tube out subcutaneously through the scalp incision. An L-shaped 26 gauge (o.d. 0.48 mm) stainless steel tube was connected to the Silastic tube and was imbedded alongside the ceramic ferrule containing the optical fiber. A piece of polyethylene tubing (10 mm in length, 0.4 mm i.d., 1.1 mm o.d.) was fit to the exposed end of the L-shaped steel tube. When the tube system was not being used in an experiment, a plug was inserted to the exposed end of the polyethylene tube to prevent blockage. After the implantation of intraoral cheek fistula, mice were housed individually.
Fiber photometry
The fiber photometry experimental device/setup used here was described in detail previously (Gunaydin et al., 2014; Guo et al., 2015). Briefly, an exciting laser beam from a 488 nm laser (OBIS 488LS, Coherent) was first reflected by a dichroic mirror (MD498, Thorlabs), focused with a 10× objective lens (NA = 0.3; Olympus), and coupled to an optical fiber patch cable (230 μm o.d., NA = 0.37; 2 m long). A patch cable was then connected to the optical cannula implanted in the brain through a mating sleeve. The laser power at the tip of optical fiber was adjusted to 0.03–0.04 mW to minimize bleaching of GCaMP probes. The excited GCaMP fluorescence was transmitted back through the optical fiber and objective lens. After passing the dichroic mirror and a bandpass filter (MF525-39, Thorlabs), the fluorescence signal was converted to an electrical signal by a photomultiplier tube (R3896, Hamamatsu). The electrical signal was then amplified and further low-pass filtered (40 Hz cutoff; Brownlee 440). The analog voltage signals were digitalized at 500 Hz using a Power 1401 digitizer and recorded with Spike2 software (CED).
Behavioral tasks
Appetitive classical conditioning.
For Pavlovian conditioning, mice were deprived of water for 24 h before training. In each of the conditioning session, a mouse was repetitively presented with 100 trials of cue–sucrose “pairs” as follows: a 2 s tone [conditioned stimulus (CS)] followed by a 1 s delay and then the delivery of a 0.5 s sucrose solution [unconditioned stimulus (US), detailed below], with 20–40 s randomized intertrial intervals. Auditory tones were generated with square waveforms (0.5 duty, 4 kHz; digitized at 96 kHz) and delivered via speakers. A peristaltic pump infused the sucrose solution (5% w/v) into the oral cavity through the intraoral cheek fistula at the speed of 20 μl · s−1. The entire conditioning process consisted of at least four and no more than six daily sessions for the SERT-Cre mice and six daily sessions for the DAT-Cre mice. A MATLAB program and an Arduino R3 controller were used to generate the tone signal and the pump trigger signal; a Power 1401 digitizer simultaneously recorded tone timing (100 Hz sampling frequency) and GCaMP signals (500 Hz). At the end of the session, the animal was returned to its home cage and was allowed to access water freely for 5 min. On the day after the final conditioning session, we challenged mice with an omission session in which sucrose was omitted during trials 51–150 of a total of 250 trials.
For training sessions with two reward sizes, mice were exposed to two types of auditory tones, a 2 s 4 kHz period of pure tone or a 2 s period of white noise, that were coupled, respectively, with small and large rewards. The reward sizes were controlled by using two different pump durations (0.5 vs 2.5 s) with the same speed (10 μl · s−1). For both reward sizes, the sucrose was delivered 1 s after the end of either auditory tone. The entire training process consisted of a total of six sessions, each of which consisted of 50 pure tone/small reward pairs and 50 white noise/large reward pairs that were randomly delivered.
Stress.
We used quinine, acute head restraint, and footshock as stressors. To exclude the potential effect(s) of water and energy in the sucrose solution, mice were allowed ad libitum access water and food before tests. For all of the stressor treatments, mice were given 20 normal trials (“A”), 20 stress trials (“B”), and then 20 additional normal trials (“A”) in an ABA design. We first placed a mouse in a chamber and recorded Ca2+ signals during an initial round of 20 trials of random intraoral infusion of a sucrose solution. To examine the effect of quinine addition (stress), we replaced the sucrose solution (5% sucrose) with the mixture of sucrose (5% sucrose) and quinine (10 mm) for the next 20 trials. The final 20 trials used unaltered sucrose solution. To test the effect of acute head restraint, we restrained the mouse by fixing it to a standing platform with a small horizontal bar implanted on its head. The initial 20 trials of these experiments examined the responses of freely moving mice to the random intraoral infusion of sucrose. For the 20 stressed trials, mice were restrained on the standing platform during the sucrose infusion trials. The mouse was then released and returned to the neutral chamber for the final 20 unstressed sucrose infusion trials. To examine the effect of a fearful context on sucrose-evoked Ca2+ responses, we induced stress using footshocks. After 20 trials of recordings in a neutral chamber, the mouse was introduced to an opaque acrylic box with a metal grid floor (footshock context, 25 L × 25 W × 30 H in cm). After giving five random footshocks (0.7 mA, 0.5–1 s) within a 10 min period, we waited ∼10 min and then recorded Ca2+ responses to sucrose for 20 trials. We finally placed the mouse back to the initial chamber for 20 unstressed trials. Similar procedures were performed to examine the effects of stress on cue–reward conditioned responses, with the exception that a mouse was fully conditioned via either 400 or 600 trials of cue–reward coupling. During these cue–reward conditioned response experiments, the tone/reward pairs were presented during the stress sessions.
Aversive classical conditioning.
The entire conditioning session consisted of 100 trials that paired a tone with quinine. We used a 12 kHz pure tone as the conditioned stimulus and quinine solution (10 mm, 10 μl in 0.5 s) as the unconditioned stimulus.
Immunostaining
Mice were deeply anesthetized with an overdose of pentobarbital and then intracardially perfused with 0.9% saline followed by 4% paraformaldehyde in PBS. After cryoprotection with 30% sucrose, mouse brains were sectioned coronally (35 μm thick) with a cryostat (Leica CM1900). For immunohistochemistry, the sections were blocked with 3% BSA in PBS with 0.3% Triton X-100. We incubated brain sections with a rabbit polyclonal antibody against tryptophan hydroxylase 2 (TPH2; 1:400; Millipore, ABN60) or a rabbit polyclonal antibody against TH (1:500; Millipore, AB152) at 4°C overnight. After washing with PBS, the brain sections were incubated with Cy3-conjugated goat anti-rabbit IgG (1:500; Jackson ImmunoResearch) for 1 h at room temperature. PBS-washed sections were then coverslipped with 50% glycerol mounting medium. The stained sections were imaged with a confocal microscope (LSM510 META or Axiovert 200M, Zeiss).
Data analysis and statistical tests
No data were excluded. Photometry data were exported from the Spike2 program as MATLAB .mat files for further analysis. After smoothing the data with a moving average filter (20 ms span), we segmented the data based on behavioral events within individual trials. We derived the fluorescence change values (ΔF/F) by calculating (F − F0)/F0, where F0 is the baseline fluorescence signal averaged over a 1.5 s long control time window; this window was typically commenced 0.5 s before initiating a trigger event (cue or infusion). ΔF/F values are presented as heatmaps or as peri-event plots.
To examine changes in response strengths during the anticipatory phase and the consummatory phase of the reward conditioning, we normalized the response strengths of each individual trial across the appetitive conditioning sessions. For DRN serotonin neurons, we initially calculated the average of the peri-event Ca2+ signals of the first 50 trials in the initial session; we identified the peak value in the average plot, and normalized the response strength of each of the trials to the peak value. For normalization of VTA dopamine neuron data, we used the sum of Ca2+ signals during reward expectation (0–3 s) and reward acquisition (3–5 s) in the peri-event plot. We then averaged the normalized response strength during the anticipatory phase and consummatory phase for every 10 trials. To measure the response strength of both serotonin and dopamine neurons during the consummatory phase, we determined the peak value of the normalized peri-event data during the 2 s time window following reward delivery (3–5 s following the cue onset). To examine the response strength of serotonin neurons during the reward anticipatory phase, we calculated the area under the peri-event plot during the 3 s time window following cue onset (0–3 s following the cue onset) and averaged the value per 10 trials. For dopamine neurons, we normalized the peak response intensity during the anticipatory phase to the sum of the peaks during the anticipatory phase and the consummatory phase. To analyze the discriminability of reward values and their associated cues, we used the receiver operating characteristic (ROC) and calculated the area under the ROC curve (auROC). An auROC value of 0.5 indicates that there was no difference in activity in response to large reward versus small reward.
We used permutation tests to analyze the statistical significance of the event-related fluorescence change (Groppe et al., 2011). We used 5000 permutations for an α level of 0.001 to compare the values of ΔF/F at each time point with the event-related fluorescence (ERF) baseline values. The 99.9% fraction of the baseline ΔF/F values was defined as the upper threshold value, and the 0.1% fraction was defined as the lower threshold. Values above the upper threshold or below the lower threshold were considered significant increase or decrease, respectively. Statistical results were superimposed on the average ERF curve with red and blue lines indicating statistically significant (p < 0.001) increases or decreases, respectively. The duration of activation and inhibition was respectively calculated by summing the total time during which the Ca2+ signals were above the upper-threshold or below the lower threshold.
Results
Pavlovian conditioning differentially shapes the response patterns of DRN serotonin neurons and VTA dopamine neurons
We first investigated how the activity of DRN serotonin neurons changes during learning. We trained mice with Pavlovian conditioning by repetitively presenting an auditory cue (CS) with delayed delivery of a sucrose solution (US; Fig. 1A). Within each trial (100 trials per session), a 2 s tone was followed by a 1 s delay and then direct infusion of sucrose solution (10 μl in 0.5 s) into the oral cavity through a small implanted tube (Fig. 1B). To monitor Ca2+ signals in DRN serotonin neurons, we expressed a genetically encoded Ca2+ indicator, GCaMP6, in serotonin neurons by infusing the Cre-dependent adeno-associated virus AAV-DIO-GCaMP6m into the DRN of SERT-Cre mice (henceforth referred to as SERT-DRN-GCaMP6 mice). This resulted in the highly accurate and efficient expression of GCaMP6 in neurons expressing TPH2, the rate-limiting enzyme for serotonin synthesis in the brain (Zhang et al., 2004; Li et al., 2016; Fig. 1C). We implanted an optical fiber with its tip in the DRN for fiber photometry of GCaMP fluorescence changes in serotonin neuron populations during the entire conditioning process (Fig. 1D and Fig. 1-1A,B). Our previous study demonstrated that the GCaMP fluorescence changes reflect Ca2+ signals rather than any artifact of movement (Li et al., 2016).
Appetitive Pavlovian conditioning differentially shapes the response patterns of DRN serotonin neurons and VTA dopamine neurons. A, The design of the cue–reward classical conditioning task. An auditory tone (2 s) was repetitively paired with sucrose delivery (0.5 s) after a 1 s delay. Each daily session consisted of 100 trials, and each mouse completed four or six sessions. B, Fiber photometry of Ca2+ signals from freely behaving mice engaged in the classical conditioning task. We infused sucrose solution into the oral cavity through cheek fistula as indicated in the schematic. We implanted an optical fiber into the DRN to simultaneously record changes in GCaMP6 fluorescence using fiber photometry. C, Expression of GCaMP6 (Green) in TPH2-immunopositive neurons (red) in the DRN of a SERT-DRN-GCaMP6 mouse. D, Raw traces showing the behavior events of the tone (black), sucrose delivery (blue), and GCaMP6 signals (green) during the conditioning session of day 1 (d1) and day 6 (d6) from a SERT-DRN-GCaMP6 mouse. E, Heatmaps illustrating the GCaMP changes (ΔF/F, %) of DRN serotonin neurons across six daily classical conditioning sessions from a SERT-DRN-GCaMP6 mouse. F, Peri-event plots of the average Ca2+ signals. G, The intensity of the Ca2+ signal during the anticipatory phase of DRN serotonin neurons gradually increased as the number of conditioning sessions increased. We measured the area under the peri-event curve (AUC) between cue onset (0 s) and sucrose delivery (3 s) to represent the response strength in the reward anticipatory phase. H, The response to sucrose remained relatively unchanged over time. Within each session, the peak response intensity following sucrose delivery (3–5 s) was normalized to the sucrose response intensities of the initial 50 trials of the conditioning. I, Heatmaps illustrating the GCaMP changes of VTA dopamine neurons during six daily classical conditioning sessions from a DAT-VTA-GCaMP6 mouse. J, Peri-event plots of the average intensity of Ca2+ signals of dopamine neurons. K, L, The response strength of VTA dopamine neurons to the tone onset (K) and sucrose delivery (L) across conditioning sessions (see Figs. 1-1 and 1-2, respectively). Shaded areas in F–H and J–L indicate SEM (F–H, n = 14 SERT-DRN-GCaMP6 mice; J–L, n = 9 DAT-VTA-GCaMP6 mice). F, G, J–L, Red and blue colors superimposed on the black line indicate significant increases and decreases from the baseline, respectively (F, J, p < 0.001; G, H, K, L, p < 0.05, multivariate permutation tests).
Figure 1-1
Figure 1-2
We analyzed how the changes in the Ca2+ signal patterns related to the timing of the cue and of sucrose delivery for all trials in all of the sessions. We segmented raw traces and aligned the Ca2+ signals to the cue onset (Fig. 1E). We then averaged the signals to plot peri-event time histograms for individual mice, and calculated the mean Ca2+ signals for each conditioning session (n = 14 mice; Fig. 1F and Fig. 1-1B). In the second session, as the number of trials increased, the cue began to elicit an increasing trend in the intensity of Ca2+ signals (“ramp-up” trend); these cue-evoked Ca2+ signals lasted through the delay period and peaked immediately following sucrose infusion. The activation of serotonin neurons during the anticipatory phase (including the tone and the delay period) reached statistical significance following ∼170 trials of training (p < 0.05; permutation test; Fig. 1G). In the subsequent conditioning sessions, the intensity of these cue-evoked ramp-up Ca2+ signals continued to increase (Fig. 1F,G). By the end of the conditioning session, the maximal amplitude of the cue-evoked Ca2+ signals was eventually as high as the amplitude of the sucrose-evoked Ca2+ signals (Fig. 1F and Fig. 1-1C). In contrast to the increase of cue-evoked responses, the Ca2+ signals associated with sucrose infusion remained stable throughout the conditioning process (Fig. 1H). Therefore, learning induces a ramp-up activity pattern in DRN serotonin neurons in response to the CS but does not affect DRN serotonin neuron responses to the US.
It was recently shown that stimulating DRN serotonin neurons slows down animal locomotion (Correia et al., 2017). We analyzed the relationship between the Ca2+ signals and locomotor activity. During the initial phase of the conditioning, mice displayed decreased locomotion upon sucrose delivery. This decrease in locomotion became more pronounced as the conditioning continued; its timing gradually shifted closer to the cue and became stable after three or four conditioning sessions (Fig. 1-1D). The onset of Ca2+ signals after the cue preceded the onset of locomotion decrease for ∼1 s (Fig. 1-1D). This suggests that reward anticipation and acquisition, but not a decrease in locomotor activity, causes the activation of DRN serotonin neurons.
Although the reward response patterns of VTA dopamine neurons have been researched extensively (Schultz et al., 1997; Montague et al., 2004; Matsumoto and Hikosaka, 2009; Cohen et al., 2012; Eshel et al., 2015; Menegas et al., 2017), most studies have taken recordings from head-fixed animals that acquired sucrose by active licking. To enable a direct comparison with the responses of serotonin neurons, we also monitored the Ca2+ response patterns of dopamine neurons in freely behaving mice. Additionally, we delivered sucrose via direct intraoral infusion. After expressing GCaMP6 in VTA dopamine neurons, we recorded the Ca2+ signals of the dopamine neurons from DAT-VTA-GCaMP mice in six daily sessions, each of which consisted of 100 trials (Fig. 1-2). In the initial conditioning session, the auditory cue produced a mild yet statistically significant decrease in Ca2+ signals, whereas sucrose infusion strongly activated the dopamine neurons (Fig. 1I,J). As the conditioning process proceeded, the initially observed cue-evoked inhibition gradually switched, becoming an activation that increased in strength with additional sessions. This cue-evoked activation peaked at ∼0.3 s after cue onset and quickly decayed before the cue ceased (Fig. 1J). In contrast, the response to sucrose infusion (“reward-associated response”) gradually decreased as the conditioning proceeded (Fig. 1I,J,L). At the population level, the cue-evoked excitatory response was significantly above the baseline after ∼110 trials (p < 0.05, permutation test; Fig. 1K), whereas the reward-associated response was significantly below the initial value after ∼290 trials (p < 0.05, permutation test), and was eventually reduced to ∼25% of its original amplitude (Fig. 1J,L).
It was thus clear at the end of the learning process that VTA dopamine neurons differed from DRN serotonin neurons in two key ways. First, the cue evoked a gradually increasing ramp-up Ca2+ signal that lasted throughout the reward anticipatory phase in serotonin neurons, whereas the cue only evoked a brief excitatory response in dopamine neurons. Second, the reward continued to elicit Ca2+ signals in serotonin neurons throughout the learning period, whereas the reward gradually became ineffective for activating dopamine neurons. We noted that our observations of the response patterns of DRN serotonin neurons differed from a recent study which showed that the Ca2+ signals of serotonin neurons in head-fixed mice transiently increased following the reward-predicting cue and decreased following sucrose licking (Matias et al., 2017). Moreover, this recent study reported that serotonin neurons were activated when a fully predicted reward was omitted.
We investigated how changes in reward availability affected the activation patterns of DRN serotonin neurons in freely behaving mice. Following six sessions of Pavlovian conditioning, the mice were tested with an “omission session” during which we presented cue–reward pairs in the initial 50 trials, omitted sucrose from the pair in the following 100 trials, and reinstated sucrose in the final 100 trials (Fig. 2A). In <20 trials, reward omission completely abolished the excitatory responses of DRN serotonin neurons during both the reward anticipatory phase (0–3 s; Fig. 2B) and the previous reward consumption phase (3–5 s; Fig. 2C). Following reward reinstatement, the sucrose-evoked responses in DRN serotonin neurons returned immediately (Fig. 2C); the cue-evoked responses recovered fully within 20 trials, which was much more quickly than the number of trials (∼170) that were required to establish the cue responses during the initial conditioning phase (Fig. 2B). In VTA dopamine neurons, reward omission decreased cue-evoked responses as was seen in DRN serotonin neurons, but it took ∼60 trials for this decrease to reach statistical significance (Fig. 2E). Possibly because the population GCaMP signals were not strong enough to report a mild decrease in spiking frequency, we did not observe the inhibitory responses that have previously been associated with negative prediction error as caused by reward omission (Hollerman and Schultz, 1998). When the sucrose was reintroduced, the responses of VTA dopamine neurons to sucrose increased dramatically for ∼10 trials and then quickly decreased to the pre-omission level (Fig. 2D,F, arrows). The recovery of the cue-evoked response in VTA dopamine neurons following reward reinstatement required ∼20 trials and was thus much slower than the recovery that we had observed in DRN serotonin neurons (Fig. 2E). Therefore, both DRN serotonin neurons and VTA dopamine neurons adapt their responses according to reward availability. Serotonin neurons rapidly and faithfully tracked reward availability, while VTA dopamine neurons respond strongly to unexpected reward availability.
The effects of reward omission and reinstatement on the responses of serotonin neurons and dopamine neurons. A, A heatmap showing the Ca2+ signals of DRN serotonin neurons in a mouse challenged with reward omission. We presented the CS for all of the trials, but omitted US during trials 51–150. B, The effect of reward omission on the activity of serotonin neurons during the anticipatory phase (n = 7 SERT-DRN-GCaMP6 mice). Each point in the line plot represents the average value of the normalized area under the curve (AUC) between cue onset (0 s) and sucrose delivery (3 s) for 10 trials. C, The effect of reward omission on the activity of serotonin neurons during the reward consumption phase (n = 7 SERT-DRN-GCaMP6 mice). D–F, The effect of reward omission on the activity of VTA dopamine neurons (n = 9 DAT-VTA-GCaMP6 mice). We defined response strength in E and F as the peak value normalized to the average peak value of the initial 50 trials during the first session. Shaded areas in B, C, E, and F indicate SEM. Red and blue colors in B, C, E, and F indicate significant increases and decreases from the baseline, respectively (p < 0.05, multivariate permutation test).
Serotonin neurons and dopamine neurons respond in a reward value-dependent manner
Previously, we showed that quinine itself does not significantly inhibit DRN serotonin neurons (Li et al., 2016). However, it remains to be tested whether adding the bitter tastant quinine to the sucrose solutions would affect the responses of DRN serotonin neurons to sucrose. In a single quinine session of 60 trials, we recorded the responses to sucrose (5%) during the initial 20 trials. We then mixed quinine with sucrose solution while maintaining the sucrose concentration for the next 20 trials. In the final 20 trials, we delivered sucrose solution without quinine. Adding quinine into sucrose solution completely and reversibly abolished sucrose-evoked Ca2+ signals (n = 6 mice; Fig. 3A–C; F(2,15) = 10.33, p = 0.0017, evaluated via repeated-measures one-way ANOVA with “aversive stimuli” as the within-subject factor). Adding quinine into sucrose solution also completely abolished the sucrose-associated Ca2+ signals of VTA dopamine neurons (n = 7 mice; Fig. 3D–F; F(2,18) = 12.29, p = 0.0003, repeated-measures one-way ANOVA with aversive stimuli as the within-subject factor). Although quinine itself has a mildly inhibitory effect on the baseline activity of both serotonin neurons and dopamine neurons, coupling a cue to quinine resulted in a statistically significant decrease in the Ca2+ signals during the cue (Figs. 3-1 and 3-2, respectively), demonstrating the aversive nature of quinine. Thus, aversive stimuli reduce the reward value for both DRN serotonin neurons and VTA dopamine neurons.
The effects of reward values on the responses of serotonin neurons and dopamine neurons. A–C, The effects of adding quinine on the sucrose-evoked Ca2+ signal intensity of DRN serotonin neurons. The heatmap in A illustrates the Ca2+ signals from a mouse across all trials. The sucrose concentration was held at 5% for 60 trials; quinine (10 mm) was added during trials 21–40. The peri-event plots in B show the average Ca2+ signals to sucrose from the animal shown in A before, during, and after the quinine trials. Data are aligned to pump onset for liquid delivery. Population-level data in C show the effect of mixing quinine with sucrose on the response of DRN serotonin neurons (n = 6 SERT-DRN-GCaMP6 mice). Response amplitudes were normalized to those before quinine addition for each individual mouse. D–F, Heatmap representation (D) and peri-event plot of Ca2+ signals (E) from a representative mouse and population data (F, n = 7 DAT-VTA-GCaMP6 mice) showing the effects of adding quinine on the sucrose response of VTA dopamine neurons. G, Heatmaps illustrating the GCaMP changes (ΔF/F, %) of DRN serotonin neurons to small (left) and large (right) rewards from a SERT-DRN-GCaMP6 mouse during the day 6 conditioning session. The trials of the two reward sizes were intermixed in pseudorandom order and sorted for illustration purposes. H, Peri-event plots of the average Ca2+ signals of DRN serotonin neurons to the small (black) and large (red) rewards. (n = 8 SERT-DRN-GCaMP6 mice). I, J, Heatmaps (I) and average Ca2+ signals of VTA dopamine neurons (J) during the day 6 conditioning session (n = 7 DAT-VTA-GCaMP6 mice). Error bars in C and F indicate SEM. *p < 0.05; **p < 0.01; ***p < 0.001; n.s., not significant; multiple comparisons after repeated-measures one-way ANOVA. Shaded areas in B, E, H, and J indicate SEM. Red and blue colors in B and E indicate significant increases and decreases from the baseline, respectively (p < 0.05, multivariate permutation tests). See Figures 3-1, 3-2, and 3-3, respectively.
Figure 3-1
Figure 3-2
Figure 3-3
We examined how reward values affect the neuronal activity of serotonin neurons and dopamine neurons during Pavlovian conditioning. We coupled two auditory cues [a 4 kHz pure tone (CS1), white noise (CS2)], respectively to a small reward and a large reward [0.5 s (US1) or 2.5 s (US2) of sucrose infusion]. Both SERT-DRN-GCaMP6 mice and DAT-VTA-GCaMP6 mice underwent six conditioning sessions, each of which consisted of 50 CS1–US1 pairs and 50 CS2–US2 pairs that were presented in a pseudorandom order. For both serotonin neurons and dopamine neurons, the larger reward produced stronger Ca2+ signals and conditioned stronger activation by CS2 (Fig. 3G–J and Fig. 3-3A–H). For serotonin neurons, the two cues predicting different reward sizes produced size-dependent responses after only two training sessions (Fig. 3G,H and Fig. 3-3A–D,I,J). For dopamine neurons, in contrast, the intensity of the activation only grew over time for the cue predicting the large reward; whereas the response strength of dopamine neurons to the cue predicting the small reward did not change (Fig. 3I,J, and Fig. 3-3E–H,K,L). Therefore, the particular reward sizes exert differential impacts on the serotonin neuron responses versus dopamine neuron responses. During the learning process, the response strengths of serotonin neurons more faithfully track reward values, whereas dopamine neurons prefer a larger reward.
Acute stressors suppress the responses of serotonin neurons and dopamine neurons to rewards and reward-predicting cues
We used two separate stressor treatments to study how acute stress affects the reward responses of DRN serotonin neurons and VTA dopamine neurons in mice that had not undergone Pavlovian conditioning. We challenged mice with a head restraint, which, unlike the quinine stressor, was not perfectly aligned with reward delivery in time. In a single stress session, we recorded sucrose-evoked Ca2+ signals for 20 trials in freely behaving mice, in 20 additional trials during which mice were head-restrained, and finally in 20 trials in freely behaving mice that had been released from the head restraint (Fig. 4A). For serotonin neurons, head restraint reduced the average intensity of the sucrose response by nearly 50% (n = 7 mice; F(2,18) = 11.14, p = 0.0012, repeated-measures one-way ANOVA with “stress” as the within-subject factor), and this decrease was fully reversible upon release (Fig. 4C).
Acute stress decreases the reward response intensity of DRN serotonin neurons. A, A heatmap showing the effect of acutely restraining on the Ca2+ signal intensity of DRN serotonin neurons. The same amount of sucrose was delivered for 60 trials, and the mouse was head-restrained during trials 21–40. B, Peri-event plot of the average Ca2+ signals to sucrose from the animal shown in A before, during, and after the trials with head restraint. Data are aligned to pump onset for liquid delivery. C, Population-level data showing the effect of head restraint on the response of DRN serotonin neurons (n = 7 SERT-DRN-GCaMP6 mice). Response amplitudes were normalized to those before head restraint for each individual mouse. D–F, The effects of a fearful context on the sucrose response of DRN serotonin neurons (n = 11 SERT-DRN-GCaMP6 mice). A total of 60 trials were recorded. After initial 20 trials of recording, a mouse was moved to a new chamber and given five random footshocks ∼10 min before the recording of 20 trials recording sucrose responses. The mouse was then returned to the initial recording chamber and a final 20 trials were recorded. Error bars in C and F indicate SEM. *p < 0.05; **p < 0.01; n.s., not significant; multiple comparisons after repeated-measures one-way ANOVA. Shaded areas in B and E indicate SEM. Red and blue colors in B and E indicate significant increases and decreases from the baseline, respectively (p < 0.05, multivariate permutation tests). See Figure 4-1.
Figure 4-1
A stress experience that occurs some period before reward delivery might also have an effect on the reward responses of neurons. To explore this idea, we devised experiments in which we observed differences in the Ca2+ signals between normal and fearful context. We first recorded sucrose-evoked responses of serotonin neurons for 20 trials in a “normal” recording chamber. To create a fearful context, we placed the mouse in a new chamber, randomly delivered five footshocks (1 s, 0.7 mA), and after 10 min recorded sucrose responses for 20 trials. Finally, we returned the mouse back to the initial normal chamber and recorded responses for 20 normal sucrose delivery trials. The intensity of the sucrose-evoked Ca2+ signals was reversibly reduced by ∼30% in the chamber associated with prior footshocks (n = 11 mice; Fig. 4D–F; F(2,30) = 14.36, p = 0.0002, repeated-measures one-way ANOVA with stress as the within-subject factor). Thus, prior stress can strongly decrease the reward responses of DRN serotonin neurons. Consistent with the reduction in Ca2+ signals, introducing mice into the fearful context disrupted the locomotor response pattern associated with sucrose delivery (Fig. 4-1).
Stressors also suppressed the sucrose-evoked Ca2+ signals of VTA dopamine neurons. Head restraint reduced the sucrose-evoked responses of dopamine neurons by >90% (n = 6 mice; Fig. 5A–C; F(2,15) = 9.333, p = 0.0055, repeated-measures one-way ANOVA with stress as the within-subject factor). For the chamber associated with prior footshock, the intensity of the responses of dopamine neurons was reduced by approximately one-third (n = 7 mice; Fig. 5D–F; F(2,18) = 8.857, p = 0.0084, repeated-measures one-way ANOVA with stress as the within-subject factor), suggesting that a fearful context can also decrease the reward responses of VTA dopamine neurons.
Acute stress decreases the reward response intensity of VTA dopamine neurons. A–C, The effect of acute head-restraint on the sucrose response of VTA dopamine neurons (n = 6 mice). D–F, The effect of placing an animal in a fearful context on the sucrose response of VTA dopamine neurons (n = 7 mice). Error bars in C and F indicate SEM. *p < 0.05; **p < 0.01; n.s., not significant; multiple comparisons after repeated-measures one-way ANOVA. Shaded areas in B and E indicate SEM. Red and blue colors in B and E indicate significant increases and decreases from the baseline, respectively (p < 0.05, multivariate permutation tests).
We further investigated how stress associated with head restraint and a fearful context would affect neuronal responses to reward-predicting cues. Mice completed multiple sessions (100 trials per session) of Pavlovian conditioning to establish strong associations between a cue and a large reward (2.5 s sucrose delivery). For both DRN serotonin neurons and VTA dopamine neurons, acute head restraint and fearful context reversibly reduced the intensity of responses to the cue by ∼70% and 50%, respectively (n = 6 mice; Fig. 6A–C; F(2,15) = 11.14, p = 0.012; n = 6 mice; Fig. 6D–F; F(2,15) = 8.333, p = 0.012; n = 6 mice; Fig. 7A–C; F(2,15) = 9.333, p = 0.0055; n = 8 mice; Fig. 7D–F; F(2,21) = 9.75, p = 0.0048; repeated-measures one-way ANOVA with stress as the within-subject factor). Consistent with the reduction in Ca2+ signals, introducing mice into the fearful context disrupted the locomotor response pattern associated with the cue–sucrose stimulus pairs (Fig. 6-1). Therefore, stress reduces the intensity of the responses of both DRN serotonin neurons and VTA dopamine neurons and affects the pattern of conditioned responses to reward-predicting cues at the behavioral level.
Acute stress decreases the conditioned responses of DRN serotonin to reward-predicting cues. A–C, The effect of head restraint on the conditioned responses of DRN serotonin neurons to a cue that was associated with sucrose delivery. A, Heatmap representation of Ca2+ signals; (B) average plot of Ca2+ signals before, during, and after the trials with head restraint; (C) population data (n = 7 SERT-DRN-GCaMP6 mice). D–F, Heatmap (D) and average Ca2+ signals (E) of a mouse and population data (F) showing the effect of a fearful context on the responses of DRN serotonin neurons to reward-predicting cues (n = 6 mice). Error bars in C and F indicate SEM. **p < 0.01; n.s., not significant; multiple comparisons after repeated-measures one-way ANOVA. Shaded areas in B and E indicate SEM. Red and blue colors in B and E indicate significant increases and decreases from the baseline, respectively (p < 0.05, multivariate permutation tests). See Figure 6-1.
Figure 6-1
Acute stress decreases the conditioned responses of VTA dopamine neurons to reward-predicting cues. A–C, Heatmap (A) and peri-event plot of Ca2+ signals (B) of a representative mouse and population data (C) showing the effect of head restraint on the responses of VTA dopamine neurons to reward-associated cues (n = 6 DAT-VTA-GCaMP6 mice). D–F, Heatmap (D) and peri-event plot of Ca2+ signals (E) of a mouse and population data (F) showing the effect of fearful context on the responses of VTA dopamine neurons to reward-predicting cues (n = 8 mice). Error bars in E and F indicate SEM. **p < 0.01; n.s., not significant; multiple comparisons after repeated-measures one-way ANOVA. Shaded areas in B and E indicate SEM. Red and blue colors in B and E indicate significant increases and decreases from the baseline, respectively (p < 0.05, multivariate permutation tests).
Discussion
Using fiber photometry of Ca2+ signals, we examined how learning and acute stress affect the responses of DRN 5-HT neurons and VTA dopamine neurons of mice to both reward and reward-predicting cues over a long experimental period. We first compared how DRN serotonin neurons and VTA dopamine neurons respond differently during the entire process of appetitive Pavlovian conditioning. Important reward parameters such as reward value and availability determined the response intensity and the response pattern for both of the neuron populations. By applying various aversive stimuli, we further revealed that stress suppresses the response intensity of both serotonin neurons and dopamine neurons to rewards and to reward-predicting cues. Our findings indicate that reward responses adapt dynamically during reward associative learning and stressful contexts, suggesting a role of serotonin neurons and dopamine neurons in mediating the effects of stress on reward-related behaviors.
Learning gradually shapes the reward responses of serotonin neurons and dopamine neurons into different patterns
By tracking the neuronal activity throughout the entire process of Pavlovian conditioning (up to 600 trials over 6 d), we were able to observe several aspects of how learning shapes the response patterns of DRN serotonin neurons and VTA dopamine neurons. Initially, both types of neurons respond to primary rewards. After ∼170 trials of Pavlovian conditioning, the response pattern of serotonin neurons became clearly different from that of dopamine neurons. Most prominently, for DRN serotonin neurons, the process of Pavlovian conditioning gradually leads to increasingly stronger ramp-up of Ca2+ signals following the onset of the reward-predicting cue (CS); the peak intensity of activation eventually coincided with the delivery of the primary reward (US). In contrast, for dopamine neurons, Pavlovian conditioning leads to a Ca2+ response to the CS, but a substantially weaker Ca2+ response to the US (Fig. 1). Moreover, reward availability has different effects on serotonin neurons and dopamine neurons. Reward omission reduces the responses of both serotonin neurons and dopamine neurons. For serotonin neurons, reward reinstatement leads to a gradual recovery in the previously established ramp-up response pattern. For dopamine neurons, reward reinstatement produces a dramatically stronger activation when an omitted reward is redelivered (Fig. 2). Testing with two reward sizes suggests that serotonin neurons more faithfully represent the “absolute” value of a reward, whereas dopamine neurons represent the “relative” value of the reward (Fig. 3).
Our observation of distinct response patterns suggests that serotonin neurons and dopamine neurons perform different functions in reward processing. Our observations support the theory that transient dopamine levels encode information about the difference between an actual reward and an expected reward. As proposed by the temporal difference-lambda learning model, the dopamine signals may serve as a “credit” that can be assigned to distant states and actions in a manner that is inversely proportional to time difference (Schultz et al., 1997; Montague et al., 2004; Cohen et al., 2012; Eshel et al., 2015; Sadacca et al., 2016). This reward credit may in turn drive the changes in associative strength between neurons encoding the CS (cue) and those encoding the US (primary reward; Rescorla and Wagner, 1972; Sutton, 1988; Schultz et al., 1997; Montague et al., 2004). However, the information carried by dopamine neurons after learning is limited in two major ways. First, they cannot respond positively to the acquisition of an expected reward, although receiving such a reward remains pleasurable for an animal. Second, dopamine neurons respond transiently to a reward-predicting cue, so their activity cannot continuously guide reward-related behavior during the delay between the CS and US. After learning, unlike dopamine neurons, serotonin neurons continue to respond to the primary reward. This suggests that serotonin neurons contribute to the accurate evaluation of the acquisition status of a reward. Moreover, the ramp-up activation of serotonin neurons during the reward anticipatory phase might be important for promoting patience during a “waiting to obtain a reward” period (Miyazaki et al., 2011, 2014; Fonseca et al., 2015). However, during Pavlovian conditioning, larger reward sizes produce stronger Ca2+ signals during the waiting period, even though the duration of waiting period remains the same. This suggests that the activity of serotonin neurons encodes information beyond waiting duration, possibly including information related to attention allocation or related to the confidence level that an organism ascribes to the probability of receiving a reward in a given state (Pearce and Hall, 1980; Cardinal et al., 2002; Pearce and Mackintosh, 2010; Luo et al., 2016; Matias et al., 2017).
The response patterns of DRN serotonin neurons revealed in this study differ substantially from those reported in a recent study (Matias et al., 2017). First, the Matias et al. (2017) study showed that after conditioning DRN serotonin neurons are transiently activated by reward-predicting cues and lack responses to a fully predicted reward, whereas we found that these neurons exhibit a ramp-up activation pattern during the reward anticipatory phase and remain responsive to the primary reward. Second, they showed that the serotonin neurons were activated when a fully predicted reward was omitted, whereas we found that reward omission gradually reduced the response of these neurons to the reward-predicting cue and rapidly abolished the response during the period of reward omission. After learning, the ramp-up pattern of serotonin neurons shown in our study resembles the pattern of tonic spike firing by putative serotonin neurons in the DRN noted in several previous reports (Nakamura et al., 2008; Bromberg-Martin et al., 2010; Miyazaki et al., 2011; Liu et al., 2014; Li et al., 2016). It should be noted that single-unit recordings showed that a small number of putative serotonergic neurons in the DRN prefer small rewards or are inhibited by rewards (Nakamura et al., 2008; Li et al., 2016). Fiber photometry only detects the activity of populations of cells and may thus be inadequately sensitive to detect minor response patterns. The Ca2+ signal patterns of VTA dopamine neurons revealed by our fiber photometry technique are highly similar to the spike firing patterns and Ca2+ signal patterns reported for the VTA dopamine neurons in primates and rodents (Schultz et al., 1997; Montague et al., 2004; Nakamura et al., 2008; Cohen et al., 2012; Eshel et al., 2015; Menegas et al., 2017). These similarities support the validity of our fiber photometry approach. Therefore, our findings do not fully agree with the suggestion that DRN serotonin neurons encode “unsigned” (both positive and negative) reward prediction error (Matias et al., 2017). We note that Matias et al. (2017) recorded from chronically head-fixed mice, whereas we recorded from freely behaving mice. It will be interesting in future studies to investigate whether the difference between these two recording conditions results in different response patterns.
Stressors reduce the reward responses of DRN serotonin neurons and VTA dopamine neurons
A key finding of the present study is that various stressors can substantially suppress the response intensities of both DRN serotonin neurons and VTA dopamine neurons to a primary reward and to reward-predicting cues. Mixing the sucrose reward with quinine completely abolished the sucrose-evoked Ca2+ signals of both serotonin neurons and dopamine neurons. For both types of neurons, head restraint or the fear context associated with prior footshocks substantially reduced sucrose-associated Ca2+ signals. After an animal learned to associate a cue with a reward, head restraint and fearful context also significantly reduced the excitatory responses to the reward-predicting cue. These stressors involve different sensory modalities, suggesting that stress generally suppresses the responses of serotonin neurons and dopamine neurons to reward and to reward-predicting cues.
An important implication from our study may be the indication that serotonin neurons and dopamine neurons function to integrate information about reward value and cost. Our data do not support the notion that 5-HT neurons at the population level positively encode aversiveness. It has been reported that airpuffs briefly activate many 5-HT neurons in the DRN of head-fixed mice (Cohen et al., 2015). It remains unclear whether the airpuff-evoked responses are associated with the aversive nature of airpuff or with its strong sensory saliency (Horvitz et al., 1997; Horvitz, 2000). Our previous study showed that various rewarding stimuli activate DRN serotonin neurons, whereas quinine and footshock induce mild reductions in the intensity of population-level Ca2+ signals (Li et al., 2016). Here, we further showed that coupling a cue to quinine resulted in reduction of Ca2+ signals from the baseline. More importantly, we demonstrated that different stressors inhibited the reward-associated Ca2+ signals of serotonin neurons. These findings support our recent argument that the activity levels of serotonin neurons effectively correspond to “beneficialness”, how beneficial the current environmental state represents to an animal (Luo et al., 2016). The beneficialness is determined by “net benefit”, which can be calculated as the reward values minus costs. The circuit underlying this calculation remains to be dissected, although the lateral habenula (LHb) represents an obvious source for the stress (i.e., cost) signals. LHb neurons are activated by stressors and, via GABAergic relays, inhibit DRN serotonin neurons and VTA dopamine neurons (Ji and Shepard, 2007; Li et al., 2013; Proulx et al., 2014; Sego et al., 2014; Hu, 2016; Wang et al., 2017; Zhou et al., 2017).
Our study revealed that learning and reward parameters exert different effects on the responses of DRN serotonin neurons and VTA dopamine neurons and suggests that these two neuron populations perform complementary roles in reward processing. Moreover, we demonstrated that stress substantially reduces the reward responses of DRN serotonin neurons and VTA dopamine neurons. Anhedonia is a key symptom of depression, and stress represents one of the most reliable predictors of depression in humans (Kendler et al., 1999; Gold and Chrousos, 2002; Hammen, 2005). The negative effect of stress on reward responses provides evidence to support the association of acute stress and anhedonia (Bogdan and Pizzagalli, 2006; Chrousos, 2009; Hollon et al., 2015), suggesting that some reduction in the reward responses of both serotonin neurons and dopamine neurons might underlie stress-induced anhedonia. Enhancing the reward sensitivity of these two important neuron populations might ameliorate psychiatric disorders associated with stress-induced anhedonia.
Footnotes
This work was supported by China MOST (2012YQ03026005, 2013ZX0950910, 2015BAI08B02), NNSFC (91432114, 91632302), and the Beijing Municipal Government to M.L.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Minmin Luo, National Institute of Biological Sciences, #7 Science Park Road, Zhongguancun Life Science Park, Beijing 102206, China. luominmin{at}nibs.ac.cn