Abstract
Many animals exhibit flexible behaviors that they can adjust to increase reward or avoid harm (learning by positive or aversive reinforcement). But what neural mechanisms allow them to restore their original behavior (motor program) after reinforcement is withdrawn? One possibility is that motor restoration relies on brain areas that have a role in memorization but no role in either motor production or in sensory processing relevant for expressing the behavior and its refinement.
We investigated the role of a higher auditory brain area in the songbird for modifying and restoring the stereotyped adult song. We exposed zebra finches to aversively reinforcing white noise stimuli contingent on the pitch of one of their stereotyped song syllables. In response, birds significantly changed the pitch of that syllable to avoid the aversive reinforcer. After we withdrew reinforcement, birds recovered their original song within a few days. However, we found that large bilateral lesions in the caudal medial nidopallium (NCM, a high auditory area) impaired recovery of the original pitch even several weeks after withdrawal of the reinforcing stimuli. Because NCM lesions spared both successful noise-avoidance behavior and birds' auditory discrimination ability, our results show that NCM is not needed for directed motor changes or for auditory discriminative processing, but is implied in memorizing or recalling the memory of the recent song target.
Introduction
Many forms of sensorimotor learning involve a trial and error process of motor explorations to approximate a sensory target. In the case of vocalizations, motor learning can occur long after the sensory target has last been experienced (Marler and Tamura, 1964; Funabiki and Konishi, 2003), revealing a long-lasting influence of sensory memories on development and maintenance of motor performance. Vocal learners also rely on the ability to process self-generated auditory input, as shown, for instance, by the disruptive effects of deafening or auditory feedback perturbation on human speech (Lane and Webster, 1991; Svirsky et al., 1992; Houde and Jordan, 1998) and birdsong (Nordeen and Nordeen, 1992; Leonardo and Konishi, 1999; Brainard and Doupe, 2000).
Very few animal models are suited for studying auditory-vocal memory. Higher mammals such as nonhuman primates and dogs not only limit their vocal output to innate vocalizations (Hauser et al., 2002), but they also perform poorly in long-term memory tests when the sensory modality is auditory (Fritz et al., 2005; Scott et al., 2012). In contrast, vocal learners such as songbirds rely heavily on audition and perform well in standard auditory memory tasks, even when compared with humans (Zokoll et al., 2008; Comins and Gentner, 2010). To study the neural mechanisms underlying sensory memory and motor plasticity we perform brain lesions in zebra finches and characterize the lesion effects on their ability to selectively change and restore their songs.
We are interested in the memory role of the caudal medial nidopallium (NCM), a secondary auditory brain area that is often compared with human auditory association cortex (Reiner et al., 2004). Suppression of protein synthesis in NCM and neighboring areas in juvenile birds during song tutoring significantly impairs the quality of subsequent song copying (London and Clayton, 2008). NCM lesions in adults do not affect song production but lead to a reduction of behavioral preference for tutor song (Gobes and Bolhuis, 2007). In combination with correlative evidence (Chew et al., 1995, 1996; Bolhuis et al., 2000; Phan et al., 2006; Moorman et al., 2012), these results suggest that NCM may be necessary for storing or retrieving a memory of a song target. However, it remains unknown whether such target memory plays a role in maintaining adult song.
Here we apply an operant conditioning paradigm in which we deliver a loud white noise stimulus to induce adult zebra finches to change the pitch of one of their song syllables away from its normal range (Tumer and Brainard, 2007; Andalman and Fee, 2009; Warren et al., 2011). When we end white noise playback, birds spontaneously restore baseline pitch levels within a few days, thus displaying an intact memory of their baseline pitch. However, when the end of noise playback is coupled with bilateral lesions in NCM, pitch restoration is severely impeded. We show that impaired pitch restoration is not due to a deficit in motor plasticity or severe degradation of auditory processing, thus implicating NCM in recall of a pitch memory used for song maintenance.
Materials and Methods
Subjects and song recordings.
Our study included 19 adult male zebra finches (Taeniopygia guttata) raised in our breeding facility. Birds were 85–409 d old at the beginning of the experiments. Results from 5 additional young (70- to 75-d-old) male zebra finches were included for the assessment of auditory discrimination performance. Birds were housed individually in sound-attenuating recording chambers (IAC; inner size 60 × 60 × 60 cm) on a 14/10 h day/night schedule and were given ad libitum access to food and water. After 2–3 d of acclimatization in the chamber most birds resumed a normal singing rate. Songs were recorded with a wall-attached microphone (Audio-Technica PRO42) at a sampling rate of 32 kHz, and were then used to train a song syllable recognizer (a two-layer neural network), which detected in real time a particular song syllable targeted for pitch reinforcement.
All the experimental procedures were in accordance with the Veterinary Office of the Canton of Zurich.
Pitch reinforcement.
To selectively reinforce certain versions of a target harmonic syllable, we ran a custom LabVIEW (National Instruments) program that continuously analyzed the stream of microphone signal to detect the syllable and calculate its pitch (fundamental frequency): after syllable detection and following a fixed delay, we calculated one value of pitch every 4 ms (each based on the last 16 ms of microphone signal) using an adaptation of the Harmonic Product Spectrum algorithm (Noll, 1970). Three consecutive pitch measurements were taken and were combined to decide whether or not to play white noise (WN) through a loudspeaker (Harman/Kardon HKS 4BQ): When driving pitch up, we played 50 ms WN acoustic stimuli when at least one measurement was below a manually chosen threshold θ. Thus, to escape WN, the minimum p of the three pitch measurements had to be higher than θ. Analogously, when driving pitch down, the maximum pitch value (also referred to as p) had to be smaller than θ. We adjusted the threshold θ on a daily basis before lights on (i.e., before birds begin to sing) to near the middle of the previous day's pitch distribution; thereafter we kept θ constant during the day. The WN amplitude depended linearly on pitch up to 5 Hz below or above threshold; beyond that the WN amplitude was constant (independent of pitch; Fig. 1A). Note: in our birds, the 5 Hz linear range corresponds to approximately half the average SD of the daily pitch distribution. WN stimuli were delivered 12 ms after the third pitch measurement.
We counted the number of false positive and false negative syllable detections, both of which could interfere with pitch reinforcement. We found the false positive detection rate to be negligibly small, averaging 0.35% (range 0.04 to 1.64%, N = 10 birds). False positives were discarded in all pitch analyses. We determined the true positive rate on the last baseline day before WN playback by counting the actual number of target syllables produced by manual syllable clustering and by comparing that number to the number of correctly detected syllables, which yielded a positive detection rate of 99.4% (range 97.2 to 100%, N = 10 birds). Thus, the false negative detection rate was 0.6% on average.
The pitch traces within a syllable never being perfectly flat, our pitch estimates p might be influenced by fluctuations of both syllable detection time and song tempo. We observed nonetheless that such possible influences had a negligible effect on our pitch estimates. We first defined the syllable detection time as the time interval between syllable onset (determined by threshold crossing of the root mean square sound waveform) and the syllable detection event reported by the syllable recognizer. Detection times exhibited a very small jitter. Their average SD was 2.1 ms (range 1.5 to 3.7 ms in N = 10 birds, evaluated on the last day before WN playback; Fig. 1F), which is less than 1 ms above the lower bound dictated by aliasing errors (the latter account for 1.2 ms of jitter given the 4 ms intervals between pitch samples). In comparison, the average SD of target syllable duration was 3.1 ms (range 1.6 to 5.2 ms, N = 10 birds), demonstrating that the detection jitter was considerably smaller than the syllable duration jitter.
During baseline days we found a significant correlation (p < 0.01) between pitch and detection time in only 3 of 10 birds (with Pearson's correlation coefficients −0.22, −0.14, and −0.36, respectively), revealing that in most animals our pitch estimates were robust to changes in detection time and song tempo. Over the course of an experiment, the syllable detection time drifted in individual animals: the d′ value (see below, Data analysis) between syllable detection times during baseline singing and at the end of the experiment (∼1 month later) was in the range −2.5 to 2.5 (N = 10 birds, average 0.4). To test for possible spurious pitch changes due to drifts in detection time, we computed the d′ between baseline pitch and pitch at the end of the experiment in two different ways: with pitch measurements taken either at fixed intervals after detection time (d′dt) or after syllable onset (d′on). We obtained a negligible difference d′dt − d′on within the range −0.14 to 0.26 (N = 10 birds, average difference 0.02). Thus, our findings did not depend on whether we measured pitch in relation to syllable detection or in relation to syllable onset.
Surgery.
Birds received either bilateral NCM lesions or bilateral sham lesions. Before surgery, birds were anesthetized with isoflurane and head fixed in a stereotaxic frame. Their head was set to an angle of 65 degrees from horizontal, measured at the anterior surface of the skull. To perform the lesions, we vertically injected 500 nl of ibotenic acid (Sigma-Aldrich I2765) solution in ddH2O (7 mg/ml) in each hemisphere through a glass pipette (two injections per penetration, one at 1500 μm ventral from the surface of the brain, the other at 2000–2250 μm from the surface). For sham lesions, we similarly injected 500 nl of Ringer's solution in each hemisphere. All injections were made at a lateral distance from the midline of about 500 μm (lesioned birds: average 460 μm, range 340 to 520 μm, N = 6 birds; sham-lesioned birds: average 490 μm, range 350 to 650 μm, N = 4 birds) and about 800 μm anterior of the bifurcation of the mid-sagittal sinus (lesioned birds: average 807 μm, range 650 to 1000 μm, N = 6 birds; sham-lesioned birds: average 773 μm, range 710 to 810 μm, N = 4 birds). The craniotomy was then sealed with Kwik-Cast (World Precision Instruments) and the scalp was closed with veterinary glue (GLUture; Abbott Laboratories). Birds typically resumed singing in the evening or the morning after surgery.
Song restoration.
After surgery, no more WN playbacks were delivered. Based on visual inspection of the spectrograms, the first songs produced after surgery were not visually different from presurgery songs (Fig. 4F), which led us to conclude that lesions did not have major immediate effects on vocal output. We used the daily singing rate to evaluate the birds' well being. The fraction of target syllables sung postsurgery relative to presurgery quickly approached 1 (0.41 on the first recovery day, range 0.04 to 1.53, N = 10 birds; and 0.98 on the second recovery day, range 0.41 to 1.86, N = 10 birds).
The target syllable pitch was monitored on a daily basis and its restoration was assessed relative to pitch on the last baseline day before WN playbacks.
Data analysis.
For any pair of different days i and j, we compared syllable properties such as pitch, entropy, or detection time through the d′ measure that weighs differences between averages by the underlying variances: where μi is the average value on day i and σi2 its variance. In the case of equal variances (σi2 = σj2), d′i,j reports the difference in average between the two days in the convenient unit of SDs.
To determine whether the daily average pitch significantly deviated from baseline pitch, we estimated a pitch significance threshold as follows. We analyzed song recordings from six birds that were not subjected to surgery or pitch reinforcement and that produced a complex syllable similar to the target syllable in Figure 1B. We first estimated the variability of daily average pitch in these birds. That is, we collected the daily average pitch p̄i and SD σi on 7 consecutive days i = 1,…,7. We then computed the mean baseline pitch p̄b = 〈p̄i〉i and subtracted it from the daily average pitch, which after normalization by the SD σi resulted in the following daily pitch z-score: We then considered all z-score values from the six different birds to yield 36 independent z-score estimates (42 estimates minus 6 degrees of freedom from subtraction of the averages). The SD of these z-score estimates was 0.29. To achieve a significant deviation of z-score at the 1% level, the daily z-score must be at least 2.58 times larger than this value, corresponding to 0.29 × 2.58 = 0.75. Thus, the significance threshold for pitch deviations is 0.75: on a day i, a bird significantly changed the average pitch away from baseline when the z-score Zi was higher than 0.75.
To assess significance of pitch difference on a day i relative to the last baseline day b before WN playback, we tested whether the absolute value |d′i,b| exceeded a threshold value of d′θ = 0.75. This measure is therefore an evaluation of significance of pitch change relative to the naturally occurring variability of baseline pitch.
Based on our observations in control birds (see Results), we analyzed pitch restoration in a “late window” from 6 to 20 d from the first day without WN. We quantified the amount of pitch recovery on day i after end of WN by computing the residual pitch Di defined as the normalized difference between mean baseline pitch p̄b and mean daily pitch p̄i on that day, normalized to the difference between p̄b and mean daily pitch p̄0 on the last day with WN: We assessed pitch recovery in the late window in terms of the mean residual pitch across days i and its SD σL. A mean DL near zero indicates recovery of baseline pitch, whereas a small SD σL indicates stable pitch.
To assess pitch stability also in two control birds that were subjected to NCM lesions but not to pitch reinforcement (for these birds Di is undetermined because p̄0 = p̄b), we assessed pitch stability (in the late window) in terms of absolute d′ values. In particular, we computed the late pitch deviation B from baseline by averaging: with i ranging across all days in the late window and b the last baseline day without WN (or the last day before surgery for birds without pitch reinforcement). B close to zero indicates both highly similar and highly stable pitch, because the absolute value guarantees that pitch fluctuations near zero do not average out. In contrast, when B is far from zero, which is what we observe in NCM-lesioned birds, the pitch in the late window could either be dissimilar to baseline pitch or unstable. To test for the latter, we assessed the pitch fluctuation F in the late window by computing the average d′ between all days i and j in the late window:
To assess stability of the produced song after NCM lesions, we analyzed the target syllable structure in terms of Wiener entropy, a measure of “tonality” that ranges from minus infinity (pure tones) to zero (white noise; Tchernichovski et al., 2000). We computed the Wiener Entropy of 16 ms sound snippets in steps of 4 ms using a custom MATLAB (MathWorks) script. For each bird, we first averaged entropy values over the duration of the syllable and then over all syllable renditions on 1 day, shown in Figure 4E.
Auditory discrimination.
To test whether bilateral NCM lesions impaired the birds' hearing, we subjected 8 birds (26–38 d after lesions) to a simple auditory discrimination experiment based on an operant conditioning paradigm presented in Tokarev and Tchernichovski (2014). The experimental setup is described in Figure 4A and B. A male bird (Fig. 4A, in the foreground) was housed individually in a cage containing a small window from which he could see a female zebra finch in a neighboring cage inside the same recording chamber. Near the window we placed a perch that was the only vantage point from which the bird could see the female. The female's presence attracted the bird onto the perch, which was built as a lever that triggered an electric contact as soon as the bird sat on it.
All trials began after the bird had been sitting on the perch and not vocalizing for at least 2–3 s (Fig. 4B, bird-on-perch probability 1). At trial onset (time 0) we randomly (with 50% probability) played one of two different song bouts (from two different zebra finches) through a loudspeaker. One of the songs was always followed by a 1 s air puff starting 3 s after playback onset. The puff tended to blow the bird off the perch (Fig. 4B, sudden drop of the black curve at 3 s), thus providing a mild aversive reinforcement signal (the cage was large enough for the bird not to get hurt). The other song was not paired with an air puff.
After a brief training period (see Results), birds began to increasingly leave the perch before air puff onset to avoid the air puffs. Most importantly, they left the perch more often during the song paired with an air puff than during the other song (Fig. 4C). To evaluate song discrimination in these experiments, we calculated (at 4 ms time resolution) the conditional perch probabilities P(perch|puff) and P(perch|nopuff) that the bird sat on the perch as a function of time since trial onset. We calculated these probabilities both for trials of puffed and of nonpuffed songs (Fig. 4B). We then compared these probabilities using a difference between proportions significance test: in every 4 ms time bin we first calculated the pooled sample proportion as follows:
where npuff and nnopuff are the numbers of trials with and without air puffs, and nperch|puff and nperch|nopuff are the number of puff/no puff trials in which the bird was on the perch during that time bin. The standard error of the sample proportion is then
based on which we calculated the difference (z-score)
with P(perch|puff) =
Histology.
At the end of the experiments, lesioned birds were deeply anesthetized with an overdose of pentobarbital sodium and perfused via the left ventricle with a 4% paraformaldehyde solution (P6148; Sigma-Aldrich). The brains were removed and fixed in the same solution overnight. Sagittal sections (80–100 μm thickness) were cut and Nissl stained with cresyl violet to highlight cell bodies. For each hemisphere, we selected the brain slice showing the largest lesioned surface area and isometrically aligned it to a previously drawn outline of the songbird auditory lobule extracted from a sagittal reference section about 500 μm from the midline (Fig. 2C). As landmarks for alignment we chose the ventricle (V) that separates the nidopallium from the hyperpallium and the lamina mesopallialis (LaM), which separates the nidopallium from the mesopallium. In each aligned brain slice we drew a dot at the approximate center of the lesioned area (Fig. 2C). Injection sites in different birds were then compared based on their positions relative to the brain outline (Fig. 2D). The NCM region is broadly defined as the most posterior part of the medial nidopallium (Fig. 2D, yellow area).
Because lesion damage in experimental birds was assessed only 3 weeks or more after surgery, we evaluated the extent of the lesions at different time points in four additional birds. Lesions were made as described above, and birds were killed 1, 7, 14, and 32 d later. From the bird perfused 1 d after surgery we estimated the initial lesioned volume to extend about 640 μm laterally from the injection site (eight sections each 80 μm thick), and about 430 μm anterior from the injection site (measured in the slice showing the largest lesioned area; Fig. 2A). Over time, possibly due to neurogenesis and cell migration, the lesioned areas gradually shrunk due to the influx of cells with large cell bodies (Figs. 2A,B). Note that the exponential fit in Figure 2B only provides a qualitative description of lesion size development; we are not suggesting a predictive model of lesion size with a fixed time constant. Given this shrinkage, histology from experimental birds could not reveal the original extent of the lesioned regions–only their approximate location.
All measurements were taken with NIH ImageJ.
Results
We subjected adult male zebra finches to a pitch reinforcement paradigm (Tumer and Brainard, 2007; Andalman and Fee, 2009; Warren et al., 2011) in which a WN acoustic stimulus is played back every time the pitch of a target syllable produced is lower (respectively, higher) than a certain threshold value. As a result, birds tend to raise (respectively, lower) the pitch of the target syllable to escape WN. Typically, when WN playback is ceased, birds spontaneously restore baseline pitch levels within the course of several days (Tumer and Brainard, 2007; Warren et al., 2011). Successful song restoration indicates that, during pitch shifting, birds maintain a memory of a target song, which could be either a memory of the tutor song that birds acquired during a sensory-learning phase (Immelmann, 1969; Zann, 1996) or a memory of their recently produced song. Thus, this pitch restoration process enables us to experimentally evaluate the availability of a song memory and/or its involvement in shaping vocal motor output.
We assessed pitch changes using the d′ measure that expresses pitch differences in units of SDs (see Materials and Methods). In each bird, after recording baseline (unperturbed) singing, we reinforced the pitch of a target syllable containing a harmonic stack. We repeatedly adjusted the pitch threshold for WN playbacks to drive highly significant pitch increases/decreases: at the end of the pitch reinforcement phase, the absolute pitch change relative to baseline was on average d′ = 3.3 (range 2.2 to 5.2, N = 10 birds). The pitch reinforcement phase lasted on average 11 d (range 6 to 15 d, N = 10 birds), depending both on the birds' performance and on how often and how much we changed the threshold for WN playback. After pitch had been sufficiently shifted, we ended WN playbacks to release birds from pitch reinforcement. Birds were divided into two groups: a first group that received bilateral neurotoxic NCM lesions and a second group that received bilateral sham lesions. Right after neurotoxic and sham lesions, birds were released from pitch reinforcement.
Sham-lesioned control birds successfully restore baseline pitch
Control birds receiving sham lesions changed their pitch on average by a d′ = 3.9 relative to baseline (range 3.3 to 5.2, N = 4 birds) within on average 10.3 d (range 9 to 11 d). After sham NCM lesions (made via saline injections) they readily recovered their baseline pitch: the d′ to baseline pitch fell below 0.75 (corresponding to our estimated 1% significance threshold level; see Materials and Methods) on average on the sixth day after WN end (range 3 to 9 d, N = 4 birds). In addition, after falling below threshold level, the pitch was stable and did not show large fluctuations (Fig. 3).
Based on these observations we defined an early recovery window (days 1–5 after WN end) within which control birds recovered baseline pitch and a late recovery window (days 6–20 after WN end) within which recovered pitch was on average stable (see also Materials and Methods). To assess the level of pitch recovery in the late window, we computed the average pitch deviation B from pitch on the last baseline day (before WN playback, pitch deviation is reported in terms of absolute d′ values; see Materials and Methods). Across all sham-lesioned birds the pitch deviation to baseline was B = 0.46 ± 0.21 (range 0.25 to 0.72, N = 4 birds), revealing that pitch had recovered within the early window and was then maintained stably during the late window. In these birds, the average pitch fluctuation F (absolute d′ between late recovery days; see Materials and Methods) of daily pitch during the late recovery window was F = 0.27 ± 0.11 (range 0.13 to 0.36, N = 4 birds), revealing that pitch fluctuations were small after pitch had recovered. We obtained similar results when we evaluated the average residual pitch DL across the late recovery window, which is a measure that compensates for differences in the magnitude of reinforced pitch changes. We found that the residual pitch in the late window was on average DL = 0.04 ± 0.12 (range −0.08 to 0.16, N = 4 birds) and had an SD of σL = 0.05 ± 0.02 (range 0.03 to 0.08, N = 4 birds), revealing both small deviation from baseline pitch and small variability (Fig. 3).
NCM lesions impede song restoration
We drove the pitch in birds to receive NCM lesions by on average d′ = 2.9 from baseline pitch (range 2.2 to 4.2, N = 6 birds) within on average 11.3 d (range 6 to 15 d, N = 6 birds). After WN end and bilateral NCM lesions, birds failed to restore pitch as quickly and consistently as control birds did (Fig. 3). Across all NCM-lesioned birds, the average pitch deviation B away from baseline in the late window was B = 1.45 ± 0.64 (range 0.56 to 2.29, N = 6 birds), revealing significantly less recovery than in sham-lesioned birds (Student's t test, p = 0.02). The average pitch fluctuation in the late window was F = 0.70 ± 0.33 (range 0.22 to 1.19, N = 6 birds), revealing significantly larger pitch variability in NCM-lesioned birds compared with controls (Student's t test, p = 0.04). In NCM-lesioned birds the residual pitch DL and its SD σL in the late window were DL = 0.28 ± 0.54 (range −0.54 to 0.85, N = 6 birds) and σL = 0.23 ± 0.14 (range 0.07 to 0.41, N = 6 birds), which is, respectively, 7 and 4.6 times larger than in sham-lesioned birds (Fig. 3). In one bird we monitored pitch until 45 d after surgery, at which time pitch was still far away from baseline (normalized residual pitch Di = −0.55) and was strongly fluctuating.
NCM-lesioned birds tend to revert pitch in the direction of baseline
Despite not restoring baseline pitch like sham-lesioned birds did, each of the six birds with NCM lesions showed a tendency of changing pitch toward baseline (Fig. 3), which argues against a complete loss of target pitch memory. We tested whether the postlesion decrease in daily pitch deviation from baseline was consistent with random pitch drift: across all lesioned birds the median d′ (1.33, N = 90 d′ values) between late-window and baseline pitches was significantly different from the median d′ (2.64, N = 6 d′ values, p = 1.4 × 10−4, Wilcoxon rank-sum test) between pitches on the first recovery day and baseline, revealing that birds significantly changed pitch over the course of recovery.
We also tested whether random walks could explain observed d′ trajectories drifting toward baseline by calculating the probability that six (the number of birds) 20-step (the length of the full recovery window) random walks with equal up and down probabilities in each step remain on the negative side between steps 6 and 20 (the late window). From a sample of 109 sets composed of six random walks each, only a fraction p = 1.7 × 10−4 remained exclusively on the negative side in the “late window,” implying that our results are highly unlikely to be explained by balanced random pitch drift and that post-NCM-lesion paths indeed show nonzero drift toward baseline.
NCM lesions do not destabilize pitch
To test whether the large pitch fluctuations observed after NCM lesions were due to the preceding pitch reinforcement, we made bilateral lesions in two birds that were never pitch reinforced before. After surgery both birds maintained a stable pitch for up to 20 d; the pitch fluctuations F in the late window were F = 0.15 and 0.31, respectively. Also, postlesion pitch was not notably different from prelesion pitch. The average pitch deviation between the day before the lesion and all late recovery days was B = 0.17, respectively, 0.43. Thus, the amount of pitch drift and fluctuations caused by NCM lesions is no larger than that in sham-lesioned (and pitch reinforced) birds. We conclude that NCM lesions alone do not explain the loss of pitch stability; they do so only in a state of reinforced pitch away from baseline.
NCM lesions do not prevent successful escaping from aversive reinforcement
To evaluate the effect of lesions on processing of auditory feedback and motor plasticity, in four birds we tested whether they could escape aversive reinforcement after receiving large bilateral NCM lesions. Before the lesions we briefly verified that birds were able to significantly drive pitch away from baseline. Then, after letting birds restore baseline pitch, we made large bilateral NCM lesions and attempted to reinforce pitch again 3, 10, 19, and 20 d after the lesions. All birds successfully changed pitch in the expected direction (Fig. 4D). After only 4 d of WN playbacks, birds had changed their pitch from baseline to a d′ of 2.66 ± 0.43 (range 2.10 to 3.14, N = 4). This reveals that NCM lesions do not affect birds' ability to selectively change their motor program.
NCM lesions do not affect syllable entropy
Visual inspections of song spectrograms (Fig. 4F) as well as pitch measurements at baseline suggest that NCM lesions have no major impact on song production. We further evaluated the stability of the target syllable by tracking its daily mean Wiener entropy. In case of song degradation we would expect entropy values to become less negative (i.e., syllables become noisier). In lesioned birds the syllable entropy did not consistently increase between the last baseline day before WN and the last day of the late recovery window (Fig. 4E): the d′ between average entropy on these 2 days was in the range −0.50 to 0.94 (mean 0.24 ± 0.51, N = 6 birds, positive d′ values indicating increased entropy, hence noisier syllable). Thus, NCM lesions did not impact the spectral noise of the target syllable that may have interfered with pitch restoration.
NCM lesions do not severely impair hearing
Aversive reinforcement is not well suited to study the ability to process natural auditory stimuli, because escape from WN can be achieved by correlating the presence/absence of WN with motor variability (instead of the auditory feedback thereof). We thus tested whether NCM lesions cause impairments of auditory discrimination that may be underlying the evaluation of pitch feedback during song restoration. We made NCM lesions in eight birds (three adults and five young birds) before subjecting them to an air-puff song discrimination task. In this task, birds are exposed to two auditory stimuli, one of which is always followed by an air puff. Birds that are able to discriminate between the stimuli escape from the perch more often during the air puff-predicting (negatively reinforced) stimulus than during the other stimulus (see Materials and Methods).
All NCM-lesioned birds tested succeeded in showing a significantly higher escape rate during playback of the aversively reinforced songs. The probability that birds stayed on the perch just before the onset of the air puff dropped significantly (p < 0.01; see Materials and Methods) after 1–4 d of practice (mean 2.4, N = 8 birds). We quantified successful escapes by counting the trials in which a bird had left the perch at the moment we delivered the air puff (typically about 300–900 trials per day; see Table 1 for results on the last day of testing).
In conclusion, auditory processing was not significantly impacted in NCM-lesioned animals, suggesting that the impairment of pitch restoration after NCM lesions is not a result of lost auditory processing ability, but rather a remote memory involvement of NCM.
Discussion
We performed large lesions in a high auditory area of the songbird brain and studied their effects on reversing changes in adult song. Lesions impaired the restoration of baseline pitch but had no noticeable effects on learning of an auditory discrimination task. Postlesion songs were indistinguishable from prelesion songs, strengthening previous reports on smaller NCM lesions (about one-fifth of our injected neurotoxin volumes) preserving song stability (Gobes and Bolhuis, 2007). Furthermore, lesions spared successful pitch reinforcement, revealing that a fully intact NCM is not required for mediating selective pitch changes. The lesions we made targeted the most posterior/medial part of the nidopallium, including regions in which stimulus-specific neural response habituation was observed in multiunit recordings (Chew et al., 1995) and in which protein synthesis blockage in juveniles lead to impaired tutor song imitation (London and Clayton, 2008). The mediolateral extent of our lesions was so large that it also covered more lateral parts of NCM where immediate early gene expression correlates with song learning (Bolhuis et al., 2000) and where lesions reduce behavioral preference for tutor song over other songs (Gobes and Bolhuis, 2007). The extent of our applied lesions thus provides strong evidence that NCM has little, if any, role in motor performance and motor plasticity, and possibly also in feedback evaluation. Based on these results, we reject the hypothesis that motor or auditory functions relevant to pitch restoration might be responsible for the observed impairment induced by the lesions. Instead we suggest that the lesions may have damaged neural tissue necessary for either storing a memory of target pitch or for retrieving such a memory.
Our experiments do not allow us to unambiguously identify which properties of memory may have been affected. A common confound, which should be the target of further research, is the often overlooked distinction between memory storage and retrieval (Squire, 2006). Memory performance in humans assessed with forced-choice recognition is good even with probed lists of thousands of items (Standing, 1973), suggesting that recall is limited by the retrieval process rather than actual forgetting (Romani et al., 2013). A similar limitation of recall is also demonstrated by retrogradely amnesic patients who appear to retain memories but struggle to recall them (Lewis et al., 1968). The memory recall deficit in case of retrograde amnesia also has been extensively investigated in animals. Typical such studies require animals to first learn a task before they receive electrical or chemical convulsing shocks to induce retrograde amnesia. After exhibiting worsened performance, animals are given one more shock-free “reminder” trial, which is often sufficient to restore good performance (Lewis et al., 1968; Cherkin, 1972). Hence, one avenue toward testing for impairment of memory recall in songbirds could be to play back the target song to an NCM-lesioned bird that failed to recover the prelesion pitch. If lack of pitch restoration is due to inaccessibility of the target memory but not to loss of “desire” to sing a target, then birds might be able to recognize their target song from the playback and use the retrieved memory for successfully restoring their song. While such experiments could be worth conducting, it is also worth remembering that to date there are no reports that adult songbirds can form new auditory target memories past their critical sensory learning phase (Immelmann, 1969).
Our findings agree with a broad literature that ascribes a role in target song memorization to NCM (Bolhuis and Gahr, 2006; Pinaud and Terleph, 2008; Hahnloser and Kotowicz, 2010). If that target is indeed the tutor song as previously suggested (Terpstra et al., 2004), then we are drawn to conclude that pitch restoration is a sensory-guided song-learning mechanism reminiscent of developmental song learning in young birds. If, instead, NCM holds part of the memory of the bird's own song, then the impaired pitch restoration in NCM-lesioned birds could indicate the extent to which birds rely on a memory of their own song (rather than the tutor song) for pitch restoration. Possibly, NCM lesions in pitch-reinforced isolate (nontutored) birds could help to resolve these contrasting views, though the feasibility of pitch-reinforcing the highly variable songs of isolates (Immelmann, 1969; Fehér et al., 2009) is unclear.
In addition to NCM, song maintenance also relies on a basal ganglia network including the songbird striatal/pallidal nucleus Area X and the lateral magnocellular nucleus of the anterior nidopallium (LMAN), both of which are required for successful pitch reinforcement in adults (Scharff and Nottebohm, 1991; Kao et al., 2005; Ali et al., 2013; Kojima et al., 2013). Because Area X and LMAN-lesioned birds are unable to change pitch to avoid aversive reinforcement, we conclude that NCM is more selectively involved in song maintenance than are Area X and LMAN. The adult role of the latter areas seems to be the mediation of directed song changes in general, whereas the role of NCM seems to be to support song maintenance in particular. For a particular song feature (pitch), we thus identified an anatomical difference between a natural learning process (spontaneous restoration of baseline pitch) and an externally enforced process (pitch shift to escape WN). While both processes rely on LMAN and Area X, only the former requires NCM. Hence, we conclude that reinforcement learning using WN playback should not be seen as a universally valid paradigm (in adults) for probing neural mechanisms of natural song learning (in juveniles).
Our work contributes to an area of lesion research that allowed, for example, the identification of hippocampal and cortical areas in the initial acquisition of memories (Scoville and Milner, 1957; Teng and Squire, 1999; Squire et al., 2001; Tse et al., 2007) and long-term memory storage (Takehara et al., 2003; Steffenach et al., 2005; Clark et al., 2011), respectively. The ability of NCM-lesioned birds to learn a song discrimination task contrasts with these typically dissociated roles of mammalian brain areas in either acquisition or late retrieval of memories. Reconciling these contrasts could be different neural mechanisms involved in memorization of the target song and of other (nontarget) songs. Also, NCM's memory function may undergo changes from the first critical sensory period to adulthood, reflecting architectural requirements for vocal learning and constraints dictated by critical sensory periods.
While to our knowledge there are no lesion reports in humans that directly parallel our findings, there is an interesting difference between the impairments induced by lesions in the songbird NCM and in the human auditory cortex. Lesions of the latter in the right hemisphere decrease the ability to resolve the direction of pitch changes, but not of pitch differences (Johnsrude et al., 2000). In contrast, NCM-lesioned birds showed a tendency to revert pitch in the direction of baseline, despite not fully restoring baseline pitch. Such tendency could be explained by either the partial nature of our NCM lesions or the partial role of NCM in holding or retrieving the pitch target memory. Our preferred interpretation is the latter: a memory outside NCM might drive pitch in the direction of baseline, whereas the memory in NCM would be needed to precisely match the target. In any case, common to both birds and humans is that different aspects of pitch discrepancy (direction versus distance) seem to be processed in distinct brain regions. While human auditory cortex lesions affect the processing of pitch sensory discrepancy held in short-term memory, songbird NCM lesions affect the processing of pitch vocal discrepancies in relation to a long-term memory.
Footnotes
This work was funded by the Swiss National Science Foundation (Grant 31003A_127024) and by the European Research Council under the European Community's Seventh Framework Programme (ERC Grant AdG 268911 FP7/2007-2013). We thank Anna Stepien, Anja Zai, Michael Griesser, and Alexander Hanuschkin for helpful comments on this manuscript and Franziska Baumann-Klausener and Simone Rickauer for technical assistance.
The authors declare no competing financial interests.
- Correspondence should be addressed to Richard H.R. Hahnloser, Institute of Neuroinformatics, University of Zurich and ETH Zurich, 8057 Zurich, Switzerland. rich{at}ini.phys.ethz.ch