Abstract
Habituation is the most basic form of learning, yet many gaps remain in our understanding of its underlying neural mechanisms. We demonstrate that in the owl's optic tectum (OT), a single, low-level, relatively short auditory stimulus is sufficient to induce a significant reduction in the neural response to a stimulus presented up to 60 s later. This type of neural adaptation was absent in neurons from the central nucleus of the inferior colliculus and from the auditory thalamus; however, it was apparent in the OT and the forebrain entopallium. By presenting sequences that alternate between two different auditory stimuli, we show that this long-lasting adaptation is stimulus specific. The response to an odd stimulus in the sequence was not smaller than the response to the same stimulus when it was first in the sequence. Finally, we measured the habituation of reflexive eye movements and show that the behavioral habituation is correlated with the neural adaptation. The finding of a long-lasting specific adaptation in areas related to the gaze control system and not elsewhere suggests its involvement in habituation processes and opens new directions for research on mechanisms of habituation.
Introduction
Habituation is defined as behavioral response decrement that results from repeated stimulation and does not involve response fatigue (Thompson and Spencer, 1966). The distinction between fatigue and habituation is commonly made by demonstrating that the response decrement is specific to the repeated stimulus (Barry, 2009; Rankin et al., 2009). Thus, habituation is a learning process that allows the animal to ignore irrelevant stimuli and focus on novel important stimuli. Habituation is considered the most basic form of learning that exists in all animals and all senses (Thompson, 2009). Despite its importance, many gaps remain in our understanding of the neural mechanisms of habituation (Rankin et al., 2009).
At the neural level, stimulus-specific adaptation (SSA) is a phenomenon that resembles behavioral habituation in several aspects. First, in SSA, the response of a neuron to a stimulus is decreased when the same stimulus is presented repeatedly. But more importantly, the decrease is specific to the stimulus; changing the stimulus (frequency, amplitude, location, etc.) regains the neural response (Sobotka and Ringo, 1994; Ulanovsky et al., 2003; Katz et al., 2006; Reches and Gutfreund, 2008). This phenomenon has recently attracted scientific interest, particularly in the auditory system (Nelken and Ulanovsky, 2007; Anderson et al., 2009; Malmierca et al., 2009; Farley et al., 2010). It has been suggested to play a role in auditory scene analysis, optimal coding, and novelty detection (Ulanovsky et al., 2004). But can it be a neural correlate of behavioral habituation to an auditory stimulus? One major caveat of this hypothesis is the different time scales. Most forms of SSA were demonstrated and studied using interstimulus intervals (ISIs) of up to 2 s (Ulanovsky et al., 2003; Nelken and Ulanovsky, 2007; Reches and Gutfreund, 2008; Anderson et al., 2009; Malmierca et al., 2009; Antunes et al., 2010; Farley et al., 2010). Behavioral habituation, on the other hand, acts at ISIs of tens of seconds to minutes, even for relatively short-duration stimuli (Weinberger et al., 1975; Valentinuzzi and Ferrari, 1997; Bala and Takahashi, 2000; Zimmer, 2006; Dong and Clayton, 2009). It is therefore necessary to identify a form of neural adaptation that maintains a comparable memory trace.
For this purpose, we recorded single-unit and multiunit responses of neurons from the optic tectum (OT, a midbrain gaze control center that is homologous to the superior colliculus of mammals). We report a new form of neural adaptation that has a relatively long memory trace (up to a minute and possibly more) and is stimulus specific. We further show that this form of adaptation is absent in the ascending auditory pathway to the OT and in the primary auditory thalamus, but is apparent, in addition to the OT, in the entopallium (E), a forebrain area connected to the gaze control system (Benowitz and Karten, 1976; Reches and Gutfreund, 2009). We therefore propose that gaze control circuitry, which is involved in the selection of salient stimuli in space (Boehnke and Munoz, 2008; Mysore et al., 2010), is also involved in the selection of salient stimuli in time.
Materials and Methods
Experimental animals.
A total of 15 barn owls (Tyto alba) of either sex were used in this study. All owls were hatched in captivity and raised and kept in a large flying cage equipped with perching spots and nesting boxes. The owls were provided for in accordance with guidelines of the Technion Institutional Animal Care and Use Committee.
Electrophysiological procedures.
The birds were prepared for repeated experiments in a single surgical procedure: the birds were anesthetized with 2% isoflurane in a mixture of nitrous oxide and oxygen (4:5). A craniotomy was performed and a recording chamber was cemented to the skull. At the beginning of each electrophysiological session, the owl was anesthetized briefly with isoflurane (2%) and nitrous oxide in oxygen (4:5). Once anesthetized, the animal was positioned in a stereotaxic apparatus at the center of a double-wall, sound-attenuating booth. The head was fixed and aligned using the pecten oculi as a retinal landmark (Wathey and Pettigrew, 1989). Within the booth, isoflurane was removed and the bird was maintained on a fixed mixture of nitrous oxide and oxygen (4:5). The cover of the craniotomy was removed and a platinum-iridium or tungsten, epoxy-coated electrode (0.5–2 MΩ) was driven into the recording chamber. A Tucker-Davis Technologies System3 and an online spike sorter (MSD, Alpha-Omega) were used to isolate action potentials from a small cluster of neurons (multiunit recording) or single neurons (single-unit recording). Single units were isolated using template-based sorting. At the end of each recording session, the recording chamber was treated with chloramphenicol ointment (5%) and closed. The owl was then returned to its home flying cage.
Targeting of nuclei.
Identification of the location of the recording site was based on stereotaxic coordinates and on the expected physiological properties of each nucleus: the OT was recognized by characteristic bursting activity and spatially restricted visual and auditory receptive fields. Position within the OT was determined based on the location of the visual receptive field (RF). The deep layers of the OT were located beneath the bursty layers and identified based on the characteristic regular firing. The external nucleus of the inferior colliculus (ICX) was targeted by positioning the electrode 2 mm caudal and 2.5 mm medial from the tectal representation of 0° azimuth and +10° elevation relative to the visual axes (tectal reference point). The ICX was identified by broadband frequency tunings and robust mapping of interaural time and level differences (ITD and ILD, respectively). The central nucleus of the inferior colliculus (ICC) was targeted 3.5 mm medial and 3 mm caudal to the same tectal reference point. The ICC was identified by narrowband frequency tunings and tonotopic mapping. The nucleus ovoidalis (nOv) was targeted 1.2 mm rostral and 5 mm medial from the tectal representation of 0° azimuth and 0° elevation relative to the visual axes. In nOv, we searched for robust auditory responses to binaural stimulation, high levels of spontaneous activity, and tonotopic mapping. The E was targeted 2 mm rostral, 0.4 mm lateral, and 6.5–7 mm dorsal from the same tectal reference point. In the E, we recorded in the area in which we have previously identified auditory responses (Reches and Gutfreund, 2009). This area was characterized by a high level of spontaneous bursty activity and strong responses to visual stimuli. The correspondence of the stereotaxic coordinates and the characteristic physiological responses with the desired brain locations were previously confirmed with anatomical reconstructions of the recording sites (Knudsen, 1982; Wagner et al., 1987; Brainard and Knudsen, 1993; Gutfreund et al., 2002; Miller and Knudsen, 2003; Pérez and Peña, 2006; Reches and Gutfreund, 2009; Netser et al., 2010). In all nuclei, the electrode was advanced within the nucleus in 300 μm steps; at each step, basic auditory responses (frequency, ITD, ILD) and visual responsiveness were first measured to make sure that the recording site is within the desired nucleus. All recording sites that were characterized to be within the nucleus were included in the analysis.
Auditory stimulation.
Sounds for stimulation were generated on a computer and emitted through miniature earphones (Knowles ED-1914). The earphones were placed in the center of the ear canal ∼8 mm from the tympanic membrane. The amplitude and phase spectra of the earphones were equalized within ±2 dB and ±2 μs between 2 and 12 kHz. Sound levels were controlled by two independent attenuators (TDT PA5).
Tuning curves were generated by varying a single parameter (ITD, ILD, central frequency, or sound intensity) while holding all other parameters constant. Sounds for tuning curves measurements were broadband (3–10 kHz) or narrowband (1 kHz width), were 100 ms long, and had a rise/fall time of 5 ms. The value of the tested parameter was varied randomly in stimulus sets that were repeated 10 times.
To test adaptation, we used, depending on the paradigm, broadband sounds (3–10 kHz, 400 ms duration, 5 ms rise/fall time), narrowband sounds (1 kHz width, 400 ms duration, 5 ms rise/fall time), or recordings of natural sounds. Natural sounds were chosen from a library of six possible sounds emitted by mice, rats, or owls (see spectrograms and waveforms in Fig. 6). The natural sounds were truncated with 5 ms rise and fall times to a duration of 300 ms. Sound levels of both computer-synthesized and natural sounds were adjusted to ∼20 dB above the site's response threshold (15–30 dB SPL). In all cases except for the E, the sounds were presented at the unit's best ITD and ILD, determined separately for each recording site (see below, Data analysis). E neurons have large contralateral RFs with a slight preference for negative ILDs (Reches and Gutfreund, 2009). Thus, in these experiments the best values were not determined and the ITDs of the sounds were chosen between 40 and 100 μs contralateral ear-leading and the ILDs between 0 and −6 dB (left ear louder). The adaptation was tested by presenting the sounds in regular sequences whose interstimulus intervals (ISIs) and number of stimuli differed depending on the paradigm tested. Each sequence was repeated 11 times. Intersequence intervals (time between the last stimulus in the sequence and the first stimulus in the next sequence) were 1 min and 10 s for the 10 s ISI tests, 1 min and 15 s for the 15 s ISI tests, 2.5 min for the 30 s ISI tests, and 5 min for the 60 s ISI test.
Tracking eye movements.
Eye movements were measured while the owls were positioned in the electrophysiological set-up described above, breathing a mixture of 4:5 N2O/oxygen. The method to evaluate eye movements is described in detail by Netser et al. (2010). Briefly, the owl's right eye was held open by attaching miniature clips to the small feathers on the eyelids. The clips were gently pulled by strings to adjust a sufficient and stable retraction of the eyelid. An infrared (IR)-sensitive digital video camera equipped with a zoom lens was positioned ∼1 m from the owl's head. An IR LED was attached to the camera lens and adjusted manually in each session to optimally capture the IR light reflected from the retina. The camera angle relative to the head was then adjusted so that the lateral edge of the pecten oculi was visible at the center of the pupil (see inset in Fig. 8).The pecten oculi is an apparent retinal landmark (Wathey and Pettigrew, 1989). Once the camera was positioned in place, the door of the booth was closed and video sequences (30 fps) were collected starting 500 ms before the onset of the stimulus and ending 3 s after. Video sequences were analyzed offline. Trials containing eye blinks, nictitating membrane movements, or vigorous pecten movements during the prestimulus time window (500 ms) were discarded from the analysis.
Data analysis.
Neural responses to an acoustic stimulus were quantified as the number of spikes in a given time window after stimulus onset minus the number of spikes during the same amount of time immediately before stimulus onset (baseline activity). The duration of the time window for spike count corresponded to the duration of the stimulus. The width of tuning curves was defined as the range over which responses were >50% of the maximal response; best ITD, ILD, or frequency was the midpoint of this range.
To observe the time course of the response, we generated poststimulus time histograms (PSTHs) with 8 ms time bins. PSTHs were normalized to the maximum bin and averaged across the population. The averaged population PSTHs (Figs. 2⇓⇓–5, 7) were smoothed for display purposes (running average of 3 points).
To analyze the adaptation tests, we first averaged the responses to each basic sequence across sequence repetitions, yielding the average response to a stimulus as a function of the stimulus position in the sequence. The first sequence in each test was omitted from the analysis to avoid possible biases that may result from the first sequence being novel.
To obtain the population response, we normalized the responses from each site to its maximum response and averaged across all recording sites. To test the significance of adaptation, we performed one-tailed t tests to compare the population average response of the first stimulus in the sequence with the average responses to all the remaining stimuli. A Holm–Bonferroni multiple-comparison correction (Holm, 1979) was applied to all tests.
To quantify the SSA effect, we calculated the stimulus index (SI), which is the normalized difference between the responses to the novel appearance and the frequent appearance defined as follows: where N1 is the response to the first (novel) stimulus in the sequence of stimulus 1, F1 is the average response to the last three stimuli in the sequence of stimulus 1, N2 is the response to the first (novel) stimulus in the sequence of stimulus 2, and F2 is the average response to the last three stimuli in the sequence of stimulus 2.
To quantify the tendency of the neuron to respond to a rare stimulus independent of the stimulus itself, we used the neuron index (NI), defined as follows: To quantify the eye movements, we first converted the grayscale video images to binary images. In each video frame, we counted the number of white pixels in a manually defined rectangle that overlapped the pecten edge. This produced a response curve corresponding to the horizontal position of the pecten edge as a function of time (see detailed description and evaluation of the method by Netser et al., 2010). Then the baseline was reduced from the response curve [i.e., subtracting the average pecten position in the prestimulus interval (−500 to 0 ms)]. The eye response magnitude was defined as the sum of the pecten position values during the poststimulus time window (i.e., area under the response curve between 0 and 600 ms). To obtain the population response, we normalized the responses from each experiment to its maximal response and averaged across all experiments. To test the significance of habituation, we performed one-tailed t tests to compare the population average response of the first stimulus in the sequence with the average responses to all the remaining stimuli. A Holm–Bonferroni multiple-comparison correction was applied to all tests.
Results
Long-lasting adaptation in the OT
We recorded the responses of neurons in the deep and intermediate layers of the OT. Each recording site was tested with sequences of identical auditory stimuli at its best ITD and ILD (see Materials and Methods). The ISIs of the sequences were 5, 15, 30, or 60 s. Figure 1 shows an example of a multiunit recording from a single tectal site. In this example, sequences of five auditory stimuli with an ISI of 30 s were presented (2.5 min gap between sequences). It is evident that this recording site tended to respond stronger to the first stimulus in the sequence (upper raster plot) compared to the last (lower raster plot). The tendency to respond less to subsequent stimuli was observed at the population level. The histograms in Figure 2 show the population responses to sequences with different ISIs. The population responses to the first stimuli were significantly larger (t test with Holm–Bonferroni correction, p < 0.05) than the responses to all subsequent stimuli at all ISIs tested (Fig. 2A–D). A single, relatively short (400 ms), close-to-threshold stimulus (see Materials and Methods) was sufficient to induce a significant reduction of the response to a second stimulus presented up to 1 min later (Fig. 2D). Therefore, this neural adaptation was strong, rapid, and long lasting.
Long-lasting adaptation outside the OT
To track the origin of the long-lasting adaptation, we recorded from five nuclei along the auditory pathways: the ICC, the ICX, the OT, the E, and the nOv. In each nucleus, multiple recording sites were isolated (see Materials and Methods) and tested with a simple adaptation test (11 repetitions of the same broadband sound with an ISI of 10 s). The average responses are shown in the histograms in Figure 3. The neural responses in the ICC, which is the source of the auditory inputs to both the primary auditory pathway (thalamofugal pathway) and the midbrain localization pathway (Arthur, 2005), did not show a tendency for adaptation at 10 s ISIs (blue histogram, n = 11). This observation suggests that the long-lasting adaptation reported in the OT develops at higher levels of the auditory pathway. Long-lasting adaptation was also not observed in nOv (red histogram, n = 31), the primary thalamic auditory nucleus. This was in sharp contrast to the results from the OT and the E, which showed clear neural adaptations at long ISIs (purple and brown histograms, n = 16 and n = 14, respectively). The E is the forebrain recipient of the tectofugal pathway (Benowitz and Karten, 1976), which in barn owls carries auditory as well as visual signals (Reches and Gutfreund, 2009). The auditory responses in the E were slower and less phasic compared to the other nuclei (compare PSTHs in Fig. 3). Nevertheless, the adaptation in the E was enhanced compared to all other nuclei (an amplitude attenuation of ∼75% in the E compared to ∼28% in the OT). Significant but mild adaptation at an ISI of 10 s was also revealed in the population of recording sites in the ICX (green histogram, n = 14), a midbrain nucleus that provides the main source of auditory inputs to the OT (Knudsen, 1983).
To further study the long-lasting adaptation in the E, we measured adaptation profiles at ISIs of 30 and 60 s. Figure 4A shows an example of a multiunit recording from a single site in the E. In this example, sequences of five auditory stimuli with an ISI of 30 s were presented (2.5 min gap between sequences). The responses to the first stimulus in the sequence (upper raster plot) were much stronger than the responses to the last stimulus (lower raster plot). As in the OT, significant adaptation was observed at ISIs of 30 and 60 s (Fig. 4B,C). Moreover, the strength of adaptation in the E was enhanced compared to the OT. At 60 s ISI, the attenuation of the responses to subsequent stimuli was 56% in the E and 25% in the OT.
Specificity of long-lasting adaptation
To assess whether the long-lasting adaptation in the OT is stimulus specific, we tested the neurons with an alternating sequence composed of two narrowband stimuli (1 kHz width) that differed in their central frequencies by a gap of 2 kHz. The central frequencies were selected to be equally distant from both sides to the best frequency. This selection procedure ensured that both stimuli were within the response range of the neuron (Reches and Gutfreund, 2008). The ISI was maintained at 10 s throughout the test. The first stimulus was repeated 10 times followed by 10 repetitions of the second stimulus, and so on, alternating between the two. In SSA, we expect the responses to the first presentation of a stimulus in each subsequence of 10 identical stimuli (the novel stimulus) to be stronger than the last. Figure 5A shows the population response (n = 22) to the sequence of one stimulus (in blue) and to the sequence of the other stimulus (in red). Note that this is a cyclical stimulation paradigm; thus, the blue sequence arrives after the red sequence and the red sequence arrives after the blue. The average response to the first appearance of the stimulus was significantly stronger than the average response to all subsequent stimuli, for both frequencies (t test with Holm–Bonferroni correction, p < 0.05). This indicates that the long-lasting adaptation in the OT is specific to the frequency of the stimulus. The stimuli adapted rapidly, reaching a significant reduction following one novel stimulus. The average PSTHs of the responses to the two stimuli when they are novel are compared in Figure 5, B and C, with the average PSTHs of the responses to the same stimuli when they are last in the sequence. The adaptation pattern in this paradigm is comparable in magnitude with the pattern observed in the corresponding simple adaptation paradigm (Fig. 3, purple histogram); ∼30% attenuation of the response in the simple adaptation paradigm compared to ∼35% in the alternating paradigm. This suggests that the adaptation is fully specific without cross-stimulus adaptation, at least under the tested conditions.
To examine the consistency of this effect, we calculated two SIs and a single neuron index (NI) for each site (see Materials and Methods). The scatter plot of the SIs of stimulus 1 versus SIs of stimulus 2 is shown in Figure 5D. A point appearing in the first quadrant (within the two dotted lines) implies stronger responses to the novel appearance of both stimuli. However, any point above the diagonal line implies SSA (Ulanovsky et al., 2003). The majority of points were within the first quadrant and only two points out of the 22 recording sites were below the diagonal line. Note that no apparent difference exists between distributions of the single-unit and multiunit recordings. The distribution of the neural indices is presented in Figure 5E. Evidently, the distribution is shifted to positive values (sign test, p < 0.001), indicating a clear population tendency to respond stronger to novel stimuli.
In a different paradigm, to check the specificity of the adaptation to complex sounds, we used recordings of natural sounds. In each recording site, two sounds were drawn randomly from a library of six sounds. Tectal neurons responded well to all six sounds presented within their receptive fields (Fig. 6). A sequence of eight stimuli was presented with an ISI of 13 s. The intersequence interval (time from last stimulus in a sequence to the first stimulus in the next sequence) was 73 s. The first seven stimuli were identical and the eighth stimulus was different. If the adaptation is specific to the stimulus, we expect to regain the response to the last odd stimulus. As expected, the population responses to stimuli 2–7 were reduced compared to the response to the first stimulus (t test, Holm–Bonferroni correction, p < 0.05). However, the population neural response to the odd stimulus (red bar and red curve in Fig. 7A,B, respectively) was significantly larger than all the responses to stimuli 2–7 (t test, Holm–Bonferroni correction, p < 0.05). Interestingly, the population response to the odd stimulus was also significantly larger than the population response to the first stimulus in the sequence (t test, p < 0.05).
To further assess whether the adaptation is specific even at longer ISIs, we recorded the responses of neurons in the OT to sequences of stimuli with an ISI of 60 s. The intersequence interval in this case was 5 min. To minimize the length of the test, which can become overly long with such ISIs, we used here a sequence of three stimuli, the first two of which were identical and the last was different. The sequence was repeated 11 times. In each test, again, the two types of stimuli were drawn randomly from the same library of natural sounds. The population response (n = 12) to the second stimulus was reduced significantly compared to the first (Fig. 7C). On the other hand, the response to the last, odd stimulus was significantly stronger than the response to the second stimulus and not significantly different from the response to the first stimulus (t test, p < 0.05). This demonstrates SSA to the type of stimulus at an ISI of 60 s. The observation that the response to the odd stimulus was no less than the response to the first stimulus, again, suggests that there is no cross-stimulus adaptation at these relatively long ISIs.
Behavioral habituation of the orienting reflex
One caveat of correlating the neural adaptation with behavioral habituation is that the neural recordings are done under the influence of N2O (see Materials and Methods), while most previous behavioral studies of habituation were not (Weinberger et al., 1975; Valentinuzzi and Ferrari, 1997; Bala and Takahashi, 2000; Zimmer, 2006). Addressing this caveat, we have previously shown in barn owls that the pupil dilation reflex is maintained under the same levels of N2O used here, and moreover, the reflex undergoes habituation to short auditory stimuli (300 ms) presented at an ISI of 12 s (Netser et al., 2010). To extend this previous result to a different behavior, and to strengthen the comparison between the physiological results of this study and behavioral habituation, we characterized habituation of reflexive eye movements. In barn owls, eye movements are limited to ±3° (du Lac and Knudsen, 1990). The role in vision, if any, of these small movements is unknown. Moreover, it is unknown whether the eyes orient in a reflexive manner to auditory stimuli. To examine this, owls were positioned in the stereotaxic apparatus and anesthetized in the same way as for the physiological experiments. Horizontal eye movements were detected based on tracking the pecten oculi edge (arrow in Fig. 8Ainset), an apparent retinal landmark in barn owls (Netser et al., 2010). Two speakers positioned 1 m from the owl were used, one speaker at an azimuth of 80° to the right and another at an azimuth of 80° to the left. In each trial, a sequence of six repetitions of a broadband noise with 300 ms duration was presented from one of the speakers. The ISI was 13 s. The experiment was performed 30 times in each speaker, in four different owls. The average normalized eye responses from all the experiments in which the stimuli were presented on the right are shown in Figure 8A. The first event in the sequence tended to induce a stronger positive response, indicating a rightwards shift of the eye. This respond clearly habituated to the subsequent auditory stimuli. When the sound was generated from the left speaker the response to the first event was negative, indicating leftward motion, and similarly habituated to subsequent stimuli (Fig. 8B). Thus, we show for the first time that the eyes of barn owls reflexively respond to a novel sound by an orienting movement toward its direction. Moreover, this behavioral response is maintained under the low levels of N2O used in the physiological experiments.
To examine the stimulus specificity of the orienting eye reflex, we used the same natural sounds used in the physiological experiments (Fig. 6). Sounds were delivered through earphones with a positive ITD value of +80 μs (right ear leading) at the same sound intensity used in the physiological experiments. In each test, eight stimuli were presented with an ISI of 13 s. The first seven stimuli were identical and the eighth stimulus was different. The types of stimuli were randomly chosen in different tests, as was done in the physiological experiments. The orienting response habituated to repeated stimuli. The average eye response to the first stimulus was significantly larger from the average eye responses to stimuli 2–7 (t test, Holm–Bonferroni correction, p < 0.05). The eye average response to the odd stimulus (red bar in Fig. 8C) was significantly larger than the eye average responses to stimuli 3–7 (t test, Holm–Bonferroni correction p < 0.05). Thus, the habituation was specific to the type of stimulus. The pattern of behavioral habituation resembled qualitatively the pattern of neural adaptation (compare Fig. 8C with Fig. 7A). However, unlike the neural response, the average behavioral response to the last odd stimulus was not significantly above the response to the first stimulus.
Discussion
Comparison with other types of adaptation
SSA has been previously shown in various forms in the auditory system (Nelken and Ulanovsky, 2007; Reches and Gutfreund, 2008; Anderson et al., 2009; Malmierca et al., 2009), but the SSA we are describing here differs in two important aspects. First, most previous forms of SSA have been reported along the primary auditory pathway [inferior colliculus (Pérez-González et al., 2005), the medial geniculate body (Anderson et al., 2009), and the auditory cortex (Ulanovsky et al., 2003; Szymanski et al., 2009)], whereas this form was observed exclusively in the auditory localization pathway (OT) and a related forebrain structure (E). Second, previous forms of SSA have been reported to diminish at ISIs longer than 2 s (Ulanovsky et al., 2003) or have only been studied at ISIs <2 s (Reches and Gutfreund, 2008; von der Behrens et al., 2009; Antunes et al., 2010), whereas the adaptation identified here is active and specific in ISIs as long as 60 s. To the best of our knowledge, a comparable adaptation has not been reported in the auditory system.
Several previous studies reported long-lasting effects of adaptation in the auditory cortex in the time range of tens of seconds to minutes (Weinberger et al., 1975; Condon and Weinberger, 1991; Ulanovsky et al., 2004). However, adaptations were induced by long periods of stimulation in these studies. For example, Condon and Weinberger (1991) used a 1.25 Hz train of 300–500 stimuli (300 ms duration each) to induce specific adaptation that lasted for several minutes. The above-mentioned studies are in agreement with the notion that the dynamics of neural adaptation complies with stimulus duration (Marom, 2010): a short-duration stimulus is expected to induce short-lasting adaptation, and vice versa (Varela et al., 1997; Ulanovsky et al., 2004). This concept is not in line with the adaptation reported here whereby a relatively short stimulus was sufficient to induce long-lasting adaptation of 1 min and possibly more. Therefore, the SSA identified here differs from other known forms of auditory SSA, and is likely to involve different mechanisms and play different roles in auditory perception.
The duration of the memory trace of neural adaptation has direct implications on the behaviors that it can support. In many instances, habituation of behavioral responses has been demonstrated with short stimuli that were separated by intervals of several seconds to minutes (Thompson and Spencer, 1966; Weinberger et al., 1975; Bala and Takahashi, 2000; Zimmer, 2006; Glanzman, 2009). Specifically in barn owls, rapid habituation of the pupil dilation response (PDR) was demonstrated using short, low-intensity auditory stimuli with ISIs of 10–13 s. This habituation of the PDR was mediated by microstimulations in the OT (Netser et al., 2010) and was highly specific to the location or frequency of the stimulus (Bala and Takahashi, 2000; Bala et al., 2003; Spitzer et al., 2003; Bala et al., 2007). Therefore, the finding reported here that specific adaptation in the OT can reach time spans of tens of seconds suggests that this type of neural adaptation is a neural correlate of the behavioral habituation. This hypothesis is strengthened by our demonstration of behavioral habituation of the eye-orienting reflex to the same stimulus and at the same experimental conditions in which long-lasting neural adaptation was measured, thus ruling out possible effects of anesthesia or of specific stimulus conditions. The general pattern of behavioral habituation (Fig. 8C) resembled the pattern of neural adaptation at comparable ISIs (Fig. 7A). Both behavioral and neural responses habituated, and in both, the responses to the odd stimulus were elevated. However, the neural responses to the odd stimulus tended to exceed the neural responses to the first stimulus, while the behavioral responses did not. This difference may indicate that there is no simple linear relationship between the neural responses and the behavior and that other brain regions may also influence the final behavioral response.
Origin of adaptation
The ICX provides the direct auditory input to the OT (Knudsen, 1983). We report small but significant adaptation at 10 s intervals already in the ICX. The same stimulus protocol induced stronger adaptation in the OT and even stronger adaptation in the E. In barn owls, auditory information from the OT travels to the forebrain E through the tectofugal pathway (Reches and Gutfreund, 2009). Therefore, the simplistic interpretation would be that the adaptation originates in the ICX and is then strengthened by further processing in the ascending pathway. However, this interpretation is complicated by the fact that there are multiple loops. The OT projects back to the ICX (Hyde and Knudsen, 2000; Luksch et al., 2000; Gutfreund et al., 2002). The E is likely to form a loop with the optic tectum through the auditory arcopallium (Knudsen et al., 1995; Shimizu and Bowers, 1999), which also sends projections to the ICX (Knudsen et al., 1995). Given these multiple loops, the data in Figure 3 cannot resolve the origin of adaptation. It is possible that the adaptation originates in the OT and projects back to the ICX. Another possibility is that it originates in forebrain networks, giving rise to top-down signals that enhance tectal responses to novel stimuli.
Mechanisms of adaptation
One common model to explain SSA is that the two stimuli activate separate paths to the recorded neuron and that basic adaptation mechanisms (synaptic depression or intrinsic cellular mechanisms) act at levels where the activation is separated (Eytan et al., 2003). An intriguing observation in our study was that the responses to the last odd stimuli were not weaker than the responses to the first stimuli (Fig. 7), suggesting a complete lack of cross-stimulus adaptation at these long ISIs. This implies, according to the model, that the basic adaptation processes must take place at levels where there is no overlap between the neural elements carrying the signals from the two stimuli. Since the frequencies of the natural sounds used in these experiments overlapped substantially (Fig. 6), the results are unlikely to be accounted for only by basic response suppressions at lower, frequency-specific levels. To compute the novelty of complex broadband sounds, a network is required that compares the neural responses to the current stimulus with previous responses based on an integration of information about frequency and amplitude modulation. Integration of high-level acoustic features has been shown to occur at forebrain auditory areas in mammals and songbirds (Doupe, 1997; Romanski and Goldman-Rakic, 2002; Lehongre and Del Negro, 2011). Therefore, it is plausible that circuits in the forebrain compute the novelty of the incoming sound and modulate tectal responses accordingly. In this respect, forebrain–tectal connections have been shown to be able to modulate tectal responses (Winkowski and Knudsen, 2006; Winkowski and Knudsen, 2008).
The gaze control system and habituation
Regardless of the specific mechanism for generating adaptation, a clear distinction can be made between the tectofugal pathway and the thalamofugal pathway. Significant adaptation was not found in nOv, the primary thalamic auditory nucleus, and the first station in the thalamofugal pathway (Proctor and Konishi, 1997). The avian nOv is anatomically divided into shell and core (Durand et al., 1992). The functional significance of this division is unknown. Electrolytic lesions performed at the end of our last experiment were found to be well within the core. Since the core in barn owls is by far larger compared to shell regions (Pérez et al., 2009), it is likely that most of our recording sites in nOv were from the core region. Therefore, we cannot rule out the possibility that long-lasting adaptation of neural responses exits in the lateral or medial shell of nOv. Related to this, recent studies in rodents reported robust SSA in the medial subdivision of the primary auditory thalamus (Anderson et al., 2009; Antunes et al., 2010). However, this thalamic SSA recorded was relatively short latency (up to 2 s), and therefore the lack of long-lasting adaptation in nOv is not in conflict with the findings in mammals.
Recent studies suggest that the avian tectofugal pathway carries information about the saliency of the stimulus from the OT to the forebrain (Marín et al., 2007; Reches and Gutfreund, 2009; Reches et al., 2010). Moreover, the tectofugal pathway is considered to be homologous to the mammalian colliculo-pulvinar-cortical pathway (Karten, 1969; Karten and Shimizu, 1989), which has been suggested to play an important role in the coordination of spatial attention between the SC and the cortex (Robinson and Petersen, 1992; Olshausen et al., 1993; Shipp, 2004). Why it is that adaptation is pronounced specifically in structures related to gaze control and attention (OT and E)?
Substantial evidence supports the notion that the OT (and its mammalian homolog, the superior colliculus) contains the circuitry to represent the saliency of the incoming stimuli, a prerequisite for selective attention (Ingle, 1975; Frost et al., 1981; Basso and Wurtz, 1997; McPeek and Keller, 2002; Khanbabaie et al., 2007; Boehnke and Munoz, 2008; Mysore et al., 2010). An important element in stimulus saliency is its history. Stimuli that differ from previous stimuli (i.e., surprising stimuli) are perceived as being more salient (Posner, 1981; Itti and Koch, 2000). Therefore, we propose that specific adaptation is pronounced in the OT as part of its role in stimulus selection, giving rise to an appealing hypothesis that the neural mechanisms for highlighting a salient stimulus in space overlap with the mechanisms for highlighting a salient stimulus in time.
Footnotes
This work was supported by a grant from the Israel Science Foundation. We thank Prof. Shimon Marom for critical reading of the manuscript.
- Correspondence should be addressed to Yoram Gutfreund, Department of Physiology and Biophysics, The Bruce Rappaport Medical School, The Technion, Haifa 31096, Israel. yoramg{at}tx.technion.ac.il