Abstract
All sonic vertebrates face the problem of sound production interfering with their ability to detect and process external acoustic signals, including conspecific vocalizations. Direct efferent inputs to the inner ear of all vertebrates, and the lateral line system of some aquatic vertebrates, represent a potential mechanism to adjust peripheral sensitivity during sound production. We recorded from single efferent neurons that innervate the inner ear and lateral line in a sound-producing teleost fish while evoking fictive vocalizations predictive of the temporal features of natural vocalizations. The majority of efferent neurons showed an increase in activity that occurred in-phase with modulations in the fine temporal structure of the fictive vocalizations. Many of these neurons also showed a decrease in activity at fictive vocal offset. Efferents to the sacculus, the main auditory end organ, showed features especially well adapted for maintaining sensitivity to external acoustic signals during sound production. These included robust phase locking of efferent activity to each cycle of a fictive vocalization and a long-duration rebound suppression after each fictive vocalization that could provide a rapid, long-lasting period of sensitization to external acoustic stimuli such as the call of a conspecific. These results suggest that efferent activation by the vocal motor system can directly modulate auditory sensitivity to self-generated sounds and maintain sensitivity to ongoing external sounds. Given the conserved organization of the auditory efferent system across vertebrates, such mechanisms may be operative among all sonic vertebrates.
Introduction
Sound-producing animals are faced with the challenge of stimulating their own auditory systems with the signals they produce, a phenomenon known generally as sensory reafference (von Holst and Mittelstaedt, 1950). Reafferent signals potentially interfere with the reception of external sounds by either masking these sounds during, or reducing the sensitivity of the auditory system immediately after, sound production. Filtering and removal of these signals at one or more levels of the auditory system is therefore crucial to maintain sensitivity to external sounds (Poulet and Hedwig, 2002 and references therein).
One mechanism for adjusting peripheral auditory sensitivity in vertebrates is the middle ear response. During either sound production or stimulation by intense external sounds, contraction of middle ear muscles limits movement of the middle ear bones, which reduces the transmission of acoustic energy to sensory hair cells in the cochlea (Grassi et al., 1988; Yost, 2000). Although effective, the middle ear response represents a highly derived adaptation with structural requirements (i.e., middle ear bones and muscles) that preclude it from being a gain-control mechanism across vertebrates. In contrast, all vertebrates possess central efferent neurons that modulate the activity of the peripheral auditory system (Roberts and Meredith, 1992), thereby providing a common mechanism by which the vocal motor system could adjust inner ear sensitivity during sound production.
Male plainfin midshipman fish, Porichthys notatus, produce vocalizations during courtship and agonistic interactions that potentially interfere with the sender's ability to detect either conspecific vocalizations or other salient acoustic signals (for review, see Bass et al., 1999). Anatomical studies have shown a connection between the vocal motor system and an efferent nucleus in the rostral medulla that innervates inner ear end organs and mechanosensory lateral line receptor organs (Bass et al., 1994). Given the encoding of vocal parameters by both the auditory (for review, see Bass and McKibben, 2003) and lateral line (Weeg and Bass, 2002) systems in midshipman fish, we hypothesized that the efferent system is activated during sound production, leading to changes in the neuronal activity of both inner ear and lateral line primary afferents. Midshipman and the closely related toadfish are particularly well suited to address this issue because fictive vocalizations, i.e., rhythmic vocal motor output predictive of the temporal features of natural vocalizations, are readily elicited (Bass and Baker, 1990), and the efferent neurons project to their target end organs in bundles that are anatomically distinct from the primary afferent nerves, thereby providing a pure population of efferent fibers for neuronal recordings (Highstein and Baker, 1986).
We report, to our knowledge, the first demonstration of central vocal motor-induced modulation of efferents to the lateral line and inner ear that parallels the fine temporal structure of vocalizations. This includes efferents to the primary auditory end organ, the sacculus (Popper and Fay, 1999; Bass and McKibben, 2003). Given the similarities in the structure and operation of the auditory systems of teleosts and other vertebrates (Popper and Fay, 1999), it seems likely that such mechanisms are also operative among other sonic tetrapods that need to solve sensory reafference problems.
Materials and Methods
Midshipman fish have two adult male reproductive morphs known as type I and type II males (Bass and McKibben, 2003). We focused our studies on type I males because of their dynamic vocal repertoire; they alone acoustically court females and generate agonistic growls and grunt trains during the breeding season, whereas type II males and females infrequently produce isolated grunts during nonreproductive contexts. Nineteen type I male midshipman (35.0-76.2 g; standard length, 13.7-17.9 cm) were used in this study. Animals were collected from nest sites in Tomales Bay, CA, or by trawl in Monterey Bay, CA, and were maintained in artificial seawater tanks at 12-16°C on a regular diet of goldfish. All experiments were conducted under the guidelines of the Cornell University Institutional Animal Care and Use Committee and the National Institutes of Health.
Before surgery, fish were anesthetized by immersion in 0.025% ethyl p-amino benzoate (Sigma, St. Louis, MO) in artificial seawater. The cranium was entered dorsally to expose the midbrain, branches of the auditory nerve, and the ventral occipital nerve roots that innervate sonic muscles attached to the swim bladder (Bass et al., 1994). Excess fluid was blotted from the cranial cavity and replaced with a clear, inert fluid (Fluorinert; 3M, St. Paul, MN). Fish were then transferred to a 32-cm-diameter tank filled with artificial seawater (12-16°C) and placed onto a lucite head holder. A tube inserted into the fish's mouth provided recirculated seawater across the gills. Animals were given intramuscular injections of fentanyl (1.0 mg/kg body weight; Sigma) for analgesia and pancuronium bromide (0.5 mg/kg body weight; Astra Pharmaceutical Products, Westborough, MA) for immobilization, which rendered both sonic and nonsonic muscles inactive during the course of the experiments. The tank rested on a vibration isolation table inside an acoustic isolation chamber (Industrial Acoustics, New York, NY).
Insulated tungsten electrodes (125 μm diameter; 20 μm exposed tips; 5MΩ impedance; A-M Systems, Carlsborg, WA) delivered eight electrical pulses (0.1 ms pulse duration; 300 Hz) to localize vocally active sites within the midbrain. Fictive vocalizations were monitored using paired silver ball electrodes (50-100 μm diameter) insulated with Teflon except at the tips. One electrode was placed intracranially on a ventral occipital nerve root, and the other was placed in the musculature surrounding the cranium. The synchronized output of sonic motor neurons, the sonic motor volley (Fig. 1 A), was amplified, filtered through 300-5000 Hz (A-M Systems 1700), and digitized (AD1, System II; Tucker-Davis Technologies, Alachua, FL) onto a personal computer running commercial data acquisition software (Brainware; Tucker-Davis Technologies). Preliminary experiments suggested that electrical stimulation of the vocal motor system caused a change in efferent activity that was independent of vocal output. To avoid this confounding effect, the tungsten electrode was replaced with a glass micropipette (tip diameter ∼20 μm) filled with 0.1 m glutamate (l-glutamic acid; Sigma) in teleost Ringer's solution (pH 8.0) once a vocally active site was found. Glutamate injections did not cause changes in efferent activity independent of vocal output (see Results).
Extracellular potentials were recorded from single efferent neurons using glass micropipettes filled with 4 m NaCl. Electrodes were placed on the surface of the nerve and advanced using an hydraulic microdrive (David Kopf Instruments, Tujunga, CA). Signals were preamplified using a custom built preamplifier, amplified and filtered through 300-3000 Hz (P15; Grass Instruments, West Warwick, RI), and digitized (AD1; Tucker-Davis Technologies) onto a personal computer running commercial data acquisition software (Brainware; Tucker-Davis Technologies). Visually identified action potentials were discriminated using a window discriminator within Brainware. After isolation of a single efferent neuron, glutamate was picospritzed into the vocally active midbrain site using 10-500 ms, 20-40 psi air pulses (1050 ms interpulse interval) until fictive vocalizations began. Glutamate delivery was then stopped, and fictive vocalizations typically continued for several minutes. Ongoing efferent activity was collected until fictive vocal activity stopped or recordings became unstable. Recordings of efferent and fictive vocal activity were synchronized in time so that the temporal relationship between the two could be examined (Fig. 1 B).
At the end of each experiment, midbrain stimulation sites were marked by either electrolytic lesions or iontophoretic injection (+3.0 μA, 2 min) of 10% dextran tetramethylrhodamine (10 kDa; Molecular Probes, Eugene, OR) in 0.1 m PBS, 10% Alexa Fluor 546 (10 kDa; Molecular Probes) in 0.1 m PBS, or 5% Neurobiotin (Vector Laboratories, Burlingame, CA) in 3 m KCl. Fish were killed either immediately (lesions and fluorescent tracer injections) or after an 8 h survival period (biotin injections) by transcardial perfusion of heparinized teleost Ringer's solution, followed by 4% paraformaldehyde in 0.1 m phosphate buffer (PB). Glutaraldehyde (1%) was added to the fixative for brains injected with biotin. Brains were removed, postfixed for 1 h in fixative, and stored in 0.1 m PB. Brains were placed in 30% sucrose in 0.1 m PB overnight and sectioned frozen (50 μm) on a sliding microtome in the transverse plane. Biotin-injected brains were processed to visualize the label (Weeg and Bass, 2000).
Either 5% Neurobiotin or 10% dextran biotin (10 kDa; Molecular Probes) in 3 m KCl (+3.0 μA, 3-10 min) was iontophoresed into the fiber bundles from which we recorded to verify that they contained the axons of central efferent neurons. After survival times of 8-15 h, fish were perfused with heparinized teleost Ringer's solution, followed by 4% paraformaldehyde/1% glutaraldehyde in 0.1 m PB. Brains were then sectioned frozen and processed to visualize the label (Weeg and Bass, 2000). We iontophoresed either 10% dextran tetramethylrhodamine or 10% Alexa Fluor 546 in 0.1 m PBS into the efferent fiber bundles to identify their targets. After 6 h survival, fish were perfused with heparinized teleost Ringer's solution, followed by 4% paraformaldehyde in 0.1 m PB. All inner ear end organ nerves and sensory epithelia, along with the anterior and posterior lateral line nerves, were removed and cleared through an ascending series of glycerin.
To analyze the temporal relationship between fictive vocalizations and efferent activity, we constructed spike density functions (SDFs) (Paulin, 1992; Szucs, 1998) by convolving each efferent and compound vocal action potential with a Gaussian window function (Fig. 1C). We used SDFs because straight binning caused aliasing inconsistencies in spike rate estimation and the times at which the rates changed (for a more general discussion of the advantages of using SDFs for spike train analysis, see Szucs, 1998). Because we were interested in the detailed relationship of efferent spike times relative to the onset time of fictive vocalizations, determining fictive vocal onset time was important. There was a tradeoff in determining the width of the Gaussian to use. A wide Gaussian makes a very smooth estimate of the spike rate but obscures the rise time estimate when spike rate changes suddenly at the start of a fictive vocalization. A narrow Gaussian does not adequately smooth across spikes to allow reliable detection of a burst. To automatically detect the onset of fictive vocalizations, we chose a Gaussian with a width that dropped the average change in SDF between adjacent vocal spikes to ∼10% of the peak SDF (Fig. 1 D). This choice gave a time resolution of a few milliseconds. The start time of a fictive vocalization was then identified at the 50% rise point of the SDF. Once the fictive vocal start time was determined, all efferent spike times could be measured with respect to the fictive vocal start time. The fictive vocalizations were then aligned at the onset and averaged, and the efferent activity associated with each fictive vocalization was averaged (Fig. 1 E). This resulted in a single SDF containing the average fictive vocal and efferent activity, which provided an estimate of the average efferent activity at the onset of the fictive vocalizations. Alternatively, we could detect the 50% point on the falling edge of a fictive vocalization to obtain the ending time and analyze the average efferent activity at fictive vocal offset. With the estimate of onset and offset times in hand, we could then make new SDFs with higher time resolution (by decreasing the width of the Gaussian function) to more closely examine the temporal relationship between efferent spikes and compound vocal spikes.
Because of the constraints of the experimental paradigm, we were unable to demonstrate a change in primary afferent activity to test the hypothesis that vocal-induced changes in efferent activity affects auditory and lateral line sensitivity. As explained above, we administered paralyzing doses of pancuronium bromide to prevent both the fish from moving and the sonic muscles from contracting during vocal motor volleys. The former is necessary to obtain stable neuronal recordings, whereas the latter is necessary to ensure that the efferent effect did not result from auditory feedback. As an acetylcholine antagonist, however, the levels of pancuronium bromide necessary to achieve immobilization apparently blocked the cholinergic receptors on the hair cells and/or primary afferents, thus eliminating the efferent influence on primary afferent activity (Furukawa, 1981). Attempts to conduct these experiments on anesthetized animals were unsuccessful, because anesthetic severely compromises both vocal and auditory activity in midshipman fish (M. Kittelberger, J. Sisneros, and A. Bass, unpublished observations).
Results
Stimulation sites were localized within the caudal midbrain vocal-acoustic complex (mVAC), ventral to the torus semicircularis (Fig. 2A); biotin injections labeled the descending vocal system in a pattern consistent with that reported previously (Goodson and Bass, 2002). Glutamate injections into this area elicited a rhythmic motor output, the fictive vocalization (Fig. 1A; the duration of each fictive vocalization is indicated by the gray boxes in Figs. 3A1-C1, 4) (see also Fig. 6A1,B1), with temporal properties (discharge frequency/repetition rate and duration) that closely resembled those of grunts recorded from wild populations of midshipman fish (Bass et al., 1999). Because the animals were immobilized during the experiments, the sonic muscles were inactive; thus, stimulation of mVAC activates the descending vocal motor network without the production of sound (see Materials and Methods). Two bundles of efferent neurons were anatomically distinct from the primary afferent nerves and readily accessible for labeling and neural recordings. Biotin injections into either bundle retrogradely labeled cell bodies in the hindbrain efferent nucleus (Fig. 2B). Rhodamine and Alexa Fluor injections into the caudal efferent bundle indicated its primary targets to be the lagena and posterior lateral line neuromasts, with a lighter projection to the posterior semicircular canal. This pattern of caudal efferent bundle targets is similar to that seen in the closely related toadfish (Highstein and Baker, 1986). The second bundle coursed along the dorsal surface of the saccular afferent nerve, distal to the point at which the saccular nerve joined the main trunk of the auditory nerve to enter the brainstem. Reliable injections into this bundle without spread of tracer into the underlying saccular afferent nerve were difficult, but, based on its position, we are confident that these efferents, like those in toadfish (Highstein and Baker, 1986), project exclusively to the sacculus (for identification of saccular efferent neurons in midshipman after labeling of eighth nerve branches to the sacculus, see Bass et al., 1994). The majority of recordings (59 neurons) were made from the caudal efferent bundle, with additional recordings (24 neurons) made from the saccular efferent bundle.
Caudal efferent bundle neurons
The majority of caudal efferent bundle neurons (51 of 59) showed a change in ongoing activity during the production of fictive vocalizations, with the remaining neurons showing no change in activity. In all cases (including recordings from saccular efferent neurons, see below), neurons only showed a systematic change in activity with glutamate injections that elicited fictive vocalizations. Most of the neurons that showed a response exhibited an increase in activity that occurred near the onset of each fictive vocalization (Fig. 3A1,B1), although a subset also showed a decrease in activity near the termination of each fictive vocalization (Fig. 3B1). The remaining responsive neurons showed an overall decrease in activity during fictive vocalizations (Fig. 3C1).
To more thoroughly examine and quantify the nature of the change in efferent activity during fictive vocalizations, we constructed SDFs (see Materials and Methods and Fig. 1) consisting of the mean fictive vocalization activity and mean efferent activity relative to the onset or offset of fictive vocalizations for each neuron (Fig. 3A2,B2,C2). The efferent response was considered to be excitatory if the amplitude of the efferent SDF during fictive vocalization was >2 SDs above the mean efferent SDF before fictive vocal onset and suppressive if <2 SDs below the mean efferent SDF before fictive vocal onset. Rebound suppression was defined as an efferent SDF at fictive vocalization offset that was <2 SDs below the mean efferent SDF after fictive vocal offset.
Using these criteria, the majority (43 of 59) of caudal efferent bundle neurons showed an increase in activity during fictive vocalizations that reached a maximum just after (Table 1, Peak) fictive vocal onset (Fig. 3A2). The duration of excitation was variable (Table 1, Duration) and generally shorter than the mean duration of fictive vocalizations (82.3 ± 5.8, mean ± SE; n = 43). A subset of excitatory neurons (18 of 43) also exhibited a decrease in activity that was correlated with the offset of fictive vocal activity (Fig. 3B2). On average, this rebound suppression was much shorter in duration than the initial excitation (Table 1) and occurred after the termination of each fictive grunt.
A smaller number of neurons (8 of 59) showed suppression, that is, an overall decrease in activity during fictive vocalizations (Fig. 3C2). This change in activity was distinguishable from the rebound suppression after excitation because it was more robustly correlated with the onset, rather than the offset, of fictive vocal activity and occurred during (Fig. 3C2), rather than after, fictive vocalizations (Table 1). As with excitation, the decrease in activity was shorter in duration than the fictive vocalizations but was not restricted to the beginning of fictive vocal activity. In addition, all but two suppressive neurons showed brief excitation immediately preceding the suppression (Fig. 3C2, onset). This excitatory activity was shorter in duration and less robust than the suppressive activity (preexcitation) (Table 1). This is in contrast to the excitatory neurons, in which the excitation was longer and more robust than the rebound suppression.
We constructed plots of instantaneous spike rate (1/interspike interval) for efferent neurons and measured the value of maximum and minimum spike rate (Fig. 4 shows representative examples for each class of neurons). For excitatory neurons, we also measured the time of maximum spike rate relative to fictive vocalization onset and minimum spike rate relative to fictive vocal offset for those neurons showing rebound suppression. For suppressive neurons, we measured the timing of minimum spike rate relative to fictive vocal onset. Excitatory neurons (Fig. 4A,B) had a mean ± SE peak rate of 54.9 ± 5.8 spikes/s that occurred at a latency of 24.4 ± 3.6 ms after fictive vocal onset (these values include both excitation and excitation/suppression neurons). The average amplitude of rebound suppression for the excitation/suppression neurons (Fig. 4B) was 16.0 ± 2.4 spikes/s at a latency of 134.8 ± 29.0 ms after fictive vocal offset. Suppressive neurons (Fig. 4C) had an average amplitude of 12.8 ± 1.9 spikes/s, which occurred at a latency of 160.4 ± 13.4 ms after fictive vocal onset.
By increasing the temporal resolution of SDFs (decreasing the width of the Gaussian convolution), we were able to examine the fine temporal properties of the efferent response. More than one-half (22 of 43) of the excitatory efferent neurons (includes neurons from both the excitation and excitation/suppression classes) regularly fired action potentials that were tightly phase locked to the individual spikes of the fictive vocalization motor volley (Fig. 5A). Efferent action potentials lagged the spikes of the fictive vocal motor volley in most (14 of 22) of these neurons (i.e., efferent activity began after the first spike in the vocal motor volley), with latencies ranging from 1.0 ± 0.71 to 7.2 ± 0.73 ms. Activity in the remaining neurons preceded the fictive vocal motor spikes by 0.67 ± 0.33 to 3.0 ± 1.0 ms (Fig. 5B). For those neurons that lagged fictive vocal activity, there was a trend toward increased spike latency over the course of a fictive grunt, whereas there was no relationship between spike number and latency in neurons that lead fictive vocal activity (Fig. 5C).
Saccular efferent neurons
Unlike caudal efferent bundle neurons, all saccular efferent neurons responded to fictive vocalizations with an increase in activity (Fig. 6A1,B1). This was followed by rebound suppression in 14 of 24 neurons (Fig. 6B1). Like caudal efferent bundle neurons, excitatory activity peaked just after fictive vocal onset, although the mean duration of excitation was longer in saccular efferent neurons (Fig. 6A2; Table 1). The longer excitatory response may have been attributable to the longer mean fictive vocal durations (112.7 ± 10.4 ms; n = 24) during saccular efferent recordings compared with those during caudal efferent bundle recordings (82.3 ± 5.8 ms). Fish used in the saccular efferent bundle experiments were collected later during the breeding season than those used in the caudal efferent bundle experiments, which may explain the differences in fictive vocal durations between the two groups. Rebound suppression peaked earlier after fictive vocal offset and was much longer in duration than that for caudal efferent bundle neurons (Fig. 6B2; Table 1). From instantaneous frequency plots, maximum mean excitatory activity was 71.0 ± 9.7 spikes/s at a latency of 23.4 ± 5.0 ms (Fig. 6A3). Rebound suppression rates dropped to 15.8 ± 3.3 spikes/s at a latency of 136.9 ± 28.9 ms (Fig. 6B3).
Like caudal efferent bundle neurons, the activity of most saccular efferent neurons (16 of 24) was phase locked to the fictive vocalization output (Fig. 7A). However, across saccular efferents, the timing of efferent activity relative to fictive vocal motor activity was more consistent than in caudal efferent bundle neurons. Activity in all but two phase-locked saccular efferent neurons lagged fictive vocal motor activity by 4.0 ± 0.82 to 6.0 ± 0.41 ms (Fig. 7B). Unlike the increase in latency observed in caudal efferent bundle neurons that lagged fictive vocal motor activity, there was no systematic change in the latency of saccular efferents over the duration of the fictive vocalizations (Fig. 7C).
Discussion
We show here a functional link between the vocal motor system and central efferent neurons that innervate the hair cell-based end organs of the inner ear and mechanosensory lateral line in a vertebrate, the plainfin midshipman fish. The majority of efferent neurons exhibited an increase in activity that peaked just after fictive vocalization onset, whereas others showed an overall decrease in activity. This included neurons that target the sacculus and lateral line, two end organs that are sensitive to the frequencies contained within midshipman vocalizations (Weeg and Bass, 2002; Bass and McKibben, 2003). To our knowledge, this is the first direct demonstration of auditory efferent activation by the vocal motor system in any vertebrate. This coupling accomplishes at least three important tasks (Hoy, 2002). First, it tells the animal that an acoustic signal is self-generated; second, it provides timing information to the inner ear and lateral line about when the sound will occur; and third, because saccular and lateral line afferent activity is reduced by efferent activity (Russell, 1971; Russell and Roberts, 1972; Furukawa, 1981; Lin and Faber, 1988), it provides a mechanism whereby peripheral auditory and lateral line sensitivity can be actively reduced during vocalization to prevent overstimulation by reafferent signals.
All saccular efferent neurons responded to fictive vocalizations with an increase in activity correlated with fictive vocal onset, with a subset also showing rebound suppression at fictive vocal offset. The sacculus is the main auditory end organ in most teleosts, including midshipman (Bass and McKibben, 2003). Studies in goldfish show that efferents inhibit saccular afferent activity (Furukawa, 1981). Therefore, the vocal-related increase in efferent activity observed here likely causes a decrease in saccular afferent activity that would effectively decrease auditory sensitivity to self-generated sounds and maintain the sensitivity of hair cells to external sounds by preventing habituation of the saccular hair cell-afferent nerve synapse (Furukawa and Matsura, 1978; Furukawa, 1981; Winslow and Sachs, 1987; Lin and Faber, 1988). The rebound suppression may temporarily increase auditory sensitivity immediately after vocalizations. Consistent with the major role of the sacculus in encoding conspecific vocalizations, this sensitization period lasts much longer in its efferents than those of other end organs.
Caudal efferent bundle neurons showed a greater variety of responses than did saccular efferents. This variability suggests that the efferent system is not a simple modulator but rather consists of multiple components that respond differentially to a given stimulus, which could lead to end organ-specific changes in afferent activity or differential modulation of activity within a single end organ (Tricas and Highstein, 1991). Because the caudal efferent bundle mainly projects to the lagena and posterior lateral line, with a lighter input to the posterior semicircular canal, these different responses may represent efferent neurons that project to different end organs. The posterior lateral line is responsive to a significant portion of the spectral energy of a midshipman call (Weeg and Bass, 2002), whereas the lagena is an otolithic end organ that may serve an auditory function in a limited capacity (Lu et al., 2003). Thus, the posterior lateral line and lagena would both be likely targets of excitatory efferent neurons that would reduce sensitivity of these end organs in a similar manner to the sacculus. Consistent with an overall decrease in inner ear activity during vocalizations, posterior semicircular canal afferents may be the target of suppressive efferent neurons, which should decrease vestibular activity (Highstein and Baker, 1985; Boyle and Highstein, 1990). Alternatively, it may not be necessary to modulate vestibular sensitivity during vocalization, and the posterior semicircular canal may be the target of unresponsive efferent neurons.
The temporal periodicity of efferent responses, especially for saccular efferents, reflects the precise timing information that is inherent to the vocal pattern-generating network that initiates the rhythmic, oscillatory-like firing of sonic motor neurons (Bass and Baker, 1990). This could have important implications for the temporal nature of afferent modulation. Midshipman saccular and lateral line afferents respond to tonal stimuli in a phase-locked manner, firing action potentials only during part of the stimulus waveform (McKibben and Bass, 1999; Weeg and Bass, 2002; Weeg et al., 2002). If the timing of efferent activity matches the phase at which its target afferent responds to a reafferent vocalization, afferent sensitivity could be modulated only during a specific portion of the vocal waveform and remain unchanged during the remainder. This could be a finely tuned temporal mechanism by which sensitivity to reafferent stimuli is reduced while sensitivity to external sounds is essentially maintained.
The fictive vocalizations induced by our experimental paradigm are similar in duration and pulse repetition rate to agonistic grunts produced by midshipman males (Bass et al., 1999). Hums, which are advertisement calls, are much longer (up to 1 h) and are difficult to elicit in a physiological preparation. Given the duration of the efferent response, the midshipman efferent system may be maximally suited for adjusting auditory sensitivity at the onset and offset of vocalizations. It should be noted that we report the timing of the efferent response relative to the timing of the vocal motor volley recorded from the sonic nerve. Because sonic muscle contraction occurs ∼5 ms after sonic nerve activity (Packard, 1960; Skoglund, 1961; Bass and Baker, 1990), the timing of the efferent response relative to sound production will be shifted by this amount. However, this should be a uniform shift across all efferent neurons and does not affect the interpretation of our results. Moreover, because saccular efferent activity on average lagged fictive vocal activity by 4-6 ms, it should closely concur with the onset of an actual vocalization in vivo.
Although the physiological effects of efferent stimulation on afferent activity are well established (Art et al., 1982; Roberts and Meredith, 1992), the behavioral relevance of the efferent system is not well understood. Lateral line efferent activity increases during movement in frogs (Russell, 1971) and dogfish (Roberts and Russell, 1972), whereas behavioral arousal causes an increase in efferent activity that is accompanied by a decrease in lateral line afferent activity in toadfish (Tricas and Highstein, 1990, 1991). Based on these results, it is hypothesized that the efferent system functions to decrease the response of the lateral line to self-generated water motions, such as those caused by swimming or respiratory movements (Tricas and Highstein, 1991). It is likely that vocal-related increases in efferent activity reported here similarly lead to decreased auditory and lateral line afferent activity. Several features of saccular efferents seem especially well adapted to maintaining sensitivity to conspecific and other acoustic signals both during and immediately after sound production. First, as discussed previously, the especially robust phase locking of saccular efferents to each cycle of the fine structure of a vocalization provides a mechanism for maintaining sensitivity to concurrent vocalizations from conspecifics, which is a natural listening task for midshipman especially when producing their advertisement calls (Bass et al., 1999). Second, the early onset and long duration of the rebound suppression of saccular efferents provides a rapid and relatively long-lasting period of sensitization for the detection of acoustic stimuli.
Considering the extensive similarities in the pattern of auditory brainstem organization of teleosts to that of other vertebrates (McCormick, 1999; Bass et al., 2005), including efferents to the inner ear (Roberts and Meredith, 1992; Manley, 2000), we propose that the vocal motor-efferent coupling identified here provides a mechanism for all sonic vertebrates to decrease peripheral auditory sensitivity during sound production. Although such a mechanism may be especially important in vertebrates that lack a middle ear response, it may also supplement the middle ear response by decreasing sensitivity of the cochlea to stimulation via bone conduction, which would not be affected by the middle ear response (Suga and Jen, 1975). Although cochlear microphonic potentials in bats are not consistent with auditory efferent activation during vocalization (Suga and Jen, 1975), cochlear microphonic decay times during vocalization are suggestive of an efferent effect (Goldberg and Henson, 1998). The presence of a shared mechanism of vocal motor-auditory efferent coupling between such distantly related groups as teleosts and tetrapods would be a striking example of convergent evolution among vertebrates that need to solve similar sensory reafference problems.
Footnotes
This research was supported by National Institutes of Health/National Institute on Deafness and Other Communication Disorders Grant DC00092. We thank B. Carlson, H. Farris, and C. Hopkins for providing critical feedback on this manuscript.
Correspondence should be addressed to Andrew H. Bass, Department of Neurobiology and Behavior, Seeley G. Mudd Hall, Cornell University, Ithaca, NY 14853. E-mail: ahb3{at}cornell.edu.
M. S. Weeg's present address: Department of Biology, Colorado State University, Fort Collins, CO 80523. E-mail: mwerg{at}lamar.colostate.edu.
Copyright © 2005 Society for Neuroscience 0270-6474/05/255967-08$15.00/0