Abstract
Interaural time differences (ITDs) are a major cue for localizing low-frequency (<1.5 kHz) sounds. Sensitivity to this cue first occurs in the medial superior olive (MSO), which is thought to perform a coincidence analysis on its monaural inputs. Extracellular single-neuron recordings in MSO are difficult to obtain because (1) MSO action potentials are small and (2) a large field potential locked to the stimulus waveform hampers spike isolation. Consequently, only a limited number of studies report MSO data, and even in these studies data are limited in the variety of stimuli used, in the number of neurons studied, and in spike isolation. More high-quality data are needed to better understand the mechanisms underlying neuronal ITD-sensitivity.
We circumvented these difficulties by recording from the axons of MSO neurons in the lateral lemniscus (LL) of the chinchilla, a species with pronounced low-frequency sensitivity. Employing sharp glass electrodes we successfully recorded from neurons with ITD sensitivity: the location, response properties, latency, and spike shape were consistent with an MSO axonal origin. The main difficulty encountered was mechanical stability. We obtained responses to binaural beats and dichotic noise bursts to characterize the best delay versus characteristic frequency distribution, and compared the data to recordings we obtained in the inferior colliculus (IC). In contrast to most reports in other rodents, many best delays were close to zero ITD, both in MSO and IC, with a majority of the neurons recorded in the LL firing maximally within the presumed ethological ITD range.
Introduction
To localize low-frequency (<1.5 kHz) sounds in the horizontal plane, mammals exploit minute interaural time differences (ITDs) between the sound waves at the two ears. There is general agreement that coincidence detection and internal delays are involved in the neural extraction of ITD, but the source of internal delays and the nature of the neural code are a matter of ongoing debate. Several mechanisms have been proposed from simple axonal (Jeffress, 1948) or cochlear (Schroeder, 1977; Shamma et al., 1989; Joris et al., 2006) delays to precisely timed interactions between excitatory and inhibitory inputs (Grothe, 2003) or differences in rise times of contralateral and ipsilateral synaptic events (Jercog et al., 2010). The common denominator of all these models is the site at which ITD sensitivity emerges: the brainstem medial superior olive (MSO). This nucleus is located deep within the brainstem and is characterized by the following: (1) very small somatic action potentials (Scott et al., 2010) and (2) a stimulus-locked field potential, the neurophonic (Guinan et al., 1972; McLaughlin et al., 2010) that hampers clean single-neuron isolation and makes the use of complex stimuli problematic. Combined, these characteristics render the MSO a difficult nucleus for extracellular recordings and have led to the anomalous situation that, of all main subthalamic auditory nuclei, the MSO is the nucleus whose function is supposedly best understood and most frequently modeled, but which is at the same time one of the physiologically most poorly characterized nuclei.
Most in vivo studies dealing with the representation of ITDs are performed in the midbrain inferior colliculus (IC), which is commonly believed to inherit its ITD sensitivity via direct projections from the MSO (Winer and Schreiner, 2005). However, the IC does not exclusively receive projections from the MSO. In addition to intracollicular connections, various nuclei of the cochlear nuclear complex (CN), the superior olivary complex (SOC), and the lateral lemniscal nuclei (LLN) project to the IC. For a better understanding of the mechanisms of ITD sensitivity, it is therefore imperative to obtain clean single-neuron data from the first site of binaural convergence: the MSO. This was the aim of the present study.
We circumvented the problems associated with direct MSO approaches by recording with sharp glass pipettes from axons ascending in the lateral lemniscus (LL) of the chinchilla. The LL is a fiber bundle containing ascending axons from the CN, the SOC, and the LLN (Fig. 1). Since there is little published data on ITD sensitivity in the IC of the chinchilla (Chen and Sinex, 1999) we also recorded from this nucleus to have a reference dataset. Our distributions of best delay (BD), i.e., the ITD at which response rate is highest, are markedly different from most (McAlpine et al., 2001; Brand et al., 2002; Pecka et al., 2008, 2010) but not all (Day and Semple, 2011) studies of other rodents. Surprisingly, a large fraction of BDs in both nuclei fell within the ethological ITD range of the chinchilla.
Materials and Methods
Animal preparation.
All procedures were approved by the KU Leuven Ethics Committee for Animal Experiments and were in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals. Single-unit recordings were obtained from a total of 54 (NLL = 39; NIC = 15) chinchillas (Chinchilla laniger) of either sex, weighing between 300 and 500 g. At the beginning of each experiment we intramuscularly administered a mixture of ketamine (0.2 ml/kg, 100 mg/ml, Anesketin) and 2% xylazine (0.15 ml/kg, Rompun) to induce anesthesia and 0.08 ml/kg of atropine (0.5 mg/ml) to reduce salivation. Throughout surgical preparation and recording, the animal was kept in an areflexive state (toe-withdrawal reflex) with a 2:1 mixture of ketamine and diazepam (5 mg/ml, Valium). We used this mixture rather than a combination of ketamine and xylazine because diazepam has a larger therapeutic range than xylazine making it easier to use and because diazepam counteracts hypertonia induced by ketamine. This latter aspect is important in relation to the stability of the recordings. It should be noted that diazepam potentiates GABA-mediated inhibition, as does pentobarbital, which has been shown to influence the ITD tuning of IC neurons in the rabbit (Kuwada et al., 1989). This is likely of less importance for the LL recordings since glycine is the predominant inhibitory transmitter in the SOC nuclei of interest (Helfert et al., 1989).
Animals were placed in a double-walled sound-attenuated chamber (IAC) on a homeothermic blanket control unit (Harvard Medical Instruments) that kept body temperature at 37°C. After removal of both pinnas and exposure of the skull a small metal rod was mounted on the frontal part of the skull with dental cement (3M, clearfil S3 bond) to secure the head in a stereotaxic device. We suspended the upper body of the animal in the air in such a way that only the lower body rested on the heating pad. This suspension technique ameliorated breathing artifacts and thus increased recording stability. To equalize pressure between the middle and external ears, the bullas were vented with 30 cm long polyethylene tubes with an inner diameter of 0.58 mm (PE 50, Intramedic Clay Adams). We also routinely opened the cisterna magna to reduce the pressure exerted on the brainstem thus decreasing cardiac pulsation artifacts.
Two types of exposures were performed to approach the LL: (1) a lateral approach (17 animals) and (2) a caudal (posterior fossa) approach (22 animals). For the lateral approach the dorsal part of both bullas was opened, exposing the middle ear cavity. Although only a single bulla needed to be opened to gain access to the brainstem, a symmetrical craniotomy was performed to preserve acoustical symmetry, i.e., to avoid binaural differences generated by unequal acoustic transfer functions between sound source and inner ear. Via the window made in the right bulla, a small hole was made in the medial wall of the bulla with a scalpel blade to expose part of the cerebellum and occipital cortex. We then aspirated a small part of the cerebellum and cortex to reach the LL. This had to be done carefully to avoid hitting a dorsoventrally running plexus of blood vessels that stemmed from the transversal sinus. The caudal approach entailed opening the skull above the vermis of the cerebellum and a partial cerebellectomy to expose the LL and the ventral part of the IC. This exposure could be readily performed making use of the fourth ventricle as a landmark.
The IC was also approached in one of two ways: (1) a dorsal approach (4 animals) for which part of the right occipital cortex needed to be aspirated after making a small hole aligned to the sagittal and lambdoid sutures and the medial aspect of the bulla and (2) a caudal approach (11 animals) similar to the caudal one used to reach the LL. To reach the dorsolateral part of the IC, the region in which ITD-sensitive units are located, we had to enlarge the trepanation in the caudal direction so that we could insert the electrode under a shallow angle. Note that the bullas remained intact in both procedures.
After advancing the electrode a couple of hundred micrometers into the brain tissue we filled the hole left by the aspiration with 1.5% agar dissolved in distilled water to prevent desiccation and to suppress pulsation.
Stimulus generation.
Stimuli were generated digitally with TDT System II (Tucker-Davis Technologies) using custom-written software. Digitally generated sounds were converted to analog (Power DAC) signals, attenuated (PA4, TDT System3) and delivered via dynamic speakers (Supertweeter; Radio Shack). The speakers were coupled to ear bars, which were inserted into the bony auditory meatus with a custom-made coupler and directed toward the tympanic membrane. The coupler was sealed to the bony meatus with Audalin ear impression compound (Microsonic). The system was calibrated in amplitude and phase with a ½-inch condenser microphone (Bruël & Kjaer).
Estimation of acoustical cross talk for the lateral approach.
The lateral approach entails opening of both bullas. Opening both bullas rather than only the bulla on the side of recording has the advantage of maintaining symmetry in the acoustic transfer functions (verified by acoustic calibrations; Rosowski et al., 2006), but has the disadvantage that it may reduce interaural separation. To assess possible acoustical cross talk, we measured cochlear microphonic (CM) potentials of both ears simultaneously with silver ball electrodes placed at the round window of one chinchilla at 10 frequencies spanning a range from 100 to 2133 Hz. Recordings were first made with both bullas closed (using ear impression compound to close the small openings necessary to place the round window electrodes), then with both bullas open. At most frequencies, the interaural attenuation was surprisingly large (>40 dB) and not noticeably affected by opening of the bullas. As expected, the interaural attenuation was smallest at the lowest frequencies (100 and 233 Hz, where it could be as low as 30 dB).
In addition to the CM measurements we also tested monaural, contralaterally driven fibers that we sporadically encountered and that could, for example, stem from the ventral nucleus of the lateral lemniscus (VNLL) or cochlear nucleus (CN). In none of these units did we observe an increase in firing rate above spontaneous rate after ipsilateral stimulation (broadband noise or tones in the range of 110–20,000 Hz tested up to 70 dB sound pressure level; SPL). We therefore conclude that the symmetric opening of both bullas only marginally affects cross talk. Accordingly, we pooled the data obtained from the lateral and caudal approaches.
Electrophysiology and data collection.
We recorded from single units in the LL using micropipettes filled with 3 m NaCl. We used electrodes with low impedances (Z ≃ 1–5 MΩ) for scouting and sharp, high impedance electrodes (Z ≃ 20–60 MΩ) to record from axons. Recordings from single units in the IC were either done with low impedance (Z ≃ 1–5 MΩ) glass micropipettes or home-made Indium electrodes (Z ≃ 7–10 MΩ). In the initial experiments we used a Narishige 6-axis micromanipulator mounted on a stereotaxic device to position the electrodes. However, to increase the stability of the recordings in later experiments we fixed a Delrin chamber with dental acrylic directly to the skull of the animal on which the hydraulic microdrive (Trent Wells;) could be mounted. This technique is similar to the one routinely used in our lab to record from auditory nerve fibers.
Extracellular voltage signals were amplified (for low-impedance electrodes: SRS560, Stanford Research Systems, 1000×, bandpass filtered 0.3–3 kHz, roll-off 6 dB and for high-impedance electrodes: Molecular Devices Axoclamp 2B Intracellular amplifier and an additional PAR amplifier with variable gain setting, bandpass filtered 0.3–3 kHz, roll-off 6 dB), windowed (early experiments: custom-built peak picker; later experiments: window-discriminator from Bak Electronics) and time stamped with a resolution of 1 μs before being digitized (TDT system II) and stored. In 15 LL experiments and two IC experiments we also sampled and stored the spike waveforms, sound waveforms, and the BAK trigger pulses using an RX8 (sampling frequency 50 kHz or 100 kHz, RX8, TDT system 3) and custom-written software (MATLAB; The MathWorks).
While advancing the electrode through the brain tissue either pure tones (110–20,000 Hz, 250 ms with a silent interval of 50 ms at 70 dB SPL) or binaural beats (fbeat = 2 Hz, 5 s with a silent interval of 1 s at 70 dB SPL) were presented binaurally to search for auditory units. When a unit was isolated, we recorded a binaural frequency tuning curve (FTC) at zero ITD with a threshold tracking algorithm (resolution: 0.1 octave steps) to determine its characteristic frequency (CF). Next, we presented binaural beats (BB; level = 70 dB SPL, fbeat = 2 Hz, duration = 5 s, Nrep = 3) over a range of carrier frequencies (fcar) based on the FTC. Subsequently, we collected noise delay functions (NDFs) to correlated (ρ = 1) and anti-correlated (ρ = −1) broadband noise bursts (50–8000 Hz, 6 ms cos2 rise/fall time) of a duration of 1 s or 5 s at a level of 70 dB SPL varying in ITD between −5 and 5 ms in steps of 0.5 ms with three repetitions per condition. Presentation of ITDs was randomized. If stability allowed we collected NDFs with a higher resolution.
The advantage of using binaural beats as a search stimulus is that it immediately reveals interaural temporal sensitivity to the fine-structure of the sound waveform, in the form of “beating” of the spike rate to the ongoing and changing interaural phase difference (IPD) between the sounds to the two ears (Kuwada et al., 1979). A limitation of the binaural beat stimulus is that it does not reveal interaural temporal sensitivity to stimulus envelope, but such sensitivity should be revealed by properties of the NDFs (Joris, 2003). Another limitation is that sensitivity to binaural beats is not restricted to MSO (Yin and Chan, 1990; Spitzer and Semple, 1995; Day and Semple, 2011; van der Heijden et al., 2013) but has also been reported in lateral superior olive (LSO) neurons (Joris and Yin, 1995; Spitzer and Semple, 1995; Tollin and Yin, 2005). As mentioned below (see below, Binaural beat analysis), detailed analysis of binaural beat responses over a range of frequencies allows tentative but not conclusive identification of MSO versus LSO responses. Other issues can be raised such as the sensitivity to low-frequency ITDs in periolivary nuclei of the SOC (Goldberg and Brown, 1969; Spitzer and Semple, 1995; Day and Semple, 2011), in VNLL (Batra and Fitzpatrick, 2002), and in nonprimary coincidence detectors (Goldberg and Brown, 1969; Spitzer and Semple, 1995; Batra et al., 1997; Coffey et al., 2006). Consideration of the known response properties of MSO and non-MSO neurons with ITD sensitivity argues that the bulk of our data is consistent with an MSO origin rather than with a non-MSO origin: we thus label the ITD sensitivity reported here as “originating in MSO” but this is tentative (see Discussion).
Histological reconstruction of the recording site.
To reconstruct the recording site the electrode track was marked either electrolytically (Indium electrodes, 10 s of 10 μA negative pulsating current at the last site of clear auditory activity, Midgard Precision Current Source; Stoelting) or with 1,1′-dioctadecyl-3,3,3′3′-tetramethylindocarbocyanine perchlorate, (DiI; Invitrogen) applied to the glass electrodes before electrode placement. DiI is a hydrophobic cyanine dye that can be used to mark cell membranes (DiCarlo et al., 1996).
After the experiment, animals were killed with an overdose of sodium pentobarbital and perfused transcardially with 10% formalin. After cryoprotection in a 30% sucrose solution for at least 1 d, the brainstem was frozen using dry ice and sectioned (50 μm) either sagittally (caudal and dorsal approaches) or transversely (lateral approach) using a sliding microtome (HM430; Microm). All sections were first viewed under a fluorescent microscope (Olympus BX-61 with a focus control unit BX-UCB and a fluorescent light source X-Cite 120Q) filtered at 565 nm to visualize the DiI signal. After tracing and photographing the fluorescent signal and the outline of the brainstem (Neurolucida; MBF Bioscience) standard Nissl staining was performed to visualize brainstem nuclei. Photographs of the stained sections were taken and digital overlays with the DiI photographs were made and traced (Adobe Photoshop).
Analysis.
Unless otherwise stated, all fits used the least-squares criterion.
Spike shape.
It is a general observation that monopolar spikes are associated with axonal recordings and bipolar spikes with somatic recordings. This is in line with our own observation in recordings from auditory nuclei (e.g., CN or IC) versus recordings from axon bundles (e.g., auditory nerve or trapezoid body). We recorded the spike waveforms in 15 LL experiments. We first determined the mean spike waveform over all stimuli and repetitions per dataset, i.e., per BB, NDF, etc., by averaging the spikes over a window of 8 ms centered on the occurrence of the trigger pulse of the spike discriminator. To classify the spike waveforms as being either monopolar, i.e., axonal, or bipolar, i.e., somatic, we calculated the polarity index (PI) as follows: with Vmax being the maximal voltage and Vmin the minimal voltage of the mean spike waveform. After visual inspection of the PI distribution we choose values ≤1.6 to indicate a monopolar spike waveform (see Fig. 5).
Latency analysis.
Because our tracks were dorsal to or transversed the dorsal nucleus LL (DNLL; Fig. 2), axonal recordings may also stem from ITD-sensitive DNLL neurons projecting to the IC. Therefore, we subjected the two classes of spike shapes, i.e., monopolar and bipolar, to a latency analysis. The rationale for this analysis is that the DNLL is at least one synapse “higher” in the ascending auditory pathway than MSO neurons (Fig. 1). Accordingly, if spikes stem from SOC axons, monopolar spikes should exhibit shorter latencies than bipolar spikes. Moreover, if the recordings are indeed of SOC origin and are not mixed with axons from the DNLL, we expect the latency distribution of monopolar spikes to be unimodal.
We compared response latencies in two ways by tallying the first spikes after stimulus onset, and by constructing poststimulus time histograms (PSTHs). For each neuron, both methods were applied across all stimulus conditions and repetitions, resulting in a single latency profile per neuron for each method. Normalization was performed so that the area of each profile was equal to one per neuron. After summing the individual distributions separately for monopolar and bipolar datasets, we normalized the summed distribution again so that the area under the curve equaled one: these are the profiles shown in Figure 5D and E. No effort was made to remove “spontaneous” spikes from contributing to the first-spike latency data. As will be discussed in Results (Fig. 5D,E), the contribution of early “spontaneous” spikes was small.
FTCs.
The highest CFs we encountered were 12.3 and 4.3 kHz in the LL and IC, respectively. We focused on ITD sensitivity to binaural beats at and near CF: such ITD sensitivity was only observed in units with a CF below 1500 Hz. We therefore restrict analysis in this paper to units with CF below that limit. We did not study ITD sensitivity in the low-frequency “tail” of neurons with higher CF.
We only included FTCs that had a minimal threshold of at most 65 dB SPL. This criterion led to the exclusion of three and four neurons in the LL and IC, respectively. The next criteria for inclusion in our database were based on a rounded exponential filter (roex-filter; Patterson, 1976) fit to the FTCs. The symmetrical roex-filter is an estimate of the shape of the auditory filter, and is parameterized as follows: where g is the normalized deviation of the frequency from the characteristic frequency CF, p is the slope parameter, and |H(f)|2 represents the squared magnitude of the filter. Parameter p can be used to calculate the equivalent rectangular bandwidth (ERB) as follows: In addition to the threshold criterion, an FTC had to satisfy the two following criteria: its ERB had to exceed 40 Hz and the coefficient of determination (R2) of the roex-fit had to be larger than 0.8. These criteria allowed automated rejection of oddly shaped tuning curves.
We smoothed the FTCs depicted in Figure 6 with a moving average of 5 points for display purposes.
Binaural beat analysis.
We characterized the phase-locking features of a unit with a standard vector-averaging approach (Goldberg and Brown, 1969). Responses were binned (period histogram) at the beat period for all fcar and the mean phase was calculated according to the following: and with f being the beat frequency in Hz, t the time of occurrence of the jth spike, and n the number of spikes. Individual spikes are considered to be vectors distributed along the unit circle and the direction, i.e., the phase ϕ, of the mean of this distribution is given by the following: where k equals zero or one depending on the signs of x and y.
The vector strength (VS), i.e., the amplitude of the vector, was then calculated as follows: A VS of 1 indicates perfect phase locking; no phase preference gives a VS near 0. We tested the significance of phase locking with the Rayleigh test of uniformity (Mardia, 1972): We considered p ≤ 0.001 to indicate significant phase locking. Only data that confirmed to this criterion were included in the analysis. Subsequently, we fit a regression line through the ϕ versus fcar data weighing each data point with the product of its VS and spike rate (Kuwada et al., 1987). The slope of the regression line gives the characteristic delay (CD) and the y-intercept yields the characteristic phase (CP) of the unit. The CP indicates whether tone delay curves would coincide at a peak (CP ∼ 0), at a trough (CP ∼ 0.5), or somewhere between a peak and a trough (CPs other than 0, 0.5, and 1) and can thus be used to approximately classify units as peak type or trough type (Batra et al., 1997). As far as known, MSO neurons are mostly peak type and LSO neurons mostly trough type, but the CP is not absolutely dichotomous as a criterion (Yin and Chan, 1990; Joris and Yin, 1995; Tollin and Yin, 2005; Day and Semple, 2011).
We tested for linearity of the ϕ versus fcar relation by using a statistical approach based on the sum of squares of the deviation of the points from the regression line (Yin and Kuwada, 1983, their Appendix). Only units that satisfied the linearity criterion were included in the analysis. Additionally, we excluded weakly driven units, i.e., whose maximal spike rate was <10 spikes/s.
Furthermore, we calculated so-called composite curves to estimate a neuron's BD to wideband stimuli. First, we converted the BB responses to tone delay functions (TDFs) for all fcar tested by multiplying the phase of the period histogram with the period of the corresponding fcar. The composite curve is obtained by linearly summing the TDFs: it is a linear estimate of the response to a broadband (covering the tested fcars) stimulus, e.g., of the NDF to broadband correlated noise.
Noise delay functions.
We classified a unit as being ITD sensitive to broadband noise if it met three different requirements. First, we calculated the modulation depth (MDNDF) for each NDF as follows: with Rmax and Rmin being the maximal response and minimal response of the NDF, respectively (Joris, 2003; Smith and Delgutte, 2007). Next, each NDF was fit with two different functions: where A (Hz) is the amplitude, f (Hz) is the frequency, ϕ (rad) is the phase, and S (Hz) the offset. The second function was a Gaussian, which had the form where A (Hz) represents the amplitude, BD (ms) the best delay, HW (ms) the half-width, and B the offset (Hz). A unit was deemed to be ITD sensitive if its MD was at least 0.7, its maximal firing rate exceeded 10 spikes/s, and if at least one of the two fits had a coefficient of determination (R2) above 0.8. These criteria ensured the exclusion of neurons with poor ITD sensitivity.
Measurement of BD.
For both the responses to binaural beats and noise, we determined the delay yielding the maximal response, i.e., the BD. Because it is a key value, we tried to incorporate as much data as possible in the BD measurements. For units for which NDFs to both correlated and anti-correlated noise were available, we calculated the difference between them. The resulting curve is called the “difcor” (Joris, 2003). The difcor is less noisy than NDFs and provides a simple response linearization. From the fast Fourier transform of the difcor the so-called dominant frequency (DF) can be derived (Joris et al., 2005).
We also obtained a difcor from the responses to tones. Besides the composite curve obtained by linear summation of the TDFs, we constructed a composite curve to responses phase shifted by π before summing, and the two composite curves were then subtracted just as for the NDFs (Joris et al., 2005). For all ITD curves (TDFs, NDFs, and composite curves) positive ITD values indicate a lead in the contralateral ear (i.e., re recording site): a condition which in the free field normally corresponds to contralaterally positioned sound sources.
An important aim of our recordings was to study the distribution of BD as a function of CF. However, CF was not available for all neurons: in those cases we used DF, calculated from the difcor, as a substitute value. It has been shown earlier that the DF derived from NDFs is well correlated to CF up to several kilohertz, both in monaural neurons (auditory nerve: Louage et al., 2004, trapezoid body: Louage et al., 2005) and binaural neurons (Joris et al., 2005). Another frequently used measure is best frequency (BF; i.e., the frequency for which response to tones is maximal), but the correlation of that measure to CF is poor: bushy cells (Joris et al., 1994a,b) and binaural neurons (Joris et al., 2005) commonly show much higher firing rates at low frequencies, in the tail of their tuning curves, than at CF. For the population of neurons for which both measures were available, R2 between CF and DF, based on NDFs, was 0.92 (p ≪ 0.01, N = 24). BF, however, was poorly correlated to CF (R2 = 0.66, p = 0.01, N = 14) and was not used here.
Results
We present data from a total of 177 neurons recorded in the LL and 68 neurons from the IC. A total of 141 FTCs were recorded in the LL (55 lateral approach, 86 caudal approach) and 60 in the IC (47 caudal approach, 13 dorsal approach). Of those a subset of 85 LL and 33 IC neurons had CFs < 1500 Hz and were binaurally sensitive. We obtained responses to BB stimuli from 32 LL (10 lateral approach, 22 caudal approach; 9 CF, 23 DF) and from 7 IC (6 caudal approach, 1 dorsal approach; 7 CF, 0 DF) individual neurons as well as 28 LL (10 lateral approach, 18 caudal approach; 13 CF, 15 DF) and 22 IC (19 caudal approach, 3 dorsal approach; 16 CF, 6 DF) neurons probed with binaurally delayed noise bursts (NDFs). Three of the 177 LL neurons appeared to be monaural when tested with ipsilateral, contralateral, and binaural tone pips at CF. Two of these neurons (1 chopper type and 1 onset type) were contralaterally driven and the remaining chopper-type neuron was ipsilaterally driven. The surprisingly small number of monaurally driven neurons is most likely due to a recording selection bias (e.g., in fiber diameter and/or location) combined with stability issues that prevented us from fully characterizing some encountered neurons, i.e., we could not hold all fibers long enough to measure more than an FTC (these recordings are not further considered here).
Histological reconstruction of the recording site
Figure 2 depicts tracings of a sagittal section (Fig. 2A) through part of the brainstem and midbrain and of a transverse section (Fig. 2B) of two animals, in which the electrodes were aimed at the LL. Borders of the nuclei of the LL such as the DNLL and the VNLL are marked with solid black lines. The borders of the IC and the superior colliculus are indicated with dashed black lines. Locations of neurons that showed Nissl bodies in the cresyl violet stained slices, and which were located within the nuclear boundaries (solid black lines) are depicted as black circles. The electrode tracks are visualized as light gray patches with black edges, which indicate the distribution of DiI seen under fluorescence microscopy. Typically, in both the caudal (Fig. 2A) and the lateral (Fig. 2B) approaches, electrodes penetrated the dorsal portion of the DNLL or the lemniscal axon bundle that exits the DNLL dorsally and projects to the IC, so that axonal recordings could potentially stem from DNLL output axons projecting to the IC instead of SOC axons. However, we will argue below that these two populations can be distinguished by a difference in first-spike latency. Before presenting the results of a latency analysis we first describe spike waveforms.
Spike waveforms
We routinely observed the spike waveforms on an oscilloscope, but in 15 experiments aimed at the LL we recorded not only spike times but also spike waveforms. Figure 3, A and B, shows the spike waveforms of neurons with CFs of 327 Hz (A) and 862 Hz (B) recorded with low-impedance electrodes and stimulated with interaurally delayed, binaural noise bursts (duration = 1000 ms; 5 ms cos2-onset/-offset ramp; BW = 50–8000 Hz; level = 70 dB SPL; ρ = 1; Nrep = 3). We extracted the waveforms 4 ms prior and 4 ms after spike occurrence (Fig. 3A,B, shades of gray) as determined by the BAK window discriminator and averaged the extracted shapes (thick white line). The triggered spikes shown stem from a single neuron. There are no spikes at intervals smaller than a few milliseconds. Note that both cases exhibit a large initial positive deflection followed by an equally large negative deflection (normalized amplitude given in arbitrary units, a.u.). This bipolar shape is typically associated with recordings of somatic origin, i.e., here from DNLL neurons. The corresponding dot raster plots are shown in Figure 3 C and D. In these plots each dot represents the occurrence of one spike and the stimulus parameter varied, i.e., ITD, is plotted on the ordinate with time shown on the bottom abscissa. The vertical solid line indicates the offset of the stationary part of the stimulus, i.e., up to the offset ramp. The thick line with circular symbols shows the mean spike rate in Hz (top abscissa), i.e., the neurons' NDF. The white patch centered at zero ITD indicates the approximate ethological range of ITDs of the chinchilla (von Bismark and Pfeiffer, 1967; Koka et al., 2011; Lupo et al., 2011). Note that the BD of both neurons falls within this range. For the response in Figure 3, A and C, the BD is at negative delay whereas the other neuron's (Fig. 3B,D) BD is positive. As we will show later (Fig. 11) most of the recorded neurons had positive BDs, i.e., would respond best to sounds in the contralateral hemifield.
Figure 4 depicts the spike waveforms (A and B) and dot raster plots (C and D) of responses of two neurons recorded with high-impedance electrodes (left column CF = 375 Hz CF; right column CF = 1137 Hz) in response to ITDs of broadband noise. Individual spikes are shown in shades of gray and the averaged spike shape is shown as a thick white line. The recordings show a large signal-to-noise ratio and spike intervals do not violate the refractory period, again indicating that these are clean single-neuron recordings. In contrast to the spikes shown in Figure 3, A and B, these waveforms exhibit a large positive deflection followed by a small negative deflection, i.e., they are monopolar.
We classified our data in monopolar and bipolar spikes by calculating the PI (see Materials and Methods), for all averaged spike waveforms. The resulting PI values are shown in Figure 5A as a function of the difference between the time of maximal and minimal amplitude. Because there is a break at 1.6, we classified waveforms with PI equal to or smaller than 1.6 to indicate a monopolar spike (dark gray markers) and larger values to indicate bipolar spikes (white markers). The results of this classification are shown in Figure 5B and C. These subplots show the averaged spike waveforms of all spikes classified as monopolar (Fig. 5B) and bipolar (Fig. 5C). Note that we sampled far less bipolar than monopolar waveforms (3 neurons vs 29 neurons) since we aimed at collecting data from axons with high-impedance electrodes. Impedance dropped in the course of a penetration thus biasing the recording toward a somatic spike origin, so we were occasionally able to record from axons and somata within one penetration. The neurons classified as bipolar stem from such penetrations.
Latency analysis
We extracted the first-spike latency of all monopolar and bipolar units to all stimuli and repetitions and calculated the summed probability distributions (see Materials and Methods). The resulting histograms are shown in Figure 5D. The histogram for all monopolar units is shown in dark gray, the one for bipolar units is depicted in white. It is apparent that the histogram for the bipolar, i.e., presumed DNLL recordings, peaks between 10 and 15 ms. The first-spike latency distribution of the monopolar spikes, however, clearly peaks earlier, between 6 and 8 ms. As is visible in Figure 4, C and D, “spontaneous” spikes were often present, particularly in monopolar units, and generate the low plateau of short first-spike latencies in Figure 5D. Because their contribution is small and does not hamper interpretation of the difference between monopolar and bipolar distributions, no effort was made to remove these “spontaneous” spikes from the first-spike latency calculation. A simpler analysis is shown in Figure 5E, which shows cumulative PSTHs for the monopolar and bipolar units. Here again, there is a clear difference in rise time and peak response for bipolar and monopolar units. Note that the absence of a second peak in both distributions of monopolar units (Fig. 5D,E) at latencies similar to the bipolar distribution also suggests that the majority of the data in that set are not of DNLL origin. There are, to our knowledge, no published latency values for chinchilla MSO or DNLL responses. However, considering the slow stimulus rise time and low CFs, the two ranges of latencies obtained here are in the right ballpark to be consistent with an SOC and DNLL origin (cat: Brugge et al., 1970; Guinan et al., 1972). The latencies are thus consistent with the interpretation that the monopolar responses largely reflect activity of axons of SOC neurons and the bipolar responses from DNLL neurons. Note, however, that the latency analysis does not allow for the disambiguation between an MSO or LSO origin.
Together the spike waveform, binaural sensitivity, and latency distributions strongly suggest that with high-impedance electrodes we indeed recorded from SOC axons. Henceforth, “LL neurons” refers to neurons generating monopolar spike waveforms (as documented here and monitored throughout all experiments by online display of spike waveforms on an oscilloscope and by impedance tests). Note that all units reported as stemming from the LL had latencies < 10 ms. We now turn to the characterization of units recorded from the LL and the IC without further considering the three bipolar neurons of presumed DNLL origin.
FTCs
Even though CF figures prominently in discussions of binaural coding (McAlpine et al., 2001), few studies in MSO obtained threshold tuning curves and traditional CF measures, due to the recording problems in MSO mentioned earlier (see above, Introduction). Such measures were one of the aims in the present study. To assess the quality and validity of our CF measure we present an analysis of the FTCs. Figure 6 shows a manually chosen selection of tuning curves obtained in the LL (A) and the IC (B). These representative curves spanning the low-frequency range have been slightly smoothed by applying a running average. Markers indicate thresholds at CF for all ITD-sensitive neurons recorded, not only for those for which FTCs are shown. We also recorded from 41 and 13 neurons with CF > 1500 Hz in the LL (12.3 kHz, highest CF encountered) and IC (4.3 kHz, highest CF encountered), respectively. These neurons were not sensitive to IPD in binaural beats at frequencies around CF and are not further considered here. Possibly these neurons had ITD sensitivity in their low-frequency tail or to envelopes (e.g., using broadband noise or amplitude-modulated sounds), but this was not systematically explored, mostly for reasons of time.
The most striking observation here is the abundance of well tuned and sensitive neurons with CFs as low as a few hundred hertz. The histograms of Figure 6, C and D, provide an overview of the normalized frequency and threshold distributions of the LL (red) and IC (blue). From Figure 6 it becomes clear that we encountered a large number of low-frequency units both in the LL and in the IC. The IC distribution is clearly skewed toward units with CFs < 500 Hz while the LL distribution shows a broader peak < 1000 Hz. Certainly for the IC, the placement of the electrodes biased the sample toward low CFs. To some extent, for both the IC and the LL, the binaural beat search stimulus may have biased the CF distribution, but note that the search stimulus extended up to 20 kHz, so that the low-CF bias in the LL is stronger than expected. The threshold distributions of the LL and the IC (Fig. 6D) are very similar. They span a range from approximately −10 dB SPL up to ∼60 dB SPL. The very low thresholds at low CFs may be exaggerated due to the opening of the bullas (Rosowski et al., 2006). We compared the Q10 values as a function of CF with the trend line reported for chinchilla auditory nerve (AN) (Ruggero and Temchin, 2005), and found good agreement (data not shown). Q10 percentiles (25/50/75) were as follows: LLQ10: 1.00/1.38/2.21; ICQ10: 1.02/1.79/2.43.
The abundance of low-frequency neurons and their sharp and sensitive tuning renders the chinchilla an interesting species for binaural physiology. Next we characterize ITD-sensitivity probed with binaural beat stimuli.
Responses to binaural beats
Figure 7A shows a dot raster plot of a LL recording (CF of 308 Hz) in response to a 2 Hz binaural beat of varying fcar. Poststimulus time and fcar are plotted on the bottom abscissa and the ordinate, respectively. Dots indicate the occurrence of a spike, the vertical dashed line marks stimulus offset. Spike rate (top abscissa) is shown as a black line with circular markers: it peaks at fcars close to the unit's CF of 308 Hz. Furthermore, it is apparent that the responses at most fcars are clustered around a specific phase of the beat frequency. This phase is somewhat later in the IPD cycle for increasing fcar, visible as a gentle tilt to the right in the dot rasters.
To quantify this behavior we performed a standard binaural beat analysis (Yin and Kuwada, 1983) and plotted VS and mean IPD (ϕ) as a function of carrier frequency in Figure 7B. We excluded all points that according to the Rayleigh test were not significantly phase locked. For this example VS (light gray squares) peaks over a broad range of fcar centered near 400 Hz. In this frequency range VS values are close to 0.8 indicating strong IPD sensitivity. Note that, with decreasing fcar, the VS values decrease from this maximum, even though firing rate (Fig. 7A) is still high. We will return to this point below (Fig. 7E). With increasing fcar phase accumulates reaching a maximum of 0.2 cycles at 750 Hz. A linear fit through the ϕ versus fcar relation yields an intercept, i.e., a CP, of 0 cycles and a slope, i.e., a CD, of 300 μs. These are typical values for our population as can be seen in Figure 7C and D. The histogram in Figure 7C shows the CP distribution of our sample of 32 neurons recorded from the LL. We have split the dataset in neurons for which only the DF (red) was available and those for which we obtained the CF (white). The circular mean of CP across DF and CF of 0.1 cycles falls close to 0 cycles; values larger than 0.2 are rare. This indicates that most units were of the peak type. This dominance of peak-type units (CP close to 0) is consistent with the assumption that most of our recordings stem from the MSO. In contrast, LSO neurons, due to their excitatory/inhibitory interactions, exhibit trough-type characteristics with CPs close to 0.5 (Joris, 1996; Batra et al., 1997; Tollin and Yin, 2005). Interestingly, there is a small cluster of three neurons that had a CP close to 0.5. Here it should be noted that not only LSO but also MSO neurons have been reported to exhibit CPs close to 0.5 (gerbil: Day and Semple, 2011). Testing for ILD sensitivity would lend further support for an LSO versus MSO origin but was not performed due to constraints in the time of recording. The CD distribution (Fig. 7D) is centered near zero ITD. With a value of 120 μs, the mean even falls within the ethological range of ∼300 μs of the chinchilla (Koka et al., 2011).
A question of interest (Joris and Verschooten, 2013) is the upper frequency limit at which sensitivity to binaural beats is obtained, as this type of sensitivity stands as a model for the use of temporal fine-structure by the central auditory system. Figure 7E shows VS values as a function of stimulus frequency for our population of IC (blue) and LL (red) neurons. Only significant values are shown (see Materials and Methods). Significant IPD sensitivity is found up to 2 kHz. In the IC this was the highest fcar tested, so we cannot be sure it is an upper limit, but it is the same limit as reported for auditory cortex (Benson and Teas, 1976) and in the LL one neuron was tested up to fcar of 3 kHz but was only IPD sensitive up to 2 kHz. We therefore surmise that 2 kHz is the upper limit of IPD sensitivity or is very close to it. A second feature of note is the sharp drop-off of IPD sensitivity at low frequencies in many neurons, giving many IPD functions a bandpass character. This is particularly the case for stimulus frequencies <∼400 Hz. Examination of the IPD functions at these frequencies shows no evidence for peak splitting, but rather the appearance of a sustained response without any stimulus-related periodicity. This is visible in Figure 7A: the neuron is driven at similar spike rates at 100 and 350 Hz (Fig. 7A, solid line), but IPD sensitivity (Fig. 7B; VS values to fbeat) is poor at the lowest frequency.
Figure 7F shows an inverse relationship between CD and CP for a combined population of IC and LL neurons. Such an inverse relationship was also described in the IC of guinea pig and cat (McAlpine et al., 1996; Joris and Karino, 2012) and in the DNLL of the guinea pig and gerbil (Agapiou and McAlpine, 2008; Lüling et al., 2011). The relationship in Figure 7F is very similar to that in guinea pig, while it is less tight in cat and gerbil. The origin and significance of the relationship are unclear.
As described in the Materials and Methods section we also constructed TDFs and composite curves from the binaural beat data to extract BD values. The contour plot in Figure 8A gives an overview of the TDFs for the example unit of Figure 7A and B. ITD and fcar are the ordinate and abscissa, respectively. Spike rate has been normalized to the maximal rate so that white and black contours correspond to a value of one and zero, respectively. The white solid and dashed hyperbolic lines indicate the extent of a stimulus cycle and 1/4 of a stimulus cycle. The upper white line, referred to as the π-limit because it is separated by half a stimulus cycle from 0 ITD, describes an upper bound that is found empirically in the distribution of CD or BD in different species (Benson and Teas, 1976; McAlpine et al., 1996; Brand et al., 2002; Hancock and Delgutte, 2004; Joris et al., 2006; Pecka et al., 2008). A streak of high spike rates is seen contralaterally close to zero milliseconds but outside the ethological ITD range (indicated with dashed white, horizontal lines). As expected peak activity is reached close to the unit's CF of ∼300 Hz at an ITD of ∼300 μs. Activity outside the π-limit indicates side peaks in the TDFs. The side peaks, necessarily, follow a hyperbolic trajectory and the distance between a given side peak and the main peak near to zero ITD corresponds to the stimulus period. Collapsing the data along the frequency axis onto the ITD axis, i.e., summing all TDFs and normalizing the sum to the resulting maximum, yields the so-called composite curve (Yin et al., 1986), which gives a reasonable prediction of the response to correlated broadband noise. We computed the composite curve and also the corresponding prediction for the response to anti-correlated noise, following the procedure of Joris et al. (2005). Subtraction of the two curves gives the difcor shown in Figure 8B as a thick black line. The BD, again defined as the ITD of maximal response, is positive (i.e., in response to a stimulus leading in the contralateral ear) close to zero ITD but outside of the ethological range. The side peaks of the composite curve are suppressed relative to the side peaks of the TDFs. This is because they do not line up across frequencies (Fig. 8A). Together Figure 8, A and B, demonstrates the procedure used to obtain BDs from binaural beat data. We extracted BDs for all of our units from the simulated difcors. Before discussing these data we first present typical generic NDFs in the next section.
NDFs
Along with the difcors to tones (thick line), Figure 8B also shows the difcor obtained from actual NDFs (thin line). The derivation of such difcors is illustrated in Figure 9, for two LL units (A, B) and one IC unit (C) with similar low CFs (200–300 Hz). Shown are the responses to correlated (ρ = 1; blue) and anti-correlated (ρ = −1; red) Gaussian noise bursts of varying ITD, as well as their subtraction (black). The error bars indicate the SD of the responses. As expected for low-CF neurons, the NDF to anti-correlated noise approximates a mirror image of that to correlated noise, i.e., peaks and troughs were inverted, indicating that the ITD sensitivity to noise is dominated by fine-structure (Joris, 2003).
To compare spike rates in ITD-sensitive IC (blue) and LL (red) neurons, we constructed histograms (Fig. 10) of driven rates at BD (A), nondriven rate (B; i.e., rate after stimulus offset), and modulation depth ((Rmax − Rmin)/Rmax). We used both NDF data and composite curves derived from binaural beat data. LL neurons tended to have higher maximal (p ≪ 0.01, α = 0.01; Kolmogorov–Smirnov test) and nondriven rates (p = 0.004, α = 0.01; Kolmogorov–Smirnov test) than IC neurons, but modulation depth was very similar for the two populations (p = 0.92, α = 0.01; Kolmogorov–Smirnov test).
BD distributions
Finally, in Figure 11 we present BD versus CF or DF for neurons recorded in the LL (red) and in the IC (blue). BDs are shown separately for NDFs (A) and composite curves (B), as well as combined (C). The abscissa indicates either CF (circles) or DF (inverted triangles). If multiple datasets for one neuron were available, we averaged the BDs. The solid and dashed hyperbolic lines represent the π-limit and the 1/4 π-limit, respectively. As already remarked (Fig. 6), the distribution of IC units is biased toward lower CFs compared with the LL.
Several features of the BD distribution are consistent with previous reports in MSO and IC. Most BDs are on the positive half of the ordinate indicating a preference for contralaterally leading binaural stimuli. Furthermore, there is a clear dependence of the range of BDs on CF: the range and mean value are higher at low CF than at high CF. This is particularly clear for the BDs obtained to noise (Fig. 11A).
However, the distribution also differs from most previous reports in rodents in two ways. A first interesting observation is that the majority of BDs in the LL (65%) and a large fraction in the IC (41%) are within the ethological ITD range. These numbers are approximate as the exact ITD range is dependent on head size and frequency, where larger heads and lower frequencies generate a larger ITD range. For example, in the chinchilla the ITD range is ∼250 μs at 1 kHz (Fig. 11, white patch) and ∼300 μs at 250 Hz (Fig. 11, dark gray patch; Koka et al., 2011), and at lower frequencies the ITD range is not known but may be even larger. Note that for almost 60% of the data shown the abscissa indicates CF; for the remainder, DF was used (CFLL: 13 and DFLL: 15; CFIC: 16 and DFIC: 6).
A second and related observation is that small BDs are also found at low CFs and that they are rather widely distributed between the π-limit and the horizontal line at ITD = 0 ms. This is in stark contrast to reports from other rodents (gerbil MSO: Brand et al., 2002; DNLL: Siveke et al., 2006; guinea pig DNLL and IC: Agapiou and McAlpine, 2008; McAlpine et al., 1996, 2001), in which BDs are less dispersed and cluster around the 1/4 π-limit. In this regard, the present data seem to be in better agreement with data from cats (Hancock and Delgutte, 2004; Joris et al., 2006) and a report in gerbil (Day and Semple, 2011).
In the physiological ITD literature, various measures for “optimal delay” have been used (see Discussion). For neurons for which BD estimates from both NDFs (responses to noise) and composite curves (responses to binaural beats) were available, we compared the two (data not shown): the correlation was poor (R2 = 0.35, p = 0.2, N = 15).
Discussion
The circuits that generate ITD sensitivity are well established as a model system to study temporal processing, but characterization of the primary binaural comparison has suffered from the difficulty in obtaining good spike isolation with extracellular recordings (Irvine, 1986). This difficulty limits not only the number of neurons reported, but also the quality and variety of the recordings. The data available are mostly responses to tones (but see Rupert et al., 1966; Yin and Chan, 1990): little is known regarding ITD tuning to broadband stimuli such as noise or clicks, or even regarding very basic properties such as frequency tuning. Recent in vitro studies suggest that the recording difficulty is tied to specialized cellular properties of MSO neurons toward temporal computation (Scott et al., 2010; Golding and Oertel, 2012). Other recording strategies are thus needed. Juxtacellular (van der Heijden et al., 2013) and whole-cell (Franken et al., 2013) recordings are a promising avenue to address the cellular mechanisms of ITD sensitivity in vivo, but their feasibility may be restricted to small species with a superficially located MSO. The trapezoid body–a heavily myelinated bundle of fibers also containing MSO afferents–overlies the MSO and hampers whole-cell recordings in larger species. Here we took an alternative approach in which heavy myelination is actually a bonus: we used sharp micropipettes to record from MSO axons in the LL of the chinchilla. The histological analysis of electrode tracks marked with a fluorescent dye showed that we recorded at the level of the DNLL or dorsal to it (Figs. 1, 2). Consistent with previous experience in regions with myelinated bundles and associated nuclei (SOC: Smith et al., 1993; dorsal and intermediate acoustic strias: Joris and Smith, 1998; LL: P. H. Smith and P. X. Joris, unpublished observations), these recordings are dominated by monopolar spike waveforms. Bipolar spike waveforms occurred only occasionally and were associated with a drop in electrode impedance, likely due to tip damage. Importantly, the latter recordings yielded responses with longer latency (10–15 ms) than the monopolar recordings (∼7 ms; Fig. 5). Moreover the latency distribution of monopolar recordings was unimodal and not bimodal as would be expected if they also sampled DNLL axons. As suggested by a small cluster of CP values near 0.5 (Fig. 7C), there may be a contribution of low-CF LSO axons. Although VNLL neurons show a form of ITD sensitivity (Batra and Fitzpatrick, 2002), it is very different from the sustained ITD sensitivity shown here so that these neurons are not likely to contribute to our database. Ultimately, intra-axonal labeling will be required to firmly establish the MSO origin of the recordings.
The main difficulty encountered was poor mechanical stability. The recordings lasted up to 1 h per fiber and routinely ∼30 min. FTCs were routinely obtained (Fig. 6), as well as high-quality recordings to broadband noise (Figs. 4, 9). It is interesting to observe that the spikes recorded were stable in size for a given recording (Fig. 4), unlike the graded action potentials recorded somatically (Scott et al., 2010). A surprising finding is that the recordings were strongly biased to low-frequency and IPD-sensitive neurons. A spatial sampling bias is unlikely given that we used two, spatially almost orthogonal, approaches. Although the search stimulus used (tone pips and binaural beats) may induce some bias, we surmise that the main source must be in the size of the axons. We are not aware of anatomical studies of axon caliber in LL: our prediction is that MSO axons are larger than fibers from CN, LLN, or other SOC nuclei. Besides the abundance of low-CF binaural neurons, low thresholds and clear frequency tuning (Fig. 6) contribute to the appeal of this experimental model.
Since data on the binaural sensitivity of the chinchilla are limited (Benson and Teas, 1976; Chen and Sinex, 1999) we also collected data from the IC. At the level of analysis presented here, we did not see striking differences between IC and LL. Spike rates tend to be somewhat larger in LL than in IC (Fig. 10A,B). Frequency tuning was similar, with sensitive and well tuned low-CF neurons present in both LL and IC (Fig. 6). There were no obvious differences in the shape of ITD tuning curves (Fig. 9) or in the BD distributions (Fig. 11). Note that the distributions of CFs sampled were somewhat different in the two populations (Fig. 6C). These observations lend credit to the general assumption that binaural properties at the level of the IC largely reflect the properties created at the level of the SOC, but again more extensive analyses along more stimulus and response dimensions and on larger samples are needed to better appreciate any differences between the two structures.
An unexpected and quite striking observation in the chinchilla is the frequent lack of IPD sensitivity at very low frequencies, both in LL and IC (Fig. 7E), despite low thresholds (Fig. 6) and the presence of driven responses (Fig. 7A). The fact that, at these very low frequencies, afferents show good phase locking (auditory nerve: Temchin and Ruggero, 2010, bushy cells: Recio-Spinoso, 2012), and that there is good IPD sensitivity in at least some LL axons and IC neurons, suggests that the limiting factor is at the level of (some neurons of) the MSO itself. We have not explored the phenomenon but it is worth further study as it affects the interpretation of the presence of low-frequency hearing and sound localization ability (Heffner et al., 1994) enabled by large bullas (Webster and Plassmann, 1992) and may give further insight in the process of coincidence detection.
A relation between BD and CF was reported for the guinea pig IC by McAlpine et al. (1996, 2001). Remarkably, BDs respect the so-called π-limit, which corresponds to half a cycle of the CF. This relationship has received much attention as it is unpredicted by a pure axonal time delay model (Jeffress, 1948). Moreover, the BDs in guinea pig cluster around a value of π/4, and are outside of the approximate ethological range of ITDs at the lowest CFs. This observation has led to a revival of “hemispheric” models (Von Békésy, 1960; van Bergeijk, 1962; Hall, 1965), which propose that the code for azimuthal location is in a comparison of overall activity in left and right hemispheres rather than in a topographically mapped array of neurons with different CDs (Jeffress, 1948; Licklider, 1959). In our data there is no obvious clustering of BDs around a value of π/4, and there is an abundance of BDs close to zero ms and within the ethological ITD-range (Fig. 11). In fact the distribution is not very different from that reported in cat (Hancock and Delgutte, 2004; Joris et al., 2006), though the chinchilla seems to have even more neurons tuned to small BDs, which fall inside its approximate ethological ITD range of ∼250 μs (Koka et al., 2011; Lupo et al., 2011), even for CFs < 500 Hz. This is rather unlike the guinea pig, where hardly any BDs are inside that range for CFs < 300 Hz (McAlpine, 2005). This seems to be a genuine species difference or may reflect sampling biases in either species.
In our own view, the BD–CF relationship forms an important constraint regarding the nature of the mechanism(s) generating the BD, but is not very constraining regarding the nature of ITD coding. It is noteworthy that an early study performed in auditory cortex of the chinchilla (Benson and Teas, 1976) also reported a BD–CF relation within the π-limit, from which it also favored a hemispheric coding scheme. However, the BDs reported were for pure tones at BF, which by necessity show a hyperbolic pattern when plotted on a time rather than a phase axis (cf. Reale and Brugge, 1990). Multiple metrics have been used to express the optimal delay (Benson and Teas, 1976) of ITD tuning. The CD to tones, BD to noise, BD of the composite curve calculated from tones, and best ITDs of single tones, are all instructive metrics, but they can differ substantially, which hampers comparisons across studies. Similarly, different measures have been used to express frequency tuning. BF, defined as the frequency giving maximal spike rate, can differ substantially from CF both in binaural neurons and their monaural inputs (Joris et al., 1994a,b). Thus, the choice of metrics for both the abscissa and ordinate can substantially affect distributions as in Figure 11.
How these “technical” issues affect conclusions regarding coding has not been adequately dealt with in the binaural literature and needs to be sorted out. The LL approach developed here opens a new window to the MSO that can contribute to resolving these issues by allowing a wider range of stimuli and a more thorough characterization of binaural properties than has been possible with other recording techniques.
Footnotes
P.X.J. was supported by grants from Fonds voor Wetenschappelijk Onderzoek (G.0714.09, G.0A11.13), BOF (OT/09/50; Flanders, Belgium), and P.B. by a BOF fellowship. We thank Roel Vlayen for expert help with histological processing, Eric Verschooten for assistance with cochlear microphonic measurements, and Ellen Winckelmans for participation in the IC recordings.
The authors declare no competing financial interests.
- Correspondence should be addressed to Philip X. Joris, Laboratory of Auditory Neurophysiology, Campus Gasthuisberg O/N2, K.U. Leuven, Herestraat 49 bus 1021, B-3000 Leuven, Belgium. Philip.Joris{at}med.kuleuven.be