Previous Article | Next Article 
The Journal of Neuroscience, June 1, 2002, 22(11):4625-4638
Context-Dependent Adaptive Coding of Interaural Phase Disparity
in the Auditory Cortex of Awake Macaques
Brian J.
Malone,
Brian H.
Scott, and
Malcolm N.
Semple
The Center for Neural Science, New York University, New York, New
York 10003
 |
ABSTRACT |
In the ascending auditory pathway, the context in which a
particular stimulus occurs can influence the character of the responses that encode it. Here we demonstrate that the cortical representation of
a binaural cue to sound source location is profoundly
context-dependent: spike rates elicited by a 0° interaural phase
disparity (IPD) were very different when preceded by 90° versus
90° IPD. The changes in firing rate associated with equivalent
stimuli occurring in different contexts are comparable to changes in
discharge rate that establish cortical tuning to the cue itself.
Single-unit responses to trapezoidally modulated IPD stimuli were
recorded in the auditory cortices of awake rhesus monkeys. Each
trapezoidal stimulus consisted of linear modulations of IPD between two
steady-state IPDs differing by 90°. The stimulus set was
constructed so that identical IPDs and sweeps through identical IPD
ranges recurred as elements of disparate sequences. We routinely
observed orderly context-induced shifts in IPD tuning. These shifts
reflected an underlying enhancement of the contrast in the
discharge rate representation of different IPDs. This process is
subserved by sensitivity to stimulus events in the recent past,
involving multiple adaptive mechanisms operating on timescales ranging
from tens of milliseconds to seconds. These findings suggest that the
cortical processing of dynamic acoustic signals is dominated by an
adaptive coding strategy that prioritizes the representation of
stimulus changes over actual stimulus values. We show how cortical
selectivity for motion direction in real space could emerge as a
consequence of this general coding principle.
Key words:
binaural; rhesus; adaptation; interaural delay; sound
localization; auditory motion; context
 |
INTRODUCTION |
Sounds originating from locations to
the left or right of the head reach the ears at slightly different
times, resulting in interaural phase disparity (IPD) cues to their
localization in the horizontal plane. Motion of sound sources relative
to the head, produced by motion of either the listener or the source, results in dynamic variations in IPD. Although relative head motion also impacts other cues for sound localization, such as interaural intensity differences and monaural spectral cues, human psychophysical studies suggest that IPD cues dominate sound localization judgments for
broadband sounds when low frequencies are present (Wightman and
Kistler, 1992
). Numerous studies suggest that cortical neurons in a
range of mammalian species, including monkeys, are sensitive to
auditory motion (Sovijärvi and Hyvärinen, 1974
; Ahissar et al., 1992
; Stumpf et al., 1992
; Toronchuk et al., 1992
; Poirier et al.,
1997
). Ablation studies have also implicated auditory cortex in both
sound localization (Neff and Casseday, 1977
; Heffner, 1978
; Kelly,
1980
; Jenkins and Merzenich, 1984
; Kelly and Kavanagh, 1986
; Heffner
and Heffner, 1990
; Heffner, 1997
) and auditory motion detection (Altman
and Kalmykova, 1986
). Although the pioneering studies of Brugge and
Merzenich (1973)
revealed the sensitivity of macaque auditory
cortical neurons to static interaural time and level disparities (ITD
and ILD), cortical responses to dynamic IPDs (Reale and Brugge, 1990
)
have not yet been characterized in a primate model.
Not only is the nature of the cortical representation of time-varying
IPD signals an important question in its own right, such signals are
particularly convenient for analyzing the fidelity of the mapping of
cortical responses to particular IPDs. Advantages include the
robustness of IPD tuning to changes in sound pressure level (SPL) and
the circumscribed range (360°) of the IPD axis itself. The use of
periodic, trapezoidal IPD stimuli, which is unique to this study,
permits detailed analysis of the temporal evolution of contextual
influences both across stimulus periods and across epochs of
steady-state IPDs of relatively long (1 sec) duration.
It is of particular interest whether context dependence is a prominent
feature of cortical representation of IPD, because this coding property
appears to emerge hierarchically in the ascending auditory pathway
(Spitzer and Semple, 1998
). Dynamic interaural disparities in phase
(Spitzer and Semple, 1991
, 1993
, 1995
, 1998
; McAlpine et al., 2000
) and
level (Sanes et al., 1998
), and simulated motion in the free field
(Wilson and O'Neill, 1998
), have revealed a novel form of sensitivity
to stimulus context in the mammalian inferior colliculus (IC). It has
been demonstrated that a particular value of binaural disparity in
phase or level can be consistently associated with widely varying
response rates when the same stimulus occurs in different contexts.
"Conditioned" responses of this sort are not evident in the medial
superior olive.
Monaural stimuli that contain frequency steps or sweeps, which do not
generate a percept of motion, have also been shown to condition the
responses of IC neurons (Malone and Semple, 2001
), suggesting
that conditioning is a general property of the way central auditory
neurons process acoustic signals that "move" along any of the
parameter axes to which they are tuned. Although the current study
focuses on dynamic IPD signals for the reasons described above, the
generality of conditioning effects demonstrated in the midbrain
suggests that the dramatic impact of stimulus history on cortical
processing reported here is not limited to IPD (McKenna et al., 1989
)
and reflects the operation of general synaptic and cellular mechanisms.
 |
MATERIALS AND METHODS |
Subjects, surgical preparation, and physiological
recording. Two adult male monkeys (Macaca mulatta,
designated X and Z) participated in these experiments. All procedures
pertaining to animal use and welfare in this study were reviewed and
approved by the New York University Institutional Animal Care and Use
Committee. Anesthesia was induced with ketamine and sodium thiopental
and maintained with isoflurane (1-4%) while a head-holder that mated
to a specially designed primate chair (Crist) was implanted. A
recording chamber (CalTech) was implanted above the auditory cortex in
the left hemisphere of each animal. The initial placement of the
recording chamber on monkey Z was slightly rostral to allow recordings
across the rostral (R) and rostrotemporal (RT) fields (Hackett et al., 1998
). The back of the initial chamber and the front of the chamber in
its second placement straddled the low-frequency portion of primary
auditory cortex (AI). The implant for animal X was centered over
AI and provided access to caudal R, resulting in a larger sample of
low-frequency IPD-tuned units. All penetrations were made vertically
with respect to the cylinder implants and thus were roughly parallel to
the stereotaxic vertical plane. Both animals are still involved in
experiments, so assignment of recording locations to cortical fields is
based on physiological criteria, such as the tonotopic progression in
AI, and the distribution of response latencies (Scott et al.,
2000
).
Both animals had been extensively trained on binaural lateralization
tasks involving both static and dynamic IPD and ILD cues. During
recordings, blocks of psychophysical trials alternated with passive
listening, when the trapezoidal IPD stimuli described in this report
were presented. Behavioral and recording sessions were all conducted in
a double-walled sound-attenuated chamber (Industrial Acoustics
Company), and the animals were monitored continuously via
closed-circuit television. Single-unit activity was recorded with
tungsten microelectrodes (FHC) advanced into the brain via a stepping
motor microdrive (CalTech). Recording location was referenced to a
stereotaxic positioning system that mounted directly on the implant.
Depths of all recordings were referenced to entry into the brain. Entry
into the superior temporal plane was typically marked by a sudden
increase in activity after a long silent interval and the first
appearance of auditory responsiveness.
Stimulus generation and data acquisition. Stimulus waveforms
were generated by digital synthesizers and custom hardware (MALab, Kaiser Instruments). Stimulus characteristics were specified in software running on the host computer (Macintosh), which communicated with a dedicated microprocessor (MALab) via an IEEE-488 interface. After digital attenuation and digital to analog conversion, the signal
was transduced by electrostatic earphones (STAX Lambda) in custom
housings (Custom Sound Systems) fitted to ear inserts. Before each
experiment, the sound pressure level (SPL) expressed in decibels (dB
re: 20 µPa) at each ear was calibrated under computer control for
level and phase from 40 Hz to 30 kHz, using a previously calibrated
probe tube and condenser microphone (4134, Brüel and Kjær).
Electrical signals from the brain were amplified (variable gain),
filtered (typically from 0.25 to 10 kHz), and passed to oscilloscopes,
an audio speaker, and an event timer (MALab, Kaiser Instruments). The
occurrence of discriminated action potentials and stimulus
synchronization events were logged with a resolution of 1 µsec. Event
times were then retrieved from a "first in, first out" (FIFO)
buffer and stored by the host computer for analysis and display.
Stimulus protocols. IPD sensitivity revealed by responses to
binaural beat search stimuli (see below) was limited to cells with best
frequencies below 2.5 kHz, which was near the behavioral limit of
IPD-based lateralization performance for these subjects (Malone and
Semple, 2000
). Cortical neurons that clearly exhibited a periodic
modulation of their responses to binaural beats were tested with
trapezoidal IPD stimuli (Fig. 1). Unlike
binaural beats, which derive from the presentation of two slightly
different frequencies to each ear, the trapezoidal IPD stimulus is
created by presenting the same frequency to both ears and modulating
the phase at one ear (in these experiments, the left ear). A previous study confirmed that modulating the phase in tandem in both ears did
not modulate the responses of cat IC neurons (Spitzer and Semple,
1993
). Thus, entrainment to interaural phase modulation depends on the
actual IPD cue, rather than the slight frequency change (e.g., 1 Hz for
360°/sec modulation) that necessarily occurs during monaural phase
modulation (i.e., because frequency is the time derivative of phase).
More directly, we have never encountered sensitivity to 1 Hz depth
frequency modulation despite extensive testing with such stimuli as
part of a larger physiological survey of primate auditory cortex.

View larger version (38K):
[in this window]
[in a new window]
|
Figure 1.
Description of the stimuli used in this study.
A, The periodic trapezoidal IPD stimulus set. The 360°
IPD axis is spanned by a set of eight origin-target IPD pairs whose
origins are staggered by 45°; ±180° at the top and
bottom of the IPD axis represents identical values,
because the IPD axis is circular. Each individual stimulus (10 sec)
consists of four periods, and each (2.5 sec) period is composed of two
1 sec duration steady-state IPDs (the origin and target) linked by
linear (250 msec) modulations of IPD. The depth of modulation was 90°
at 360°/sec in all cases. The initial modulation depth is negative
for the stimulus set shown. Note that each sampled IPD point occurs
twice: once as an origin and once as a target. The "partner" IPDs
differ by 180° (e.g., for an initial negative modulation depth, 90°
appearing as the Origin IPD is partnered
with 0°, but 90° appearing as the Target IPD is
partnered with 180°). Thus, the context in which a particular IPD
value occurs varies as much as the 360° IPD axis allows. The
responses during the first origin period were used to estimate the
static IPD tuning function of each neuron (Fig.
2B,C). The interstimulus
interval was always 2 sec. B, Histogram depicting
responses to two presentations of the highlighted stimulus in
A (45° origin, 45° target). Responses during the
interstimulus intervals are shown in gray.
C, Modulation period histogram of the trial-based
responses shown in B. The duration of the
origin-to-target (Sweep1) and target-to-origin
(Sweep2) sweeps was 250 msec. Firing rates were
sometimes calculated separately for each 250 msec epoch of the
steady-state intervals. These four epochs are indicated for the target
IPD ( 45°).
|
|
The carrier frequency and level that elicited the best combination of
discharge rate and synchrony to the period of the binaural beat search
stimuli were selected for the trapezoidal IPD stimuli. The carrier was
nearly always the best frequency of the cell, as determined by
responses to short tone pips with 0° IPD. Sound pressure levels for
physiology (60-80 dB) were comparable to the level (80 dB) at which
the animals performed IPD discriminations.
Each period of each trapezoidal IPD stimulus consisted of an initial
steady-state IPD (origin), a linear phase sweep (90°) to the second
steady-state IPD (target), and a return sweep to the origin IPD (Fig.
1C). The IPD stimulus ensemble was created so that each
steady-state IPD value occurred twice in the sequence, partnered with
maximally different IPDs (Fig. 1A). For example, an
IPD of 45° occurs as the origin in the stimulus modulating repetitively from 45° to
45°, and then again as the target in the
stimulus modulating from 135 to 45°. The difference in stimulus "partners" (e.g.,
45 and 135°) creates the change in stimulus context for the representation of a particular IPD (e.g., 45°). The
IPD axis is circular rather than linear (i.e., 0 and 360° represent
equivalent phases at both ears), and eight origin IPD values, sampled
at 45° intervals, span this axis (Fig. 1A). Thus, an origin IPD tuning function was constructed from the responses to
each of the eight IPD values occurring as origins; similarly, the
target IPD function was constructed from responses to the same IPD
values appearing as targets (see Fig. 2C).
The sweeps in each stimulus of the ensemble involved IPD modulations in
opposite directions over an identical range (e.g., from 45 to
45°
and then back from
45 to 45°). The sweeps followed the presentation
of steady-state IPDs differing by 90°. The responses averaged over
the origin-to-target sweep (see Fig. 2B, Sweep1) were used to construct one IPD tuning function, and the responses from
the target-to-origin sweep were used to construct the other (see Fig.
2B, Sweep 2). The values shown for
the sweeps represent the midpoint of the IPD excursion (i.e., the
response to the sweeps from 45 to
45° and
45 to 45° are shown
at 0°). To monitor response variability over the duration of the
recording, the 0° origin stimulus was presented twice (first and
last) in each sequence. If the responses to the repeated stimulus were
consistent, the data were retained for analysis, and those responses
were averaged before the calculation of the best IPD and tuning
sharpness (described below).
One period of the stimulus is comparable to the modulations of IPD
experienced during listening to a stationary sound source (1 sec),
turning the head (250 msec), listening again (1 sec), and returning to
the original head position (250 msec). In these subjects, a 40° head
rotation produces a 90° IPD for a 1 kHz sound source. By presenting
IPD stimuli dichotically, however, we can produce modulations between
pairs of IPDs positioned at regular intervals throughout the entire IPD
axis, including and beyond the subset of IPDs experienced in the free
field. IPD was linearly modulated by 90° at 360°/sec. The direction
of the initial modulation could be either positive or negative; a
negative initial direction (Fig. 1A) implies a phase
lag at the left ear for the origin-to-target sweep. Each 12 sec trial
consisted of four 2.5 sec periods (Fig. 1A) followed by a 2 sec silent interval. Each
stimulus presentation consisted of two to three trials. Responses
during the origin and target intervals were also analyzed in four 250 msec epochs, as indicated for the target in Figure 1C.
If stimulus context affects the responses of cortical neurons, we
should expect that both the origin and target tuning functions would
differ from the IPD tuning function of the neuron as it is normally
measured
with tone pips separated by silence. We determined the static
tuning function of each neuron by considering only the responses that
occurred during the first period of each origin IPD, which followed 2 sec of silence.
Data analysis. To assess the magnitude of context effects on
the representation of IPD, we calculated the response-weighted best IPD
of each IPD tuning function. The firing rate associated with a
particular IPD value was treated as the length of a vector pointing in
the direction of that IPD. For each IPD function, the vectors
corresponding to the eight tested IPDs were summed to produce a
resultant vector whose direction was the best IPD. The length of this
vector was normalized by the sum of firing rates for all points on the
function, resulting in a measure of tuning sharpness analogous to
vector strength. A tuning sharpness of zero indicates equivalent
responses to all IPDs; a value of one results if only one tested IPD
elicits a response. Best IPD and tuning sharpness were also calculated
separately for different periods, epochs, and intervals (see Results).
The magnitude of the context-induced shift in IPD tuning was computed
as the absolute value of the difference between the best IPDs of the
origin versus target and sweep1 versus sweep2 IPD functions, subject to
the constraint that the value be <180°. The shift magnitude is
independent of differences in overall firing rates: for example,
doubling all firing rates during the target IPDs would not change the
measured shift magnitude (i.e., the best IPD of each function reflects only the direction of the resultant vector, not its length).
 |
RESULTS |
Summary of the data sample
The data described in this report were obtained as part of an
extensive physiological survey of auditory cortex. IPD sensitivity was
initially verified with binaural beat search stimuli in 176 neurons
(112 in X; 64 in Z) with best frequencies below 2.5 kHz, the
approximate upper limit for IPD-based lateralization performance in our
animals. The presentation of trapezoidal IPD stimuli was restricted to
the most stable recordings so that response changes attributable to
fluctuating recording conditions would not be mistaken for
context-dependent changes. Complete sets of responses to the full
sequence of trapezoidal IPD stimuli were obtained for 46 cells (38 from
X; 8 from Z). Some cells (n = 11) were tested at both
positive and negative initial modulation depths, resulting in 57 cases
(X: 45; Z: 12). The initial modulation direction did not impact the IPD
tuning shift magnitude during the sweeps (Wilcoxon, p = 0.39) or steady states (Wilcoxon, p = 0.99). Because
robust conditioning effects were evident in nearly all neurons from
both animals, we combined the data from the two subjects in the
analyses that follow. The trapezoidal IPD carrier frequencies ranged
from 100 to 1700 Hz (median: 700 Hz). Because high SPLs generally
produced better synchronized and more robust binaural beat responses,
trapezoidal IPD data were collected at moderate to high SPLs: the modal
SPL was 80 dB (27 of 57 cases), and 48 of 57 cases were obtained with SPLs from 60 to 80 dB. Ignoring the cases in which context effects were
not significant (see below), neither the carrier nor the SPL predicted
the magnitude of the context-induced IPD shift (Spearman's
,
p > 0.05).
Changes in stimulus context dramatically impact the cortical
representation of identical IPDs
The responses of a single cell to the full IPD stimulus ensemble
are shown as a set of modulation period histograms in Figure 2A, and again as IPD
tuning functions in Figure 2, B and C.
Examination of Figure 2 reveals that the firing rate associated with a
particular IPD value (e.g., 0°) varies substantially with context.
The origin and target IPD curves (Fig. 2C), based on
responses averaged over the full duration (1 sec) of identical
steady-state IPDs, are markedly out of registry because of changes in
the context in which those IPD values occur. Similarly, the sweep
functions shown in Figure 2B, which are based on IPDs
modulated through exactly the same range, are >90° out of phase with
one another. The rightward and leftward shifts of the sweep1 and sweep2
curves, respectively, are opposite the direction of motion, consistent
with previous reports of IC responses (Spitzer and Semple, 1993
; Wilson
and O'Neill, 1998
).

View larger version (25K):
[in this window]
[in a new window]
|
Figure 2.
Cortical responses to identical IPD values and
ranges are profoundly context dependent. A, Modulation
period histograms corresponding to each stimulus shown in Figure
1A. Numbers above each histogram
indicate the origin and target IPDs. The bottom right
histogram (45 to 45°) is the same as that shown in Figure
1C, and all data derive from the cell featured in Figure 1. The response to
45° differs substantially when it is paired with 135 and 45°,
for example. B, Responses during the sweeps in
A are replotted as IPD tuning functions. The response
rate averaged over each sweep is plotted at the IPD midpoint (e.g., the
pair of responses to 45° IPD are based on the sweeps from 0 to 90°
and 90 to 0°, indicated by arrows). The static IPD
tuning function, based on responses during only the first origin
period, is shown as a dotted line. The dashed
line in gray indicates the discharge rate
averaged over all interstimulus intervals. Polar representations of the
same data appear as insets. If cortical neurons were
indifferent to context, the curves in each panel would be identical.
Best IPDs for sweep1 (gray) and
sweep2 (black) differ by 103.5°.
C, Responses during the steady states in
A are replotted as IPD tuning functions, using similar
conventions. Best IPDs for the origin and target functions differ by
49.5°. Discharge rates elicited by a particular IPD (e.g., 45°)
typically differ most for IPDs on the slopes of the static tuning
function.
|
|
If we consider the fidelity of the rate representation of particular
IPDs, it is evident that IPDs eliciting robust responses when
paired with one IPD could suppress the response below the spontaneous rate when paired with another (Fig. 2C, 0°
and
135°). For IPDs associated with the slopes of the static
tuning function, the difference in the firing rates elicited by IPD
excursions through identical ranges (Fig. 2A,
90
to ± 180° and
45 to 45°) can exceed the range of
firing rates used to signal changes in IPD on the static tuning function.
The impact of stimulus context on the cortical representation of IPD
can be quantified in two complementary ways
as a change in the firing
rate associated with a particular IPD value and as a relative change in
the best IPDs (see Materials and Methods) of the origin and target
functions. To verify that such changes are genuine, it is necessary to
compare them against benchmarks for rate and tuning shifts based purely
on response variability. Our method for assessing the magnitude of
context-induced shifts in both rate and tuning for the population is
shown in Figure 3. Firing rates at each
IPD, and the best IPD, were calculated for each of the final three
origin and target periods in the trapezoidal stimulus (Fig.
3A). The initial stimulus period, because it is preceded by
silence, was excluded from this analysis. For each IPD, the absolute
differences in firing rates from different periods of the same IPD
function (Fig. 3B, bottom left) measure changes in rate attributable to response variability alone. Differences in the
firing rates associated with identical IPD values from different (i.e.,
origin versus target) IPD functions (Fig. 3B, bottom
right) were used to generate a complimentary set of six estimates
of context-induced rate shifts. The average of each set of six values
was then computed for each IPD point.

View larger version (43K):
[in this window]
[in a new window]
|
Figure 3.
Method for the statistical verification of
context-induced shifts in rate and IPD tuning. A, The
trapezoidal IPD stimulus set (in the case shown, the initial modulation
depth is positive). Labels above the steady-state
intervals indicate the responses used for the construction of origin
and target IPD functions (O2, O3,
T2, etc.) based on individual stimulus periods. The
first stimulus period was excluded from this analysis because it is
preceded by silence. B, Period-based origin
(gray) and target (black) IPD
tuning functions derived from real data from a single cell.
Context-induced changes in the representation of IPD can be calculated
in two complementary ways: as a shift in the response elicited by each
IPD (e.g., rate shift at 0°) and as a shift in the best IPD of the
entire IPD tuning function (see Materials and Methods). Differences
between IPD tuning curves from the same interval (e.g., the origin
interval: O2, O3, and
O4) reflect period-by-period variability in the
response of the cell. Differences between IPD curves from origin and
target intervals (e.g., O2 versus T3,
etc.) reflect context-dependent changes in the responses to identical
IPD values. C, Sets of estimates (6) of variability
across periods and context-dependent shifts in rate and tuning used to
confirm the validity of putative conditioning effects in individual
cells. An ANOVA (p < 0.05) was performed to
confirm that context-dependent IPD tuning shifts significantly exceeded
shifts in tuning caused by response drift. For the data shown in
B, these values are 73.6, 78.5, 79.5, 78.0, 74.4, and
72.9° for the context-induced shifts (e.g., O2 versus
T3, etc.), and 1.5, 5.6, 4.1, 4.4, 0.8, and 5.2° for
shifts caused by period-by-period variability (e.g., O2
versus O3, etc.).
|
|
The distribution of average firing rate shifts calculated from origin
and target curves normalized to their respective maxima are plotted in
Figure 4A. Most points
lie below the diagonal because context-induced shifts in the rate
representation of IPD generally surpass the changes in rate caused
solely by response variability. It is important to note, however, that
some of the context-induced shifts are small, because the origin and
target curves, although shifted, intersect at two points. Intersections
near one of the eight sampled IPDs (Fig. 2C, 90 and
90°)
produce small values of rate shifts. For similar reasons, the magnitude
of the rate shifts for particular IPDs vary widely across the IPD
range. Differences between the origin and target (or sweep1 and sweep2)
curves tend to be maximal at or near the most steeply sloping
portions of the static tuning function and minimal at the best and
worst static IPDs (Fig. 2). Thus, not only are context-induced shifts
significantly larger (Wilcoxon, p < 0.0001; mean = 0.44) than the typical period-to-period changes in responsiveness
(mean = 0.12), but the context-induced shifts exhibit greater
variance across IPD (Wilcoxon, p < 0.0001).

View larger version (24K):
[in this window]
[in a new window]
|
Figure 4.
Population distributions of shifts in
rate (normalized) and IPD tuning (degrees). A, Each
point represents the average of the six estimates (Fig.
3C) of context-induced and variability-derived rate
shifts. Each case (n = 57) contributes eight points
to the graph, one for each of the eight IPD values comprising the full
stimulus set. To allow for the direct comparison of cells with widely
varying overall discharge rates, the origin and target tuning curves
were first normalized to their respective maxima. Each of the six
estimates represents the absolute value of the normalized difference in
firing rate at each IPD. Significance was assessed on a case by case
basis by the tuning shift test described in the legend of Figure
3C. Points falling below the diagonal represent large
context-induced shifts in the discharge rate representation of
particular IPDs than would be predicted by period-to-period
variability. B, Each point (n = 57)
plotted on this graph represents the average of the six estimates of
context-induced (abscissa) and variability-derived
(ordinate) IPD tuning shifts for the steady-state
(origin and target) tuning curves.
|
|
Although analysis of context effects in terms of rate shifts bears most
directly on the fidelity of the mapping of response rates to particular
IPD values, analysis of context effects in terms of IPD tuning shifts
better captures the orderliness of the shifts in rate because the signs
of rate differences are not discarded. Using the best IPDs calculated
for origin and target intervals in the second to fourth stimulus
periods, six estimates of tuning "shifts" attributable to
variability alone (Fig. 3C, bottom left) were
compared (one-way ANOVA) to the six estimates of context-induced shifts
(Fig. 3C, bottom right). Of the 57 cases, 53 were
significant (p < 0.05), with most cases
significant at the p < 0.01 (50 of 57) and
p < 0.0001 (41 of 57) levels. The means of each set of
estimates are plotted against one another in Figure
4B.
The mean tuning shift for significant cases was 59.8°, whereas the
mean tuning shift attributable to response variability was 10.9°. In
every case in which the response variability was sufficiently low
(<25°), the context-induced shift was significant. Nonsignificant
cases were not associated with smaller shifts (Wilcoxon, p = 0.34) but rather with greater variability
(Wilcoxon, p = 0.0026). On this basis, they were
excluded from the remainder of the data analysis, unless noted
otherwise. In other words, we never encountered a cortical neuron that
maintained an invariant mapping of response rate to IPD as context
varied. In fact, the typical range in discharge rate spanned by
responses to the same IPD as context varied was nearly half (0.44) the
dynamic range available to signal changes in IPD on either function
(origin or target).
IPD tuning shifts are independent of the overall strength and sign
of the neuronal response
Although cortical neurons were uniform in their sensitivity to
context, the character of their responses to trapezoidal IPD varied
widely. The responses depicted in Figure 2 were facilitated at
favorable IPDs, and suppressed at unfavorable IPDs, relative to the
spontaneous rate. We estimated the spontaneous rate by calculating the
average firing rate over all 2 sec interstimulus intervals. Although
this average provides an adequate estimate of the spontaneous rate, for
some cells the firing rate during the interstimulus intervals reflected
the history of stimulus-evoked activity and could be more properly
termed an afterdischarge because it follows the cessation of
stimulus-evoked activity. These relationships are analyzed in detail in
a subsequent section.
Suppression below the spontaneous rate (for at least two contiguous
IPDs) contributed to the IPD tuning of 31 of 46 neurons, consistent
with a prominent role for inhibition in cortical IPD processing.
Responses depicted in Figure 5 represent
the highest stimulus-driven (A-C) and
spontaneous (D-F) firing rates in our sample. The responses shown in Figure 5A-C
exceeded the spontaneous rate for all IPDs, with favorable IPDs
producing sustained firing rates in excess of 200 Hz. At the other
extreme, reliable static IPD tuning and robust context-induced tuning
shifts could also be based entirely on the differential suppression of
vigorous spontaneous activity (Fig.
5D-F). The mean tuning curve shift for
the population, using all periods of the stimulus, was 55° (median:
49.5°) during the steady states and 75.5° (median: 82°) during
the sweeps. The magnitudes of IPD tuning shifts did not depend on the
firing rate averaged over either the duration of the stimuli in the
ensemble (Spearman's
=
0.17; p = 0.23) or the interstimulus intervals (
=
0.05; p = 0.73). Thus, the redistribution of spikes that accounts for the shifts
in IPD tuning (see below) appears to occur independently of the
absolute number of spikes being fired, and the sign of the elicited
response (i.e., excitatory or suppressive).

View larger version (34K):
[in this window]
[in a new window]
|
Figure 5.
Dramatic context-sensitivity was apparent in cells
with widely varying firing rates and response characteristics. Shown
here are the responses of two cells representing the highest driven
(A-C) and spontaneous
(D-F) firing rates observed in the sample
population. Graphing conventions are similar to those in Figure 2. The
responses depicted in B and C exceed the
average afterdischarge rate (gray dashed line) at
all IPDs, whereas the responses in E and
F are powerfully suppressed at all IPDs. Nevertheless,
both cells are clearly tuned to IPD and sensitive to the context in
which particular IPDs occur. Best IPDs for the sweeps are shifted by
82° (B) and 75.7° (E);
best IPDs for the steady states shift by 60.6°
(C) and 46.3° (F).
|
|
Context-induced shifts in rate and tuning reflect increases in
temporal rate contrast
The foregoing analysis shows that particular discharge rates do
not encode particular IPDs (or any other stimulus parameters defining
these stimuli, such as frequency and level) unambiguously across
contexts. The cortical IPD representation appears to be optimized to
signal changes of IPD with changes in firing rate. Shifts in IPD tuning
emerge as a consequence of a process operating at the level of
individual stimuli: as the stimulus modulates between the pair of IPDs
comprising each stimulus, the change in discharge rate
the temporal
rate contrast of the ongoing IPD representation
is enhanced relative
to the rate contrast associated with the same IPDs in the static tuning
function. Because the stimuli are arranged in an ordered set, the
result is an orderly shift of the origin function relative to the
target function.
We define the discharge rate contrast as the difference in firing rate
that presumably signals a difference between two stimuli, such as a
90° difference in IPD. We estimate the static discharge rate contrast
by calculating the absolute value of the rate difference for each pair
of points separated by 90° on the normalized static tuning function,
i.e., all the points corresponding to origin-target pairs in the
stimulus set (e.g., 0 and 90°, 45 and 135°, etc.). We calculate the
dynamic discharge rate contrast by taking the absolute value of the
difference in firing rate for the same origin-target pairs on the
normalized origin and target curves (e.g., 0° on the origin curve and
90° on the target curve). The dynamic rate contrast (population mean:
0.5) is significantly greater (Wilcoxon, p < 0.0001)
than the static rate contrast (0.35). The concentration of points below
the diagonal on Figure 6 is evidence of
the enhanced discharge rate contrast for the dynamic stimuli. This is
equivalent to a transient steepening of the tuning function of the
neuron between each pair of IPDs as they recur during individual
stimuli in the ensemble.

View larger version (36K):
[in this window]
[in a new window]
|
Figure 6.
Context-induced shifts in IPD tuning reflect
enhanced contrast in the discharge rate representation of different
IPDs occurring dynamically. The coordinates of all points in the figure
reflect firing rate differences for IPDs separated by 90°. Normalized
rate contrast at each IPD was computed as the absolute value of the
firing rate difference taken from origin and target curves normalized
to their respective maxima. Static discharge rate contrast was
calculated as the difference in the responses to IPDs separated by
90° on the static tuning function (e.g., 0 and 90°). Dynamic
discharge rate contrast was calculated as the difference in the
responses to the same IPDs as the stimulus modulated between them: the
response to 0° averaged over the second to fourth
origin periods and the response to 90° averaged over
the second to fourth target periods. Values falling
below the diagonal indicate that repetitive modulation enhanced the
firing rate difference associated with an IPD difference of 90°
relative to the same difference (90°) for the same IPD values on the
static tuning function.
|
|
The population average of tuning sharpness for the static function
(0.37) was also significantly lower (Wilcoxon, p = 0.0101) than the tuning sharpness measured in the remaining origin
periods (0.46). Note that both the static tuning sharpness and
normalized rate contrast would be reduced if the significant elevation
of firing rates during the first origin period (vs first target period: Wilcoxon, p = 0.0108; vs second origin period:
p = 0.0002) were independent of IPD. On the other hand,
the addition of a constant onset response across IPD would not affect
the static rate contrast expressed as raw firing rate differences,
which are also significantly (Wilcoxon, p < 0.0001)
smaller than the differences for the same IPDs occurring dynamically.
This suggests that in the cortex, the cost of further degrading the
fidelity of the mapping of instantaneous IPD to discharge rate is
outweighed by the benefits of enhancing the discharge rate
representation of changes in IPD.
Shifts in the rate representation of particular IPDs can be
predicted by stimulus/discharge history
As has been noted (Joris and Yin, 1992
), numerous authors have
"drawn attention to the significance of adaptation as a ubiquitous sensory mechanism to enhance temporal contrast." For example, Wilson
and O'Neill, (1998)
have argued that shifts in the receptive fields of
IC neurons in the unanesthetized mustached bat reflect "spatial
masking," whereby responses to previous stimuli decrease the
responsiveness of a cell in proportion to the level of prior activity" [see also McAlpine et al. (2000)
]. Simply put, the
response to a particular IPD preceded by a stimulus eliciting a weak
response should exceed the response to the same IPD preceded by a
stronger response. In Figure
7A, differences in responses
during oppositely directed sweeps through the same IPD range are
plotted against the differences in the rates for the steady-state
intervals preceding them. Because the weaker sweep response generally
follows the stronger steady-state response, and vice versa, the slope
of the line of fit is negative, as predicted. Similarly, when the
firing rates in the origin and target intervals are equivalent, the
responses averaged over the sweeps are likewise equivalent, producing
an intercept very near zero (0.68 Hz). The strong negative correlation (
=
0.78; p < 0.0001) between the
steady-state and sweep response differences remains when differences
are calculated from the normalized sweep and steady-state tuning
functions (
=
0.74; p < 0.0001) (Fig. 7,
insets). This finding is consistent with the
independence of absolute firing rate and tuning shift magnitude
exemplified in Figure 5.

View larger version (32K):
[in this window]
[in a new window]
|
Figure 7.
Shifts in the responses to identical IPD values
and ranges can be predicted by responses to stimulus events in the
recent past. A, The difference in the responses to
oppositely directed sweeps traversing equivalent ranges of IPD
(sweep1 sweep2) is inversely
correlated with the difference in the responses to the preceding steady
state (origin target), such that
responses to the sweep after the less effective steady state are
stronger, and vice versa. Results of a similar analysis based on
differences calculated for IPD tuning functions (sweep1,
sweep2, origin, and
target) normalized to their respective maxima are shown
in the inset. B, Differences in the
responses to identical IPDs (1-2 on
stimulus icon) in different contexts are inversely correlated with
differences in the responses to the modulation "partner" IPDs
(partner1-partner2).
A similar analysis conducted on normalized firing rates is shown as an
inset.
|
|
Examination of the histograms in Figure 2 reveals that in cases where
the response to a particular IPD (e.g., 0°) varies substantially across contexts, the responses to its partner IPDs (i.e.,
90 versus
90°), the "contexts" for that IPD, also tended to vary substantially. In Figure 7B, the difference in the responses
to the same IPD (e.g., 0° as an origin vs 0° as a target) is
plotted against the difference in the responses to its partners. The
correlation in this case (
=
0.70; p < 0.0001) is nearly as strong as that relating the sweeps to the
preceding steady states, despite the fact that each IPD is separated
from its partner by the duration of the interposed sweep. This suggests
that the continuously modulated components of the stimulus are not
necessary for conditioning to occur [see also Malone and Semple
(2001)
] and that contextual influences can persist for some time; the
strong negative correlation between steady-state IPDs and their
partners indicates that at least one of the mechanisms controlling the
gain of cortical responses operates on a time scale on the order of the
interposed sweeps (250 msec). If this mechanism operated more quickly,
then an enhanced sweep response, conditioned by a weak response to the
preceding steady-state IPD, would curtail the enhancement of the
steady-state response after the enhanced sweep.
Although stimulus events in the recent past shape cortical responses to
time-varying IPDs, "recent" events can apparently be
integrated over a fairly long interval. Sanes et al. (1998)
demonstrated previously that conditioning effects could sometimes affect the "spontaneous" firing rates of IC neurons. In the
current study, afterdischarge rates provide an opportunity to examine the length of the windows that define recent history in the cortex. The
underlying positive correlation in the raw stimulus-driven and
spontaneous firing rates (
=0.24; p < 0.0001), which
reflects differences in absolute firing rates across cells, was removed by normalizing all firing rates to the peaks of their respective IPD
tuning functions. The response during the final target period does not
predict (
=
0.05; p = 0.28) the
afterdischarge rate across IPD, despite the fact that it is separated
from the interstimulus interval only by the return sweep. The firing
rate averaged over the full duration (10 sec) of each stimulus,
however, was inversely and significantly correlated with the
afterdischarge rates across IPD (
=
0.19; p < 0.0001). This correlation is evidence for a gain control process set
by firing rates averaged over intervals on the order of a full
modulation period (2.5 sec), because the changes in afterdischarge rate
evidently reflect the stimulus-driven rate averaged over both the
origin and target periods and the sweeps in between them. Note that
this sensitivity to stimulus history over long intervals (i.e., a
strong negative correlation between the stimulus-driven rate and
the afterdischarge) is not predictive of context effects at timescales
relevant to the sweep shift magnitudes (
=
0.03;
p = 0.82) and is marginally associated (
=
0.28; p = 0.0434) with smaller rather than larger
steady-state tuning shifts. This suggests that adaptive mechanisms
contributing to cortical sensitivity to stimulus history on
different timescales may operate independently.
Contextual influences wax across stimulus periods and wane within
steady-state intervals
The increase in discharge rate contrast for the modulated stimuli
suggests that a dynamic equilibrium in the rate representation of each
pair of IPDs is achieved as the stimulus modulates between them. As
evident in Figures 1A and 5, A and
D, cortical responses to trapezoidal IPD stimuli recur
robustly from period to period. Thus, the firing rates (for all cases
and all IPDs) are equivalent (Kruskal-Wallis, p = 0.99) for all origin/target periods excluding the first. Nevertheless,
progressive changes in IPD tuning reflect the redistribution of spikes
across origin and target intervals described in the previous section.
Tuning shift magnitudes progressively and significantly
(p < 0.01) increase across each period
(population means, periods 1-4: 40.6, 55.6, 61.2, and 65.5°). This
evolution of tuning properties reflects the progressive enhancement of
the discharge rate contrast representing IPD changes of 90° as they recur during each trapezoidal stimulus.
Although changes in IPD regenerate the responses of cortical neurons
across stimulus periods, firing rates decay substantially across
steady-state epochs (Fig. 8), when
the IPD does not change. Thus, the average ratio of the fourth/first
period firing rates (0.97) was significantly larger (indicating a
lesser decay in firing rate; Wilcoxon, p = 0.0002) than
the average ratio of fourth/first epoch rates (0.71; ratios were taken
with origin and targets separate and then these two values were
averaged). Firing rates averaged over all stimuli and all cases decay
significantly (Wilcoxon, p < 0.0001) from the first
epoch (25.4 Hz) to the second (19.8 Hz) but do not differ thereafter
(Kruskal-Wallis, p = 0.48). This is a measure of
adaptation as it is most commonly defined: a progressive decrease in
the response elicited by a constant stimulus. Nevertheless, the effects
of stimulus history are not simply adaptation, because responses
conditioned by changes in stimulus context "adapt" in an
IPD-dependent manner.

View larger version (28K):
[in this window]
[in a new window]
|
Figure 8.
Stimulus-specific adaptation results in the
progressive convergence of context-shifted responses on the static best
IPD. A, Origin (gray) and target
(black) tuning functions calculated for individual
epochs (250 msec) become more similar in later epochs. IPD
tuning shift magnitudes are shown to the right of each
pair of curves. The static best IPD is indicated on each
panel by the dashed vertical line.
B, The curves shown in A
are replotted separately to help illustrate the more rapid decay of
context-shifted responses on the origin and target functions (target:
45 and 0°; origin: 135°) relative to responses to IPDs proximal
to the static best IPD. C, Origin and target curves from
an additional cell showing a similar progressive convergence on the
static best IPD across epochs. D, Responses to the
sampled IPD ( 45°) nearest the static best IPD remain relatively
high, whereas the enhanced responses most responsible for the shift in
the best IPDs for the origin (0 and 45°) and target ( 90 and
135°) functions exhibit pronounced adaptation.
|
|
Shift magnitudes for the population decrease significantly from the
first to second epochs (67 to 55.2°; Wilcoxon, p = 0.0025) and again from the second to third epochs (55.2 to 45.9°;
p = 0.0220), remaining stable thereafter (45.9 vs
48.2°; p = 0.67). It is noteworthy that the decay in
the shift magnitudes from the second to third epochs occurs in the
absence of significant firing rate decay. In addition, the decay in
firing rate from the first to the fourth epochs, averaged across IPD,
is not related to the decay in shift magnitude from the first to fourth
epoch (mean: 0.73;
= 0.07; p = 0.61) on a
cell-by-cell basis.
These apparently paradoxical results are explained by the fact that the
largest epoch-by-epoch changes in firing rate occur for those IPDs
associated with the largest context-dependent response changes. The
epoch-by-epoch evolution of IPD tuning shift magnitudes is depicted for
a single cell in Figure 8, A and B. For example, the rate shifts in the representation of
135 and 0° in Figure 8A diminish substantially from the first to the
fourth epoch because of the accelerated decay in the enhanced responses
at those IPDs (i.e., the origin response at
135° and the target
response at 0° in Fig. 8A). The progressive
convergence of both the origin and target curves on the static best IPD
is more clearly evident when the origin and target curves in Figure
8A are replotted in Figure 8B.
Here, it is apparent that the responses to the sampled IPD nearest the
static best IPD decay less than the enhanced responses at the peaks of
the first-epoch tuning functions (black line, epoch
1). It was also possible for suppressed responses to
recover across epochs, as shown on the target curve (
135°) in
Figure 8B. Responses of a different cell showing
similar properties are shown in Figure 8, C and
D.
These findings suggest that although it is possible for cortical
neurons to display profound shifts in their tuning to IPD, the
distribution of inputs that give rise to the static tuning function
anchor the conditioned changes in rate that may occur at various IPDs.
Responses to the static best IPD are never substantially diminished,
nor are responses to the static worst IPD substantially enhanced.
Responses to IPDs on the slopes of the static tuning function depend
critically on recent stimulus history, but with time, they converge on
their statically defined norms at a rate at least partially determined
by their difference from those norms.
Implications for the processing of auditory motion
In previous sections, contextual influences on the firing rate
representation of IPD were treated as specific instances of a general
sensitivity to stimulus history. Below, we examine cortical responses
during the modulation of IPD in more detail and emphasize the
relationship of IPD processing to the localization of sound sources in
azimuth. A polar plot indicating the static best IPD of each neuron
(n = 46) in the sample appears in Figure
9A. Carrier frequency is
indicated by the radial distance of each point from the origin. The
shaded area delineates IPD values that
could not occur under normal listening conditions because the
ITDs required for their generation exceed the maximum ITD
allowed by the separation of the ears. This area was calculated for the
head radius of the larger of our animals (monkey Z) using Kuhn's
(1987) low-frequency model of ITDs produced by changes in
azimuthal angle (
): ITD = 3(r/c)sin (
),
where r is the head radius and c is the speed of
sound. Although this model was developed for humans, it has been shown
to provide a good fit to ITDs generated by the azimuthal displacement
of free-field sound sources in the rhesus monkey (Spezio et al.,
2000
).

View larger version (28K):
[in this window]
[in a new window]
|
Figure 9.
Static best IPDs and shifts in IPD tuning can also
be expressed in terms of ITDs and equivalent azimuthal angles.
A, Distribution of static best IPDs for each cell
(n = 46) in the population (only data from cases
with negative initial modulation depths are shown for cells where both
cases were available). The carrier frequency at which the static best
IPD was determined is indicated by the radial distance of each point
from the origin. Points falling within the shaded
area correspond to ITDs greater than could be produced by an
azimuthal displacement of a sound source 90° away from midline at that carrier frequency,
given the head radius measured for the larger of the two animals used
in this study. Such IPDs are described as falling outside the
"ecological range." B, Larger differences in ITD are
required to produce an equivalent IPD at lower carrier frequencies. For
a 500 Hz carrier, an IPD of 90° is produced by a 500 µsec delay of
the signal to one ear, but a similar IPD at 1000 Hz would be produced
by an ITD of only 250 µsec. C, Tuning shifts in IPD
for the cells shown in A are recalculated as shifts in
ITD and equivalent azimuth and plotted against the static best ITDs. As
in A, shaded regions indicate values
exceeding the ecological range. Three extreme points ( 1890.6, 731.4;
3203.2, 917.5; 3304.7, 2360.6) based on cells with very low best
frequencies (200, 100, and 100 Hz, respectively) have been
omitted to avoid a drastic rescaling of the axes.
|
|
As expected, the preferred IPDs of most cells corresponded to azimuthal
locations contralateral to the recording site. The mean static best IPD
of the population was
82.5°, and the mean tuning sharpness was
0.44. In small-headed mammals such as the gerbil, IPD tuning functions
typically peak outside the ecological range. It is not surprising that
a larger proportion of macaque cortical neurons should have mean IPDs
in the ecological range because the monkey's larger head size means
that for a given carrier frequency, a greater proportion of the 360°
IPD axis falls within that range. Nevertheless, cortical responses to
IPD were indifferent to the ecological plausibility of such stimuli, as
evidenced by the independence of tuning shift magnitude and static best
IPD (test of linear-circular association; sweeps: p = 0.33; steady-states: p = 0.45).
Psychophysical studies of auditory motion processing suggest that the
effects of motion should also scale with speed (Perrott and Musicant,
1977
, 1981
; Mateeff and Hohnsbein, 1988
). If the effects that we
observed depended on the rate of simulated azimuthal motion, then cells
with the lowest best frequencies would show the largest shifts: for
progressively lower frequencies, a given change in IPD represents a
greater change in ITD (Fig. 9B) and azimuthal angle.
Nonetheless, the magnitude of tuning shifts, expressed in IPD, were
consistent across cells with widely varying best frequencies: carrier
frequency was not predictive of the magnitude of the tuning shifts
observed for the sweeps (
=
0.13; p = 0.37).
Tuning shift magnitudes for all significant cases (n = 53) were converted to ITDs and are shown in Figure 9C. In 14 of 46 cells, shifts are larger than the maximum ecologically plausible ITD (~398 µsec; indicated by the shading on Fig.
9C). All of these cases involved carrier frequencies below
600 Hz. Conversely, the cells preferring small contralateral leads and
exhibiting small ITD shifts were nearly all tuned to frequencies >1000
Hz. Although many of the individual values comprising the shifted IPD
tuning functions are ecologically implausible, it is possible to
express the magnitude of tuning shifts less than twice the maximum ITD (i.e.,
800 µsec, or
90 to 90° in azimuthal angle) as shifts in
azimuthal tuning from 0 to 180° (Fig. 9C). If we consider
only one case per cell, the 43 (of 46) cells falling within this range have a mean azimuthal shift of 64.4° during the sweeps themselves.
Given the magnitude of these shifts, one should consider the
possibility that they reflect a special sensitivity to auditory motion
in the cortex. In the current study, as in others (Spitzer and Semple,
1991
, 1993
; Wilson and O'Neill, 1998
), the shifts in tuning measured
during the sweeps were uniformly opposite the direction of motion:
responses were enhanced by motion toward the static best IPD and
suppressed by motion directed away from it (Fig. 2B).
Genuine sensitivity to motion direction, however, should be expressed
as selectivity for either clockwise or counterclockwise motion in real
space (Wagner and Takahashi, 1992
; Wagner et al., 1994
), which would
appear as a change in the overall gain of responses to sweeps in
opposite directions (i.e., the height of the sweep1 versus sweep2
curves in Figs. 2B,
5B,E). In Figure
10A, the response to
sweep1, averaged across IPD, is plotted against the averaged response
to sweep2 for each stimulus set. The strong linear relationship indicates a lack of selectivity for motion direction assessed throughout the full range of IPD.

View larger version (24K):
[in this window]
[in a new window]
|
Figure 10.
Cortical neurons that are not selective
for the direction of IPD modulation may still be selective for the
direction of simulated azimuthal motion. A, Selectivity
for motion direction per se would result in an overall gain change in
responses to IPDs swept in the preferred direction. Responses to
sweep1 were averaged over all (8) IPD ranges and
compared with the average of the responses to sweep2.
The line of fit to these data is shown. The strong linear relationship
indicates that when responses throughout the full range of IPDs are
averaged, cortical neurons show no preference for the direction of IPD
modulation (i.e., relative shifts of the sweep1 and sweep2 curves occur
without changes in the overall gain of either function). The
filled circle indicates the cell showing the largest
consistent direction preference (Fig. 11B).
Although it is sometimes convenient to consider a given direction of
IPD modulation as equivalent to motion in real space (e.g., a relative
phase advance at the left ear simulates counterclockwise azimuthal
motion), it is not possible to assign a direction of motion, in
azimuth, to IPDs sweeps outside the ecological IPD range (Fig. 9).
B, Across IPD, the peak of the population mean IPD MDSI
occurs at the midline. The MDSI [(clockwise counterclockwise)/(clockwise + counterclockwise)] was calculated for
all IPD ranges (e.g., 0° represents sweeps in the range from 45 to
45° IPD) and all cells. A value of ±0.33 represents a response to
the preferred direction of modulation twice that in the nonpreferred
direction. Each point indicates the mean MDSI for the
population at each IPD range. Vertical error bars indicate SEM.
By convention, positive MDSIs represent clockwise motion, which implies
motion toward contralateral space for sweeps near the midline.
|
|
It is important to realize, however, that had the stimuli in this study
been limited to the ecological range of IPDs, most cells would have
been classified as selective for motion direction. In Figure
5B, for example, the responses to sweep1 (clockwise, in this
case) consistently exceeded the responses to sweep2 throughout the
ecological range (approximately
57 to 57° for a 400 Hz
carrier). If we consider the responses to sweeps about the
midline, and limit the analysis to cells with carriers where the range
of these sweeps (
45 to 45° in IPD) is ecologically plausible, then
25 of 37 cells would be considered motion-direction selective by a
conventional criterion (Poirier et al., 1997
): the response to the
sweep in one direction was more than twice the response to the sweep in
the other. The mean motion direction selectivity index (MDSI) [the
absolute value of (sweep1
sweep2)/(sweep1 + sweep2)], where a
value of 0.33 represents a doubling/halving of the response in one
direction relative to the other, was 0.41.
Although sensitivity to "motion" defined with respect to the static
IPD peak reflects a general sensitivity to stimulus history, this
phenomenon need not be a specialization for auditory motion processing
to impact the processing of auditory motion. Alternatively, a
specialization for motion direction selectivity could be expressed in
the distribution of static tuning properties of the cortical population. For example, the distribution of static best IPDs could be
skewed to support the exploitation of this general mechanism for the
specific purpose of motion-direction selectivity by concentrating context-dependent effects promoting motion-direction selectivity about
the midline, where spatial acuity is maximal (Domnitz and Colburn,
1977
). If we retain the sign of the MDSI, such that positive values
indicate a preference for sounds moving clockwise in azimuth, the
population average MDSI across IPD is a roughly sinusoidal function
with a maximum at 0° IPD (Fig. 10B). In other
words, auditory cortical neurons of the left hemisphere are most
strongly selective for clockwise motion when it occurs about the
midline. Near the midline, clockwise motion is motion toward
contralateral space
toward the location of the mean static best IPD
(
82.5°).
In addition to selectivity for motion direction, it is also
possible that cortical neurons are particularly sensitive to acoustic motion simulated by IPD modulation. For example, the highest response rates were generally observed during the sweep in the preferred direction for each origin-target IPD pair (Fig. 2A).
The responses of this cell during only the sweeps of the trapezoidal
stimuli are shown in Figure
11A. Not only are the
responses to the clockwise ("leftward") sweep stronger than
responses to the counterclockwise sweep through most of the
ecologically plausible range, the responses to the sweeps occupy a much
larger dynamic range than do the responses comprising the static tuning
function. An extreme example of this phenomenon is shown in Figure
11B. This cell was powerfully direction selective
throughout the entirety of the ecologically plausible range. In fact,
this cell was uniquely direction selective throughout the full (360°)
range of IPD (Fig. 10A,
). More striking, however, is the vastly expanded dynamic range available to the neuron during the
sweeps. For example, responses on the static IPD function are
relatively flat from 0 to
90°, but over the same IPD range, the
response to the sweep rises and falls quite steeply, spanning a dynamic
range of nearly 100 Hz. Thus, the previously described increase in rate
contrast for steady-state IPDs occurring in a dynamic context is even
more apparent during the dynamic components of each stimulus.

View larger version (81K):
[in this window]
[in a new window]
|
Figure 11.
Cortical neurons are often selective
for motion direction within the ecological range, and some neurons are
particularly sensitive to auditory motion itself. A,
Responses during the sweep components of the trapezoidal stimuli for
the cell featured in Figure 2. Each point is based on
the average rate calculated in 25 msec intervals (i.e., a bin of the
modulation period histograms in Fig. 2A), which
corresponds to an IPD range of 9°. The static tuning function is
shown as a dashed line, and IPDs outside the ecological
range are shaded. Responses to the sweeps, relative to
the static tuning function, are shifted opposite the direction of
motion, such that they lead responses to the same IPDs predicted from
the static tuning curve. Note that the effects of response latency,
which cause the response to lag the stimulus that elicited it, result
in an underestimation of these effects. A latency of 25 msec, for
example, would result in a 9° underestimation of the phase lead of
sweep responses relative to the static tuning function.
B, Responses of a cell showing both selectivity to
motion direction focused in the ecological range and sensitivity to IPD
modulation itself. The dynamic range of responses to individual sweeps
(e.g., 0 to 90°) vastly exceeded the dynamic range of the static
tuning function itself. This cell responded relatively weakly to
steady-state IPDs but quite strongly to changes of IPD throughout and
slightly beyond the ecological range of IPD. For both cells, the sweeps
between 135 and 135° have been removed for clarity.
|
|
The foregoing observation suggests that there are central adaptive
mechanisms that regulate the gain of cortical responses on a relatively
short (tens of milliseconds) timescale. The contribution of such
mechanisms may explain the fact that shifts in IPD tuning were
significantly larger (Wilcoxon, p < 0.0001) during the
sweeps than the steady states. This increase in shift magnitude cannot be explained by larger firing rate differences between the
steady-states preceding the sweeps (which differ by 90°) than between
the steady-state IPD partners (which differ by 180°), because the
distributions of those differences are statistically indistinguishable
(Wilcoxon, p = 0.41). Shift magnitudes for sweeps are
not correlated with shift magnitudes for steady states (
= 0.07; p = 0.62). Shift magnitudes for the sweeps also
appear to depend more strongly on the adaptive mechanisms that
determine the decay in firing rate by epoch during steady-state
intervals. The ratio of the fourth/first epoch firing rates was
inversely correlated with the tuning shift magnitudes for the sweeps
(
=
0.38; p = 0.0048), such that cells
showing the largest decay in firing rate for a constant stimulus
exhibited the largest shifts when the stimulus was changing. This same
ratio was unrelated to steady-state shift magnitudes (
=
0.06; p = 0.68). These findings further support the
contention that multiple adaptive mechanisms, operating independently and at different timescales, subserve sensitivity to stimuli in the
recent past, at varying degrees of recency.
 |
DISCUSSION |
Context-induced changes in the cortical representation of IPD
signals are ubiquitous and substantial. All neurons showed clear evidence of conditioning, and as context varied, the range of responses
to identical values or equivalent ranges of IPD could surpass the
dynamic range of the static IPD tuning function itself. The average
shift in cortical IPD tuning was more than twice as large as the
typical shift obtained in the IC of anesthetized gerbils with identical
stimuli (Miko et al., 1999
; Malone and Semple, 2000
). What proportion
of the increased prevalence and magnitude of cortical conditioning
reflects the change in structure (cortex vs IC), species (macaque vs
gerbil), or state (awake vs anesthetized)? Does the character of
cortical conditioning reflect patterns of connectivity unique to
cortex, biophysical mechanisms peculiar to cortical neurons, or simply
the ordinal position of cortex in the auditory pathway? Although the
elaboration of context sensitivity likely continues beyond the auditory
midbrain, more important than the locus of this elaboration is the
implication of our basic finding: in the cortex of awake primates, an
invariant mapping of IPD to discharge rate is sacrificed to a coding
strategy that prioritizes stimulus changes over actual stimulus values.
The enhanced representation of stimulus change is at least
partially attributable to a sensitivity to recent stimulus history. McAlpine et al. (2000)
demonstrated that apparent motion sensitivity in
the IC is consistent with cells there being sensitive to their own
discharge history. We also found that discharge history was predictive
of the responses to oppositely directed sweeps and equivalent
steady-state IPDs occurring in different contexts. Analysis of
conditioning elicited by monaural frequency sweeps (Malone and Semple,
2001
), however, demonstrated that for many IC neurons, significant
differences in the responses to a common target stimulus occurred even
when the different origin stimuli preceding the target elicited
statistically equivalent responses. Sensitivity to discharge history
among the afferents of the recorded neuron could support
stimulus-specific conditioning. The demonstration that visual contrast
adaptation can be dissociated from the firing rate of the recorded cell
further suggests that adaptation can be controlled by "information
beyond the scope of the cell or its immediate signal pathway" (Bonds,
1991
).
In this and previous studies of conditioning effects based on simulated
auditory motion (Spitzer and Semple, 1991
, 1993
, 1995
, 1998
; Takahashi
and Keller, 1992
; Sanes et al., 1998
; Wilson and O'Neill, 1998![]()