Abstract
A major challenge for sensory processing in the brain is considering stimulus context, such as stimulus probability, which may be relevant for survival. Excitatory neurons in auditory cortex, for example, adapt to repetitive tones in a stimulus-specific manner without fully generalizing to a low-probability deviant tone (“oddball”) that breaks the preceding regularity. Whether such stimulus-specific adaptation (SSA) also prevails in inhibitory neurons and how it might relate to deviance detection remains elusive. We obtained whole-cell recordings from excitatory neurons and somatostatin- and parvalbumin-positive GABAergic interneurons in layer 2/3 of mouse auditory cortex and measured tone-evoked membrane potential responses. All cell types displayed SSA of fast (“early”) subthreshold and suprathreshold responses with oddball tones of a deviant frequency eliciting enlarged responses compared with adapted standards. SSA was especially strong when oddball frequency matched neuronal preference. In addition, we identified a slower “late” response component (200–400 ms after tone onset), most clearly in excitatory and parvalbumin-positive neurons, which also displayed SSA. For excitatory neurons, this late component reflected genuine deviance detection. Moreover, intracellular blockade of NMDA receptors reduced early and late responses in excitatory but not parvalbumin-positive neurons. The late component in excitatory neurons thus shares time course, deviance detection, and pharmacological features with the deviant-evoked event-related potential known as mismatch negativity (MMN) and provides a potential link between neuronal SSA and MMN. In summary, our results suggest a two-phase cortical activation upon oddball stimulation, with oddball tones first reactivating the adapted auditory cortex circuitry and subsequently triggering delayed reverberating network activity.
SIGNIFICANCE STATEMENT Understanding how the brain encodes sensory context in addition to stimulus feature has been a main focus in neuroscience. Using in vivo targeted whole-cell recordings from excitatory and inhibitory neurons of mouse primary auditory cortex, we report two temporally distinct components of membrane potential responses encoding oddball tones that break stimulus regularity. Both components display stimulus-specific adaptation upon oddball paradigm stimulation in the three recorded cell types. The late response component, in particular, carries signatures of genuine deviance detection. In excitatory but not parvalbumin-positive inhibitory neurons, both early and late components depend on NMDA receptor-signaling. Our work proposes a potential neuronal substrate of a known deviant-evoked event-related potential, which is of fundamental significance in basic and clinical neuroscience.
Introduction
Information processing in the mammalian auditory system is hierarchically organized, with physical sound features extracted near the periphery and contextual features such as stimulus probability being increasingly integrated along the ascending auditory pathway. For example, in primary auditory cortex (A1), stimulus-specific adaptation (SSA) has been reported as the decrease in neuronal response to repeating “standard” tones, which does however not fully generalize to a rare and irregular “oddball” tone (Nelken, 2014; Pérez-González and Malmierca, 2014). In extracellular recording experiments, SSA occurring tens of milliseconds after tone onset has been identified in A1 of various mammalian species (Ulanovsky et al., 2003; von der Behrens et al., 2009; Taaseh et al., 2011; Fishman and Steinschneider, 2012), as well as subcortically in inferior colliculus (Malmierca et al., 2009; Duque et al., 2012) and auditory thalamus (Anderson et al., 2009). Similarly, a prominent electroencephalographic (EEG) signal known as “mismatch negativity” (MMN), typically peaking ∼100–250 ms after stimulus onset, is evoked by deviant tones in the human and animal brain. Functionally, MMN reflects a break in a preceding regularity, i.e., a change in a repetitive stimulus sequence, and thus represents genuine deviance detection (Näätänen et al., 2007; Grimm et al., 2011; Fishman and Steinschneider, 2012). Although SSA has been speculated to be a neuronal substrate of MMN (Ulanovsky et al., 2003; von der Behrens, 2009), their precise relationship remains elusive (Garrido et al., 2009; May and Tiitinen, 2010), largely because the cortical circuit mechanisms underlying SSA and MMN are incompletely understood and their different time courses have not yet been reconciled. Understanding neuronal circuit dynamics underlying brain responses to specific stimulus contexts, such as a rare, regularity-breaking deviant stimulus, is of great importance, because macroscopic signals, such as MMN, are altered in a number of neuropsychiatric diseases, for example schizophrenia (Shelley et al., 1991; Todd et al., 2012).
To better understand circuit mechanisms underlying SSA and MMN, intracellular recordings of membrane potential dynamics from identified neuronal subtypes are desirable. Previous whole-cell recordings from rat A1 excitatory neurons revealed tuning properties of excitatory and inhibitory synaptic conductances and their temporal sequence following auditory stimulation (Wehr and Zador, 2003; Zhang et al., 2003; Tan and Wehr, 2009). Only recently intracellular sharp recordings confirmed SSA at subthreshold level using a frequency-oddball paradigm (Hershenhoren et al., 2014). Critically, auditory responsiveness and adaptation properties of diverse classes of inhibitory interneurons in A1 neuronal circuitry have not been determined. Because not all synapses in A1 are depressing but some GABAergic interneurons even receive facilitating inputs (Levy and Reyes, 2012), neuronal inhibition may variably contribute to cortical SSA. Two major subtypes of genetically defined GABAergic cells, the somatostatin (SST)-expressing and parvalbumin (PV)-expressing interneurons, could be differentially involved in SSA as they inhibit distal dendrites and perisomatic regions, respectively (Adesnik et al., 2012; Wilson et al., 2012). Other A1 interneurons may enhance excitation during SSA by disinhibiting other interneurons (Letzkus et al., 2011; Pi et al., 2013). Moreover, GABAerigc interneurons along the auditory pathway may provide gain control to modulate SSA magnitude (Pérez-González et al., 2012; Duque et al., 2014). Although a recent study applied intracellular recordings from SST and PV interneurons to compare tonal receptive fields (Li et al., 2014), oddball responses and SSA in the diverse interneuron classes so far have remained unexplored.
Here we used two-photon-guided whole-cell recordings (Margrie et al., 2003; Kitamura et al., 2008) in layer 2/3 (L2/3) of mouse A1 to characterize membrane potential dynamics upon oddball paradigm stimulation in excitatory pyramidal neurons as well as GABAergic SST and PV interneurons. Our data show that SSA is present in the early responses of all three cell types. Further, we discovered a secondary late response component, especially in pyramidal neurons and PV interneurons, which is triggered by deviant tones and shares temporal and pharmacological characteristics with MMN. Our results offer a refined view of context-dependent stimulus processing in the heterogeneous cortical microcircuit, indicating two temporally distinct phases of oddball processing: an initial early phase of reactivation of cortical circuit elements, including inhibitory interneurons, presumably through recruitment of nonadapted pathways; and a subsequent secondary, longer-lasting phase, likely reflecting broader reverberating network activity or corticocortical feedback.
Materials and Methods
Animals and surgical preparation.
Experimental procedures conformed to the guidelines of the Swiss Federal Veterinary Office and were approved by the Cantonal Veterinary Office in Zurich. We used different transgenic mouse lines to identify specific neuronal cell types, especially GABAergic interneurons expressing SST and PV, respectively, which form largely nonoverlapping populations and together comprise ∼66% of cortical interneurons (Rudy et al., 2011). SST-IRES-Cre mice (Jackson Laboratories, no. 013044); Hippenmeyer et al., 2005) or PV-IRES-Cre mice (Jackson Laboratories, no. 008069; Taniguchi et al., 2011) were crossed with a tdTomato reporter line (Jackson Laboratories, no. 007914; Madisen et al., 2010), resulting in offspring mice expressing red fluorescent protein tdTomato in SST and PV neurons, respectively. A subset of recordings from pyramidal neurons was obtained from wild-type (WT) mice or another transgenic mouse line with GFP-expressing GABAergic neurons (GAD67; Tamamaki et al., 2003). In total, 44 mice were used (6 GAD67, 1 WT, 22 SST×tdTomato and 15 PV×tdTomato mice), giving rise to 27 measurements for pyramidal neurons (9 cells from GAD67, 2 from WT, 9 from SST lines, and 7 from PV lines), 27 for SST interneurons, and 15 for PV interneurons for tuning experiments. Because the recording quality varied over time, only a subset of the recordings was chosen for different analyses (n = 21, 20, and 11 for Exc, SST, and PV neurons for analyzing membrane potential dynamics; n = 20, 20, and 12 for Exc, SST, and PV cells for SSA analysis). For MK-801 experiments, an additional dataset of 11 pyramidal neurons and 9 PV interneurons was collected from 14 PV×tdTomato mice.
Young adult male or female mice (age 5–10 weeks) were anesthetized with 1.5% isoflurane during surgery (4% during induction). Left primary A1 was identified by stereotactic coordinates (2.5 mm posterior of bregma, 4.5 mm lateral of midline), as well as intrinsic optical imaging through the thinned skull. We illuminated the cortical surface with 630 nm LED light and collected the tissue reflectance signal with a CCD camera (Toshiba TELI-CS3960DCL) through a 4× objective (Olympus). A1 was localized by presenting pure tones (8, 10, or 12 kHz) at 60–80 dB sound pressure level (SPL) and locating the area of largest reflectance decrease. A small craniotomy (∼1 ×1 mm) was performed above the identified A1 and the dura mater removed. Agarose (<2% in Ringer solution) and a cover glass were applied on top of the craniotomy to dampen tissue movement. Animals were kept under light isoflurane-anesthesia (0.5–1%) during the experiment and breathing rate was frequently monitored (≥1 breaths/s).
Two-photon guided in vivo whole-cell recordings.
Neurons in supragranular cortical layers (<300 μm, layer 1 and layer 2/3) were targeted with patch pipettes under a custom-built two-photon microscope system installed in a sound-proofed compartment, equipped with a Ti:sapphire laser (Mai Tai DeepSee or Tsunami; Spectra-Physics), a Pockels cell (Conoptics), galvanometric scan mirrors (Cambridge Technology), and a 16× water-immersion objective (NA 0.8; Nikon). Cells were visualized using 930 nm excitation and by collecting fluorescence through red (610/75 nm) and green (535/50 nm) emission filters (AHF Analysentechnik). Image acquisition was controlled with HelioScan, a LabVIEW-based custom microscope software framework (Langer et al., 2013).
Electrophysiological recordings were obtained with glass pipettes (4–7 MΩ resistance) filled with control internal solution (without MK-801) containing the following (in mm): 135 potassium gluconate, 10 HEPES, 10 sodium phosphocreatine, 4 KCl, 4 Mg-ATP, 0.3 Na3GTP, and 20 AlexaFluor 488 (Invitrogen) for pipette visualization. Extracellular solution contained the following (in mm): 145 NaCl, 5.4 KCl, 10 HEPES, 1 MgCl2, and 1.8 CaCl2. Membrane potential recordings were corrected for liquid junction potential (−10.3 ± 0.6 mV, n = 13 measurements). Voltage recordings were performed with an Axoclamp 2B amplifier (Molecular Devices), preamplified and digitized at 20 kHz with an ITC-18 board (InstruTECH) controlled by custom-written IGOR Pro software (WaveMetrics). Small positive pressure (20–30 mbar) was applied while navigating the pipette in the tissue with a micromanipulator (Luigs and Neumann). Two-photon-guided whole-cell recordings from fluorescent SST or PV neurons and from putative pyramidal neurons, identified as unlabeled “shadow” cells, were obtained as described previously (Margrie et al., 2003; Kitamura et al., 2008). After establishing whole-cell configuration, current pulses of 300–400 ms were injected to measure basic electrophysiological parameters (Table 1). No holding current was applied in current-clamp mode. Access resistance was estimated by applying brief negative current test pulses (<1 nA, 20 ms pulse-width) and compensated on-line. Cells were discarded if initial access resistance was >70 MΩ. For measuring tuning receptive fields and performing oddball experiments (see below), total recording durations of 1–1.5 h were required, during which neurons were held in whole-cell configuration and recording quality was constantly monitored. To confirm cell identity and cell morphology, fluorescence image stacks were acquired at the end of the experiment.
For MK-801 experiments, we added 267 μm MK-801 (Tocris Bioscience) to the control internal solution. To minimize MK-801 contamination during pipette navigation, the pipette was back-filled with 2 μl control internal solution, and then with 7 μl solution containing MK-801. The estimated final pipette concentration of MK-801 was 207 μm. MK-801 is an irreversible, use-dependent NMDA-receptor antagonist. To ensure sufficient blockage of NMDA receptors, oddball responses were measured at least 15 min after pipette break-in.
Auditory stimulation.
Auditory stimuli (pure tones with 5 ms rise and fall time) were generated with a Tucker-Davis System 3 processor (RZ6) and presented in free-field configuration with an electrostatic loudspeaker (ES1, Tucker-Davis). A grounded small faraday cage built around the loudspeaker prevented electromagnetic interference with electrophysiological recordings. The loudspeaker was placed ∼5 cm from the contralateral (right) ear of the animal. To reduce ambient acoustic noise, the recording chamber and the laser system were shielded with sound-proofing foam. The transfer function of the sound system was measured with a condenser microphone (PCB Piezotronics) and calibrated using SigCalRP (Tucker-Davis). The sound system was calibrated for frequency ranges of 4–50 kHz. After calibration, there was an intensity deviation of ±11 dB across the frequency range for a desired sound pressure level. At zero attenuation level, the sound intensity was 90 dB SPL. Intensities of 70–90 dB were used for tuning curve measurements. Sound intensity was 80 dB during the oddball paradigm. The ambient noise of the experimental setup was ∼60 dB (mainly caused by the laser cooling system) predominated by low frequencies (<5 kHz), for which mice have high hearing thresholds.
Oddball experiment design.
Determining frequency tuning is a prerequisite for selecting two appropriate frequencies in the oddball paradigm. For each recorded neuron, frequency selectivity was determined by measuring neuronal responses to pseudorandom blocks of pure tones ranging from 4.00 to 45.26 kHz [0.25 octave apart, 15 frequencies, 100 ms duration, 800 ms interstimulus interval (ISI) onset-to-onset], alternately presented at sound intensities of 0, 10, and 20 dB attenuation levels (corresponding to 90, 80, and 70 dB). Each block contained 60 tones (4 repetitions per frequency). For each sound intensity, two to four blocks of pure tones were presented (8–16 repetitions per frequency). The best frequency (BF) of the recorded neurons was determined as the frequency that evoked the largest response. Due to recording time constraints we determined a coarse tuning curve for frequency-selection for the oddball paradigm in the subsequent experiment, with only a limited set of tone intensities (3) and tone frequencies (15) presented. We therefore do not report detailed tuning receptive fields in the three A1 cell types here. We note that the pseudorandom sequence of pure tones at 80 dB was also used as the “many-standards” paradigm to test the presence of deviance detection.
After determining the neuron's BF, the frequency-oddball experiment was designed by selecting two frequencies (f1 and f2) within the TRF and separated by Δf = 0.5 octaves (Fig. 1A). Both f1 and f2 were pseudorandomly assigned as either oddball or standard tone in alternate blocks. For the main oddball experiment, the probability of the oddball tone (POdd) was 0.1 (100 oddball trials and 900 standard trials in 10 stimulation blocks), with additional constraint that at least six standard tones occurred between oddballs. As control conditions, we further included blocks with POdd = 0.3 (120 oddball trials and 240 standard trials in 4 stimulation blocks) and POdd = 0.5 (100 oddball trials and 100 standard trials in 2 stimulation blocks). All tones were presented with 100 ms duration and 500 ms ISI.
Data analysis.
All data analysis was performed with MATLAB (MathWorks). The average membrane potential of each neuron was estimated as the mean membrane potential during the first 100 ms of current-injection protocol (before injection of a 300–400 ms step-current pulse). The resting membrane potential was measured by manually selecting traces at DOWN-states without action potentials (APs) during the same time period. AP threshold was inferred as the voltage at the time corresponding to local maximum of the third derivative from the membrane potential (down-sampled to 10 kHz; Gentet et al., 2010). AP amplitude was measured from threshold-to-peak and AP half-width was estimated as the full width at half-maximal amplitude. The I–V curve was fitted with a quadratic function and input resistance was calculated as the slope at zero current. Baseline firing rate (FR) was estimated during the initial frequency-tuning experiment within the periods before and after sound stimulation (no baseline firing assumed for cells exhibiting spiking rates <0.1 Hz). Subthreshold membrane potential (Vm) changes were estimated by removing APs from the original voltage recordings using median filtering (Hershenhoren et al., 2014; Li et al., 2014). Vm response amplitude was calculated as the peak amplitude relative to a 10 ms baseline period before tone onset for the early time window (0–100 ms after tone onset) and as mean amplitude relative to baseline for the late time window (200–400 ms after tone onset). Because SST cells, in particular, display tone-evoked early hyperpolarization (Fig. 1), the peak response amplitude provides a better estimate of the immediate excitatory drive. FR changes were calculated in 20 ms bins. Tone-evoked onset latency was estimated as the earliest time point when the average response reached mean + 1 SD prestimulus baseline 10 ms before tone onset (Li et al., 2014).
We quantified the differences in responses to oddballs and pre-oddball standards using a d-prime index for SSA defined as follows: where μ and σ denote the mean and SD of the response amplitudes to oddballs (Odd) and pre-oddball standards (PreOdd), applied to either early or late time window. A positive value for d′SSA indicates stronger responses for the oddball tone. In previous SSA studies, common selectivity indices (CSI) based on the differences between oddball and standard responses and normalized by the overall activity level for both f1 and f2 have been used for quantifying the SSA magnitude (Ulanovsky et al., 2003; von der Behrens et al., 2009; Taaseh et al., 2011; Hershenhoren et al., 2014). CSI was computed as follows: where μOdd,f1, μOdd,f2 andμSta,f1, μSta,f2 represent the mean Vm amplitude or mean FR change evoked by oddball and pre-oddball standard tones for f1 or f2 frequency. Compared with d′SSA, CSI is susceptible to low activity levels such as the sparse firing in L2/3 excitatory cells, as well as the small amplitudes of the late response components, thus giving rise to more boundary values (0, 1) and larger within-group variation. Given the wide spectrum of response levels in A1 cell, d′SSA is therefore more suitable for the present study to quantify SSA.
To investigate the frequency preference effect on SSA, preferred and nonpreferred frequencies were defined by calculating the d′Pref index derived from the peak Vm amplitudes evoked by the first standard tones as follows: where μ and σ are the mean and SD for the first tone-evoked responses to respective frequencies, f1 and f2. For d′Pref > 0, f1 is the preferred frequency, and for d′Pref < 0, f2 is the preferred frequency.
Classification analysis was performed with a probabilistic Bayesian Classifier, as implemented in MATLAB Statistics toolbox (NaïveBayes class). Membrane potential recordings were down-sampled to 200 Hz and a 50 ms sliding window was used (10 data points per classification step). Cross-validation was performed by leaving out 20% of the trials per iteration and testing classifier performance on the left-out data. Average classifier performance was determined as mean over 120 iterations.
Statistical comparisons involving different neurons (e.g., between cell types or control vs MK-801) were performed as unpaired t tests and comparisons between conditions (e.g., oddball vs standard) as paired t tests. In all statistical procedures involving multiple comparisons, comparison results were accordingly adjusted using Bonferroni correction.
Results
Whole-cell recordings of tone-evoked responses in different neuronal cell types in A1
We obtained two-photon-targeted whole-cell recordings from L2/3 SST and PV interneurons in identified A1 of isoflurane-anesthetized SST×tdTomato and PV×tdTomato mice, respectively (Fig. 1A–C). In addition, we recorded from unlabeled excitatory L2/3 pyramidal neurons (Exc) using the “shadow-patch” technique (Kitamura et al., 2008). The three cell types differed in basic electrophysiological properties such as resting membrane potential, input resistance, AP shape, and spontaneous firing rates (Fig. 1; Table 1), consistent with previous reports on L2/3 neurons (Gentet et al., 2012; Li et al., 2014). We stimulated L2/3 neurons with 100 ms pure tones of different sound frequencies (range 4–40 kHz), evoking membrane potential (Vm) changes and incidental APs on top of ongoing spontaneous Vm fluctuations (Fig. 1D). For each neuron we measured a tuning curve and determined the BF at 80 dB, i.e., the sound frequency eliciting the largest response. In all three neuron types we observed clear BF-evoked subthreshold Vm changes (Fig. 1E; APs removed by median filtering). Sound-evoked Vm responses were significantly larger in Exc and PV neurons compared with SST neurons and induced increases in FR, especially in the inhibitory interneurons but less frequently in pyramidal neurons (Fig. 1E,F). BF evoked responses with short onset latency for Vm and FR changes (30.8 ± 3.4, 34.6 ± 4.5, 22.4 ± 3.5 ms for Vm, and 31.9 ± 5.1, 24.1 ± 1.9, 27.1 ± 9.0 ms for FR changes with 0.5 ms bin in Exc, SST, and PV cell, mean ± SEM, n = 22, 17, and 11, respectively; p > 0.1 for Vm and FR, one-way ANOVA). We did not observe slower onsets of SST cells compared with Exc and PV neurons as has recently been reported (Li et al., 2014), probably because of the fewer trials in the current study that leads to larger baseline variation. For each neuron we selected two appropriate frequencies below and above the BF to measure neuronal activity upon oddball paradigm stimulation (Fig. 1A).
Response adaptation during repeated tone sequences
We first analyzed how neuronal responses adapt to repeated standard tone presentation by averaging across the initial block of trials, during which standard tones were repeated for at least six times. The first stimulus evoked a fast and brief Vm depolarization, which caused transient increases in FR in all three cell types (Fig. 2A,B; Vm peak latency from tone onset: Exc, 53.4 ± 30.0 ms, n = 20; SST, 42.1 ± 23.9 ms, n = 20; PV, 57.1 ± 33.2 ms, n = 12; mean ± SD). In the majority of recordings the first tone also evoked a secondary longer-lasting Vm depolarization, most prominently in Exc and PV neurons, that initiated after tone offset and could last for several-hundred milliseconds (onset latency of inflection point from tone onset: Exc, 230 ± 57 ms; SST, 234 ± 34 ms; PV, 208 ± 17 ms; mean ± SD; proportion of cells with clear late component used in this analysis: 17/20, 8/20, 8/12 for Exc, SST, and PV neurons). The late depolarization caused FR increases particularly in PV neurons but not in Exc neurons, which remained mostly silent after the initial response. Similar stimulus-evoked, UP-state-like delayed response components in A1 have been reported in other studies (Metherate and Cruikshank, 1999; Grienberger et al., 2012).
Responses to the second tone and subsequent tones displayed pronounced adaptation of both early and late component. In the early time window, both Vm and FR response amplitudes were significantly reduced in all cell types already for the second tone, with the exception of FR in PV neurons, which showed less adaptation (Fig. 2C,D). For the sixth tone repetition, early Vm response amplitudes reached a steady-state level of 30–50% of the first-tone response (44 ± 59%, 45 ± 46%, 30 ± 40% for Exc, SST, and PV neurons; mean ± SD). The late Vm response component also significantly adapted in Exc and PV neurons, but less so in SST neurons (Fig. 2C). FR changes during the late component adapted significantly in SST and PV interneurons, but not in pyramidal neurons, which seldom fired during the late phase (Fig. 2D). Notably, the adapted FR activity in PV cells upon repeated tone stimulation is higher than those in Exc and SST cells. These results show that the early tone-evoked responses rapidly adapt to repetitive tone presentation in all three cell types. In addition, the late component observed mainly in Exc and PV neurons also exhibits adaptation.
Recovery of early and late responses from adaptation upon oddball tone stimulation
We next examined how distinct A1 neuronal subtypes respond to a rare and irregular deviant tone using an oddball paradigm. We presented a series of repeating standard tones, interleaved with infrequent oddball tones of a different frequency. The two frequencies were chosen flanking each neuron's BF and oddballs were delivered with a probability of POdd = 0.1, switching frequencies for standard and deviant across trial blocks (Fig. 1A). For all cell types, oddballs evoked enlarged early Vm responses compared with the adapted responses to pre-oddball standards (Fig. 3A,B). Oddballs frequently elicited APs in SST and PV neurons but seldom in pyramidal cells (Fig. 3C,D). Furthermore, oddball tones also recovered the late component of the Vm responses, most obvious in Exc and PV neurons but less clear in SST neurons. The late oddball-evoked Vm response remained subthreshold in Exc neurons and evoked extra APs only in PV cells.
We quantified the oddball effect by comparing the amplitude of early and late Vm responses for pre-oddball and oddball stimuli as well as for the first standard stimulus in trial blocks. Early Vm peak amplitudes were significantly enhanced for oddball compared with pre-oddball stimuli for all cell types (Fig. 3E; by a factor of 3.7, 3.1, and 3.7 for Exc, SST, and PV neurons, respectively; p < 0.01; n = 20, 20, 12). For the late component a significant fourfold to fivefold enhancement of the mean Vm amplitude was observed for Exc and PV neurons (p < 0.05, paired t test). Despite this enhancement, oddball Vm responses were, however, generally ∼20–35% smaller than the responses to the first stimulus. Our finding is consistent with a synaptic mechanism underlying SSA, with deviant tones recovering neuronal activity by recruiting distinct, largely nonadapted afferent pathways together with afferents partially overlapping with those adapted by the standard tone.
We also analyzed the selectivity of responses for oddballs versus pre-oddball standards for either early or late time window using a d′ index for SSA (d′SSA). This analysis confirmed that oddball tones (POdd = 0.1) induced significantly enlarged Vm depolarization in both early and late phase for all neuronal types (Fig. 3F; p < 0.01, one-sample right-tailed t test against d′SSA = 0). The same analysis for control experiments with higher POdd values of 0.3 and 0.5 revealed that SSA magnitude depends on the rarity of the oddball stimulus and that it is absent for equal probability of oddball and standards. For SST neurons d′SSA was not significantly different from zero at POdd = 0.3 and SSA generally was less pronounced compared with Exc and PV neurons. Such dependence of SSA magnitudes on oddball probability is as well reflected in the difference traces based on grand average Vm (oddball − pre-oddball standard), whose peak derivatives indicate the onsets of early and late responses (Fig. 4). By taking the first derivative of the difference traces, onsets of early and late Vm responses can be estimated as the latency of the first and the second local maximum respectively (early onsets, POdd = 0.1: 23, 22, and 24 ms for Exc, SST, and PV cells: POdd = 0.3: 26, 26, and 23 ms for the respective three cell types; late onsets, POdd = 0.1: 212 and 237 ms for Exc and PV cells). In addition, we quantified the oddball effect with an alternate definition of SSA index, CSI (Hershenhoren et al., 2014). For the subthreshold Vm responses, the comparison results derived from CSI were similar to those from d′ index (CSI of early Vm 0.56 ± 0.06, 0.49 ± 0.08, 0.58 ± 0.07 in Exc, SST, and PV cells and late Vm 0.89 ± 0.04, 0.19 ± 0.16, 0.58 ± 0.18, mean ± SEM, n = 20, 20, and 12, respectively). We nevertheless found that CSI is susceptible to low activity level such as the sparse spiking activity for L2/3 pyramidal neurons in A1 and the relatively small evoked responses within the late time window (CSI for early FR 0.39 ± 0.11, 0.58 ± 0.10, 0.47 ± 0.14 and late Vm −0.10 ± 0.10, 0.24 ± 0.17, 0.20 ± 0.22 for Exc, SST, and PV cells, mean ± SEM). Small changes under low baseline activity levels would lead to artificially large SI values. In the present study, we therefore preferred to use d′ instead of SI because the d′ index provides a robust measure of selectivity over a wide range of activity levels in A1 neurons.
The analogous quantitative analysis for oddball-evoked FR changes revealed that the recovered early depolarization consistently increased peak FR in the early time window (Fig. 3G; significant increase for oddball vs pre-oddball response for Exc and SST neurons, p < 0.05, paired t test; approaching significance for PV neurons that showed higher variability in spiking, p < 0.1). Consistent with the subthreshold responses, the FR changes recovered by the oddball remained smaller than the FR activity evoked by the first standard tones. For the late component, only PV neurons displayed enhanced FR levels upon oddball stimulation, which did not however reach statistical significance (Fig. 3G). For Exc neurons, no extra APs were elicited during the late time window. Hence, the late component was visible only in the subthreshold regime, presumably reflecting enhanced synaptic input activity upon oddball stimulation during this period. In all three cell types, d′SSA values calculated for the early FR changes revealed significant SSA for POdd = 0.1 and 0.3, but not for POdd = 0.5. For the late response window, d′SSA values were just significantly larger from zero only for the GABAergic interneurons at low oddball probability (Fig. 3H). We conclude that SSA is present in mouse auditory cortex not only in excitatory neurons but also in two major classes of cortical GABAergic interneurons (SST and PV neurons). In addition to the early tone-evoked responses exhibiting SSA, our whole-cell recording method reveals that a rare and irregular oddball tone recovers the subthreshold late components as well.
Dependence of oddball-evoked responses on frequency preference
As A1 neurons typically responded asymmetrically to the two frequencies used in the oddball paradigm (Fig. 1A), we next explored how SSA is influenced by tuning preferences for the presented tones. We defined “preferred” and “nonpreferred” frequencies according to which tone frequency resulted in a larger mean Vm response to the first standard tone. Both preferred and nonpreferred oddball tones displayed significant SSA (i.e., d′SSA > 0) for early Vm and FR responses (Fig. 5B,D; p < 0.05 for all cell types except for early FR evoked by nonpreferred oddballs in Exc, right-sided paired t test against zero). Nevertheless, preferred oddball tones evoked larger early Vm responses and displayed more prominent SSA (Fig. 5A,B; p < 0.01 for Exc and SST, p > 0.1 for PV, paired t test of d′SSA between preferred and nonpreferred oddballs). The frequency preference was also reflected in larger early FR changes evoked by preferred oddball tones (Fig. 5C,D; p < 0.05 for Exc and SST, p > 0.1 for PV, paired t test). The oddball-evoked late Vm responses, on the other hand, exhibited significant SSA for Exc and PV cells (Fig. 5B; p < 0.05 for Exc and PV, right-sided paired t test of d′SSA against zero) but without frequency preference (p > 0.05 for all cell types, paired t test of d′SSA between preferred and nonpreferred oddballs). For late spiking activity, significant SSA was found in PV and SST only for preferred oddballs (Fig. 5D). These results indicate that SSA is not homogeneous across the receptive field (Duque et al., 2012), and that gain mechanisms, such as activation of different amounts of inputs or recruitment of local recurrent subnetworks, exert a modulatory effect on SSA.
We next investigated how stimulus context (low-probability tone violating the preceding regularity) affects the frequency discrimination power of auditory neurons. We trained a probabilistic Bayesian classifier to distinguish between the two frequencies (f1 and f2) based on Vm responses to pre-oddball standard and oddball tones. The classifier discriminated frequencies above chance (50%) under both tone conditions (Fig. 5E). Nevertheless, classification accuracy during the early component was significantly better for oddball tones in all cell types (Fig. 5E; oddball vs pre-oddball: p < 0.01 for all cell types, two-way repeated-measures ANOVA). During the late component, in contrast, frequency discrimination differed between oddball and standard tones in a cell-type-specific manner, with PV and Exc showing enhanced discrimination (Fig. 5E; oddball vs pre-oddball: p < 0.01, two-way repeated-measures ANOVA), whereas no difference was observed for SST interneurons (p > 0.1, two-way repeated-measures ANOVA). In summary, our results demonstrate that responses to low-probability tones are modulated by sound frequency and, conversely, that frequency discrimination is influenced by tone probability, most interestingly in a cell-type-specific manner within the late time window.
The late oddball-evoked response may reflect genuine deviance detection
The classic frequency oddball paradigm (Fig. 1A) does not, on its own, allow distinction between SSA and genuine deviance detection (Nelken and Ulanovsky, 2007; Taaseh et al., 2011; Fishman and Steinschneider, 2012). The latter is considered a higher-level process involving detection of disruptions in an auditory sequence (Näätänen et al., 2005). To address the distinction between SSA and deviance detection, we compared oddball tone responses with the activity evoked by tones of the same frequency in the random tone sequence, which was used to determine the BF for each neuron (Fig. 6A). In this many-standards (MS) condition (Fishman and Steinschneider, 2012), no regularity is established because subsequent tones vary in an unpredictable fashion. If the increased responses to deviant tones in the oddball paradigm would be comparable with responses in the MS condition, this would indicate that the large deviant responses during oddball stimulation can be explained solely by SSA. On the other hand, if the oddball tones evoke larger responses than the same tones under MS stimulation, this would constitute evidence for true deviance detection (Fishman and Steinschneider, 2012). We found that, for all three cell types, initial tone-evoked subthreshold and spiking responses (early responses) were comparable for oddball tones and corresponding tones in the MS condition (Fig. 6B). Both peak Vm and FR did not differ significantly during the early phase after tone onset (Fig. 6C,D; oddball vs MS: all p > 0.05, paired t tests). Interestingly, however, in pyramidal neurons the late Vm component was significantly more depolarized in response to the oddball tones compared with the MS condition (Fig. 6C; oddball vs MS: p < 0.05, paired t test). These results suggest that the late depolarization evoked by oddball tones in L2/3 pyramidal neurons may reflect deviance detection in primary auditory cortex.
Auditory responses depend on NMDA receptor signaling in Exc but not PV neurons
A key property of MMN is its dependence on NMDA-type glutamate receptor (NMDAR) signaling (Javitt et al., 1996; Umbricht et al., 2000), suggesting that nonlinear dendritic integration processes may play a role in deviance detection. To test whether NMDAR activation is involved in the responses evoked by oddball tones, we obtained additional whole-cell recordings in Exc (n = 11) and PV neurons (n = 9) and blocked NMDARs by intracellular infusion of the noncompetitive antagonist MK-801 (Lavzin et al., 2012). Unlike systemic application of NMDAR blockers that affects NMDAR signaling in the entire neuronal network (Tikhonravov et al., 2008; Farley et al., 2010), our intracellular approach blocks the NMDAR-mediated dendritic mechanisms only in the recorded cell. Intracellular MK-801 application clearly reduced evoked Vm responses in pyramidal neurons but not in PV interneurons (Fig. 7A). Spiking responses of pyramidal neurons to oddball tones were largely reduced, whereas FR changes were not affected in PV neurons (Fig. 7B). Oddball tones still evoked significantly larger Vm responses than standard tones during both early and late components (Fig. 7C; for Exc and PV, p < 0.01 for early and p < 0.05 for late, paired t tests between tone types). However, compared with control condition, the overall decreased responses to oddballs, as well as to pre-oddball standards led to smaller SSA indices after MK-801 infusion, especially for the late time window (Fig. 7D; control vs MK-801: p > 0.1 for Exc early and p < 0.01 for late, two-sample t tests between control and MK-801 conditions). By comparing oddball-evoked activity with MS control, such reduction in oddball-evoked late Vm in Exc cell after MK-801 blockade further results in absent deviance detection (Odd vs MS, 0.83 ± 0.25 vs 0.77 ± 0.29 mV for late Vm, p > 0.05, two-sample t test between Odd and MS). Comparisons between control and MK-801 conditions for Vm responses of PV neurons were not significantly different (Fig. 7D; p > 0.1 for PV early and late, two-sample t test). Similar results were obtained for FR changes after MK-801 application (Fig. 7E). Notably, the selective early FR increase in Exc neurons in response to oddball tones was strongly suppressed by MK-801 (Fig. 7F; control vs MK-801: p < 0.05 for Exc early and p > 0.1 for late, two-sample t test between control and MK-801). In contrast, FR responses of PV neurons were not influenced by MK-801 (Fig. 7F; p > 0.1 for PV early and late, two-sample t test). These findings suggest a possible link between macroscopic brain activity evoked by deviant stimuli and the corresponding membrane potential dynamics in defined neuronal cell types. They further suggest that frequency- and context-specific activation of pyramidal neurons in A1 may be achieved, at least partly, by NMDAR-mediated, active dendritic integration mechanisms (Gambino et al., 2014; Palmer et al., 2014), and may be required to trigger the secondary, longer-lasting late response component.
Discussion
Recording membrane potential dynamics from genetically defined neuronal types provides critical information for understanding how cortical circuits process oddball stimuli that break a previously established regularity. Using two-photon-guided in vivo whole-cell recordings in mouse A1, we compared subthreshold and suprathreshold responses upon oddball stimulation in excitatory and inhibitory neurons. Our main findings are as follows: (1) we report two temporally distinct early and late response components that encode oddball tones in the studied cell types; (2) SSA is present in Exc neurons as well as in PV and SST interneurons, with oddball tones recovering both early and late response components compared with the preceding standards, suggesting that a large portion of A1 neurons adapts to repetitive stimuli, and then becomes reactivated through the synaptic drive recruited by a deviant stimulus; and (3) the late component, particularly in Exc cells, shares characteristics such as time course, deviance-encoding, and pharmacological properties with the MMN EEG signal component, and therefore may be the “missing link” between neuronal SSA and MMN.
SSA is present in SST and PV interneurons
We found that both SST and PV interneurons in A1 adapt to repeating standard tones (Fig. 2) and subsequently recover their activity when the oddball tone is presented (Fig. 3). Although subcortical SSA has been reported primarily in the nonlemniscal part of inferior colliculus (Malmierca et al., 2009; Ayala and Malmierca, 2012) and the medial geniculate body of thalamus (Anderson et al., 2009; Antunes et al., 2010), the prominent SSA seen in the lemniscal A1 in our study and previous studies (Ulanovsky et al., 2003; Hershenhoren et al., 2014) indicates de novo generation or amplification of SSA in cortical circuitry (Nelken, 2014). Two mechanisms may contribute to the recovered early responses evoked by oddballs. First, increased inhibition may build up during repeated presentation of standards and an oddball tone would release neuronal inhibition. This model is analogous to the earlier “disinhibition hypothesis” regarding MMN in tonopically organized A1 (Näätänen, 1984; Picton et al., 2000). Our results are inconsistent with this scenario as we found that SSA is present in both SST and PV interneurons. Rather, the similarity in the early oddball responses in excitatory and inhibitory neurons suggests overall excitation–inhibition balance during SSA (Wehr and Zador, 2003; Higley and Contreras, 2006), possibly through common synaptic mechanisms in the excitatory pathways of direct or indirect thalamocortical projections (Smith and Populin, 2001; Barbour and Callaway, 2008). Nonetheless, inhibition may participate in SSA in more subtle manners. For example, slower adaptation in the spiking activity of PV interneurons (Fig. 2) could contribute to the attenuation of standard-evoked activity (in Exc cells) during tone sequences (Pérez-González et al., 2012; Duque et al., 2014). In addition, different rates of recovery from adaptation between excitatory and inhibitory neurons could shape oddball responses (Malina et al., 2013). Finally, the rapid adaptation seen in SST interneurons (Fig. 2) may arise from inhibitory action of VIP-expressing interneurons (Pi et al., 2013), which were not studied here.
The alternative explanation for SSA is that repeating standard tones cause depression in a subset of activated excitatory inputs, and an oddball activates another subset of nonadapted synapses thus recovering the response amplitude (Fig. 8). This model is analogous to a more recent “adaptation hypothesis” of MMN in A1 (May et al., 1999; May and Tiitinen, 2010). Our results are more consistent with this model and indicate synaptic depression affecting not only excitatory neurons but also PV and SST interneurons. The oddball tone presumably enhances both excitatory and inhibitory conductances in A1 neurons through activation of nonadapted synaptic projections.
We also found that tone frequency as a stimulus feature modulates SSA, especially for the early response component, with larger SSA magnitude at subthreshold level occurring in all three neuronal classes when the oddball matched the preferred frequency (Fig. 5). At suprathreshold level, this effect was most apparent in the sparsely active pyramidal neurons. The highly stimulus-specific AP generation in L2/3 pyramidal neurons appears to depend on NMDAR signaling (Fig. 7), suggesting the involvement of active dendritic mechanisms (Smith et al., 2013; Palmer et al., 2014). In contrast, interneurons may track circuit activity with lower stimulus-specificity and reflect subthreshold inputs in a more linear fashion, which perhaps is critical for maintaining excitation–inhibition balance under a wide range of stimulus context. Such balance could be achieved by nonselective pooling of inputs from excitatory neurons (Kerlin et al., 2010; Hofer et al., 2011) or unspecific inhibition by other interneurons (Pfeffer et al., 2013). Accordingly, oddball responses in PV interneurons were independent of NMDAR signaling, indicating that the lack of nonlinear dendritic conductances (Vervaeke et al., 2012) may underlie the linear input–output transformation in interneurons (Harris and Mrsic-Flogel, 2013).
A late response component encodes stimulus context and possibly deviance detection
We discovered an additional response component evoked by oddballs (and first tones) occurring >100 ms after tone offset, which was particularly evident in pyramidal neurons. Because it mainly remained subthreshold in L2/3 Exc neurons, this late component possibly has been missed in previous extracellular studies (Ulanovsky et al., 2003; von der Behrens et al., 2009). The late component resembles thalamocortically driven “UP states” (MacLean et al., 2005; Rigas and Castro-Alamancos, 2007) and may be related to the late activation of sound-evoked “network calcium transients” (Grienberger et al., 2012). As this late network activation adapts to repetitive stimuli and is reactivated by an oddball stimulus, it shares the dependence on stimulus context (i.e., the history of preceding stimuli) with the early component. Generation of the late component may further depend on the stimulation protocol. For shorter ISI and briefer tone durations, the onset of the late component could be shifted earlier and merge with the early component (Taaseh et al., 2011).
Furthermore, we found evidence for genuine deviance detection in the oddball-evoked late component. The early responses in the many-standards paradigm were largely comparable with oddball responses (Fig. 6), suggesting that synaptic adaptation underlies the early oddball response. In contrast, late Vm depolarization was more prominent for oddball stimuli compared with the many-standards condition, suggesting that the late but not early component carries the signature of genuine deviance detection in rodent A1. The subthreshold late depolarization in L2/3 Exc cells might merely be the signature of deviance detection involving processing across cortical layers, as current-source profiles in response to oddballs indicate that SSA occurs in all A1 layers, albeit to a variable degree (Szymanski et al., 2009). Another possibility is that a small subset of L2/3 pyramidal neurons still exhibits suprathreshold late responses (Grienberger et al., 2012). It could be that oddball-evoked late spiking responses arise in middle layers receiving direct thalamocortical inputs (Metherate and Cruikshank, 1999). Excitatory cells in deeper layers may integrate supragranular inputs (Ojima et al., 1991) with nonlemniscal thalamic inputs (Kimura et al., 2003) to detect deviant stimuli and convey this information further via various corticofugal projections (Winer, 2006; Malmierca et al., 2015).
Possible link of the late response component to MMN
The features of oddball-evoked late components revealed by intracellular recordings offer a putative link to MMN measured in human EEG and MMN-like activity recorded in rodents (Umbricht et al., 2005; Nakamura et al., 2011). Middle-latency responses peaking at approximately tens of milliseconds have similar time courses to early responses during SSA (Slabu et al., 2010; Grimm et al., 2011). Nonetheless the relationship between SSA and MMN has remained elusive. In addition to both carrying true deviance information, the time course of MMN and the late component match surprisingly well. This becomes especially obvious when comparing the derivatives of oddball-evoked Vm responses (Fig. 4), which can serve as a first approximation of the extracellular currents that summate to generate EEG signals. Using blind intracellular recordings, a recent study reported the early oddball-evoked membrane potential responses although depolarization within the late time window was not mentioned (Hershenhoren et al., 2014). Here, we describe the precise time course and cell-type-specificity of the oddball-evoked late component. The late activity triggered by relevant stimulus context may be an important mechanism of sensory integration in behaving animals (Sachidhanandam et al., 2013).
Another critical argument is the lack of NMDAR dependence in SSA (Farley et al., 2010), but not in MMN (Tikhonravov et al., 2008), after systemic NMDAR blockade. We found that both oddball-evoked early and late components in pyramidal neurons were strongly reduced after intracellular infusion of the NMDAR antagonist MK-801. Although SSA was still present in the early subthreshold responses of Exc cells, it was significantly reduced for the late component (Fig. 7). With its NMDAR dependence, the oddball-evoked late component shares yet another feature with MMN.
The origin of the late response component remains to be elucidated. We suspect that it reflects reverberating network activity that persists beyond the actual sensory drive, similar to cortical UP states (Fig. 8). Whether such network activity may occur locally within a columnar circuit, possibly through cross-laminar interactions, or whether it requires reverberation in larger-scale networks, such as the thalamocortical or corticocortical circuitry, awaits further investigation.
Footnotes
This work was supported by the Forschungskredit of the University of Zurich (H.L.), the Hartmann-Müller Stiftung (H.L.), a grant from the Novartis Foundation for Medical Biological Research (H.L., F.H.), the National Centre of Competence in Research “Neural Plasticity and Repair” of the Swiss National Science Foundation (F.H.), the Swiss SystemsX.ch initiative (F.H.), and by a Roche Postdoctoral Fellowship awarded to H.L. We thank H. Kasper, R. Krüppel, M. van't Hoff, and F. Voigt for technical assistance, and U. Gerber for helpful discussion and providing MK-801.
The authors declare no competing financial interests.
- Correspondence should be addressed to either of the following: Dr. Henry Lütcke, ETH Zurich, Weinbergstr 11, CH-8092, Zurich, Switzerland, hluetcke{at}ethz.ch; or Prof. Fritjof Helmchen, Brain Research Institute, University of Zurich, CH-8057, Zurich, Switzerland, helmchen{at}hifo.uzh.ch