Abstract
Sensory experience during a critical period alters sensory cortical responses and organization. We find that the earliest sound-driven activity in the mouse auditory cortex (ACX) starts before ear-canal opening (ECO). The effects of auditory experience before ECO on ACX development are unknown. We find that in mouse ACX subplate neurons (SPNs), crucial in thalamocortical maturation, respond to sounds before ECO showing oddball selectivity. Before ECO, SPNs are more selective to oddball sounds in auditory streams than thalamo-recipient layer 4 (L4) neurons and not after ECO. We hypothesize that SPN's oddball selectivity can direct the development of L4 responses before ECO. Exposing mice, of either sex, before ECO to a rarely occurring tone in a stream of another tone occurring frequently leads to strengthening the adult cortical representation of the rare tone, but not that of the frequent tone. Results of control exposure experiments at multiple developmental windows that also use only a single tone corroborate the observations. We further explain the strengthening of deviant inputs before ECO and not after ECO using a binary network model mimicking the hierarchical structure of subplate and L4 neurons and response properties derived from our data, with synapses following Hebbian spike time-dependent plasticity learning rule. Information-theoretic analysis with sparse coding assumptions also predicts the observations. Thus, relatively salient low probability sounds in the earliest auditory environment cause long-term changes in the ACX.
SIGNIFICANCE STATEMENT Early auditory experience can change the organization and responses of the auditory cortex in adulthood. However, little is known about how auditory experience at prenatal ages changes neural circuits and response properties. In mice at equivalent early developmental stages, we find that auditory experience of a particular kind, with a less frequently occurring sound in a stream of another sound, alters adult cortical responsiveness, specifically of the less frequent sound. However, at the previously known critical period of development, the opposite is observed, where the more frequent sound's representation is strengthened in the adult compared with the less frequent sound. We thus show that a specific type of auditory environment can influence adult auditory processing at the earliest ages.
Introduction
Sensory cortical circuitry development is activity-dependent and is based on the sensory environment to which an animal is exposed (Hensch, 2005; Schreiner and Polley, 2014). Even the earliest experiences, such as in utero exposure to a mother's voice, may lead to maternal speech preference in newborns (Decasper and Fifer, 1980) and near-term fetal responses to the mother's spoken words (Voegtline et al., 2013). It is well known that, in the auditory system, altering the natural auditory environment during specific periods (critical period) (De Villers-Sidani et al., 2007; Barkat et al., 2011; Takesian et al., 2018) of development alters cortical circuitry and organization remarkably (Zhang et al., 2001; Zhou and Merzenich, 2008). However, it is unknown whether the auditory system can adapt to environmental changes before the known critical period. Subplate neurons (SPNs), the firstborn cortical neurons (Kanold and Luhmann, 2010; Luhmann et al., 2018), present in large numbers during and before the critical period (De Villers-Sidani et al., 2007; Hoerder-Suabedissen and Molnár, 2013), play a crucial role in the development of functional cortical circuitry and organization (McConnell et al., 1989; Ghosh et al., 1990; Ghosh and Shatz, 1992; Kanold et al., 2003; Kanold and Shatz, 2006). Further, SPNs in the auditory cortex (ACX) are known to be the earliest to respond to sounds, at least in ferrets (Wess et al., 2017). However, the properties of early SPN sound responses, which are likely to play a crucial role in ACX development, are unknown, especially in mice. SPNs receive inputs from the thalamic axons projecting to L4 (Luhmann et al., 2018) in the ACX (Kanold and Luhmann, 2010; Holzhauer et al., 2017) and drive L4 activity over a time window of development (Hanganu et al., 2002; Luhmann et al., 2018). Thus, it is essential to know what kinds of natural stimuli activate SPNs as their activity coincident with thalamic inputs can sculpt the earliest feedforward thalamocortical circuitry (Kanold et al., 2003; Kanold and Shatz, 2006) in the ACX. SP and L4 neurons play a crucial role in the maturation of thalamocortical circuitry, so it becomes important to know how the response properties of SPNs and L4 neurons change, starting from the onset of ACX sound-driven activity.
ACX responses have been shown to exist in ferrets (Wess et al., 2017) at ages equivalent to P6 (postnatal day 6) in mice (Clancy et al., 2001), and loud tone-driven activity in wide-field fluorescence is seen in P8-P9 mice (Meng et al., 2021). However, no study reports sound-driven single-unit activity in the ACX of mice at ages before ear-canal opening (ECO) (P11/P12). Low threshold hearing onset coincides with ECO (De Villers-Sidani et al., 2007; Polley et al., 2013) in the mouse, with outer hair cell (OHC)-based sensitivity allowing detection of near 0 dB SPL sounds. The developing auditory pathway is functional before ECO (A. Chang et al., 2018), as known from ACX sound-driven responses obtained by surgically opening the ear canal (Meng et al., 2020) in mice. However, moderate-intensity sounds, such as conversational levels (∼70 dB SPL), may drive spiking activity in the ACX before ECO without surgically opening the ear canal through bone conduction (Sohmer et al., 2001). The presence of responses before ECO suggests that activity-driven plasticity in the ACX may start earlier than the known critical period (P11-P15, after ECO). However, previous studies with a single tone exposure at ages before ECO show no changes in tonotopy (De Villers-Sidani et al., 2007; Barkat et al., 2011) or ACX organization in the adult. Thus, even if responses in the ACX exist before ECO, it is important to understand the response properties in more naturalistic stimulus paradigms to test the possibility of activity-driven ACX plasticity starting before ECO.
Thus, we first investigate the possibility of ACX responses of the L4 cortical plate and SPNs in mice with ear canals closed as would be required for the earliest natural sensory experience to drive cortical activity. We show that ACX sound-driven activity with ear canals closed starts as early as postnatal day 7 (P7) with multiple techniques. We further show that, before ECO, SPNs have unique response properties. SPNs, unlike cortical plate neurons, show selectivity to oddball sounds in a stream of repeating sounds before ECO, while after ECO, cortical plate L4 neurons show stronger deviant detection than remaining SPNs. The observed deviant detection shown by SPNs in the earliest phase can direct the thalamus to L4 synapses (Kanold et al., 2003; Kanold and Shatz, 2006). The above opens the possibility that specific kinds of most initial sensory experiences can modify cortical development. For example, exposure to music or the structured sound environment during the fetal period leads to long-term plastic effects with enhanced responsiveness to the sounds experienced in the prenatal period (Partanen et al., 2013). We use a new auditory exposure paradigm with a low probability, salient/oddball tone (deviant [D]) in a stream of a repeatedly occurring tone (standard [S]). We show that continuous exposure to the above S-D acoustic environment at ages before the established auditory critical period (before ECO) (Hensch, 2005; De Villers-Sidani et al., 2007; Barkat et al., 2011; Takesian et al., 2018) can remarkably alter the functional ACX responses, which persist into adulthood. The alterations are specific to the deviant or rare exposure tone (D). Performing control exposures with only a single tone during the same developmental window do not alter adult ACX responses. The observed plasticity is an outcome of the unique deviant detection properties we find of SPNs, before ECO, with strong responses to only the rare stimulus (D). We use a computational network model derived from our experimental results and known thalamus, SP and L4 connectivity to better understand the mechanism of such plasticity. Further, using mutual information maximization and sparse coding principles, we show that the outcome of early exposure with our new paradigm can be predicted theoretically. Hence, our results revise the auditory critical period's established timelines and concept and may generalize to other sensory systems.
Materials and Methods
Subjects
All experiments were performed on mice, Mus musculus of either sex, and all procedures were approved by the Institutional Animal Ethics Committee of the Indian Institute of Technology Kharagpur.
Acute craniotomy for wide-field imaging and 2-photon Ca2+ imaging
Animals were initially anesthetized under isoflurane (induction at 5% isoflurane, followed by 2% isoflurane). After removing the skin and tissue above the skull, a metallic head plate was implanted above the estimated auditory cortical area. Craniotomy of ∼3 mm diameter was performed over the left auditory area. Smaller size craniotomy was preferred, as it gave us the advantage to minimize brain motion. A craniotomy was filled with 1.5% low melting-point Agarose. Immediately after that, a 5 mm coverslip was embedded over the craniotomy to dampen the pulsation. Animals were then transferred to the 2-photon imaging setup, and the level of anesthesia was reduced to 0.1%-0.2% isoflurane during the recordings. For recording responses at lower sound levels, awake recordings were performed at age P9-P10. For awake recordings, immediately after completing the surgery, pups were transferred to the sound chamber and allowed to recover from anesthesia. This time window also allowed pups to habituate before the recordings were performed. Throughout the experiment, the animal temperature was maintained at 37°C by a heating pad.
Wide-field imaging
For wide-field calcium imaging, imaging signals were acquired using a 14-bit Cool SNAP HQ2 CCD camera (Photometrics). Images of surface vasculature were acquired using blue LED illumination (470 nm), and wide-field GCaMP6f signals were recorded at ∼30 Hz using blue illumination (470 nm) (Gee et al., 2014). Fluorescence signals were acquired at 4× binning of 1040 × 1392 pixels, using a 4× objective (Olympus). Based on the auditory landmarks (medial cerebral artery and rhinal vein position) and the responses to different frequencies, the ROI and the focal plane of the image (150-300 µm) were moved to desired coordinates for performing 2-photon imaging.
Two-photon calcium imaging
Two-photon calcium imaging was performed using a commercially available 2-photon microscopy system (Prairie View Technologies). The microscopy was controlled by Prairie View software (Prairie View Technologies). The excitation beam of wavelength 860 nm generated from Insight laser (Spectra-Physics) was directed over the exposed brain volume for imaging. The laser was delivered through a 20×/0.8 NA water immersion objective (2 mm WD, Olympus). The laser power was adjusted from 50 to 80 mW depending on the condition of the specimen. Frames in the ROI (∼150 µm × 200 µm with 1.16 µm pixel size) were imaged at ∼4-6 Hz (160-250 ms frame period, 1.2-4 µs dwell time) with the stimulus presented at usually the sixth (to the 10th) frame in a sequence of 20-30 frames per stimulus.
Auditory stimulation for imaging and electrophysiology
The stimulus was presented through the electrostatic speaker (ES1) placed 2 cm away from the right ear of the mouse pups. Stimulus was generated through custom-written software in MATLAB (The MathWorks), passed through TDT RZ6 multifunctional processor. Acoustic calibrations were performed using microphone 4939 (Brüel & Kjær) and showed a typical flat (±7 dB) calibration curve from 4 to 60 kHz. Auditory stimuli consisted of broadband noise (6-48 kHz, 40 repetitions), tones (40 repetitions of respective frequencies), and series of sound tokens representing the SD paradigm (30 repetitions). Only for 1 of the animals (P12, after ECO), 5 repetitions of each pure frequency tone were delivered. Each of the stimuli was interleaved with a gap of 5 s. The loudest intensities of sounds at 80-90 dB SPL were presented to the ear to allow mechano-transduction at younger ages (before ECO). However, in awake recordings, 50-90 dB of sounds were presented at age P9-P10.
The stimulation apparatus used for electrophysiology experiments were similar to those used for imaging, except that a TDT RX6 multifunctional processor was used to generate the stimulus. Furthermore, the stimulus was presented inside the soundproof chamber, 10 cm away from the right ear (contralateral) of the mouse.
Anatomical labeling
Nissl staining was performed at different age groups to identify the SP layer and to guide in vivo electrophysiology experiments. Mice of ages P6 (n = 2 mice), P13 (n = 2 mice), P16 (n = 2 mice), and P18 (n = 2 mice) were initially perfused, and the brains were harvested. Since ECO is after P11 and as the cortical size stabilizes by P21, the above ages were chosen initially for the characterization. Nissl stains were performed after obtaining coronal sections of the auditory regions based on mouse atlas (Gould et al., 2012).
Immunohistochemistry (IHC) was also performed on brain slices across different developmental time points (P06-P07, n = 2 mice; P08-P11, n = 3 mice, P12-P13, n = 5 mice; P14-P16, n = 2 mice; P17-P19, n = 2 mice; P20-P28, n = 3 mice; >P28, n = 2 mice) with known markers of SP, which are Complexin 3 (Cplx3), Synaptic Systems (122302); and Connective Tissue-Derived Growth Factor (CTGF), Santa Cruz Biotechnology (sc-14939). PFA fixed brain tissue was embedded in a 3% agarose block. The brain embedded block was fixed on a vibratome (Leica Microsystems) sectioning platform; 50-μm-thin sections were cut, blocked, and permeabilized in a buffer containing 1% BSA, 0.3% Triton X, and 10% Normal Donkey Serum (for CTGF) or 10% Normal Goat Serum (for Cplx3). Sections were incubated in primary antibodies (Cplx3: 1:500 and CTGF: 1:500) overnight and followed by washing with 1× PBS 5 times. After that, sections were incubated for 2 h in secondary antibodies (Cplx3: anti-rabbit 594, Invitrogen; and CTGF: anti-goat 488, Invitrogen) diluted in the buffer, 1% BSA, 3% serum, and 0.3% Triton X at room temperature. Sections were rinsed, mounted, and imaged under epifluorescence microscopy. Depth of SP in different regions and at different developmental time scales was determined from these initial studies and was used as a guide for performing electrophysiology recordings. Further confirmation about recordings from input layer (L4) was also obtained from SCNN1-td-tomato (cross of SCNN1-cre and floxed-td-tomato: JAX mice) mice, which have SCNN1-expressing neurons (specific to L4) labeled, allowing confirmation of recordings in L4. Sections were imaged using epifluorescence microscopy (Xcite illumination, Bruker) and analyzed for cytoarchitecture using custom-written codes in MATLAB.
In vivo extracellular single-unit recordings
A standard surgical procedure was performed as mentioned above in in vivo 2-photon imaging recordings. Extracellular recordings were performed from the left ACX using a tungsten microelectrodes array of impedance 3-5 Mega ohms (MicroProbes). A 4 × 4 custom-designed metal microelectrodes array with an interelectrode spacing of 125 μm was lowered into the ACX using a micromanipulator (MP-285, Sutter Instrument). We also performed recording using arrays separated by 400 or 600 μm in length to record across laminae simultaneously (e.g., L4 and SP). Signals were acquired after passing through a unity gain head stage (Plexon, HST16o25) and followed by PBX3 (Plexon) preamp with a gain of X1000. Wideband signal (used to extract local field potential [LFP], 0.7 Hz to 6 kHz) and spike signals (150 Hz to 8 kHz) were obtained in parallel and acquired through National Instruments Data Acquisition Card (NI-PCI-6259) at 20 kHz sampling rate, controlled through custom-written MATLAB (MathWorks) routines. Further, all online/offline analysis was performed using custom-written MATLAB routines. A laminar profile of responses was obtained from the primary ACX (A1) by advancing the electrodes in depth at 100–150 μm steps.
Since the size and location of A1 overdevelopment varies, it is essential to confirm the location of recording to be A1. For A1 recordings, additional physiological information based on tuning, the existence of broad tonotopy, across electrodes in the recording array, latency of responses, and selectivity to tone stimuli were used to confirm recordings in primary auditory areas (A1 and anterior auditory field [AAF]). The location of electrode tracks and comparison with mouse brain atlas and landmarks (location relative to the rhinal vein, size of the hippocampus in coronal slices) also provides additional confirmation of recording location. In many experiments, after recording a DiI crystal (Torii and Levitt, 2005), a lipophilic tracer was inserted into the cortex at the recording location. After allowing passive travel of the tracer for 20-30 d at 35°C, brain slices were cut to look for labeled fibers and cell bodies in the ventral division of the MGB (lemniscal structure projecting to A1), allowing confirmation of recordings in A1. We further confirmed recordings from ACX by electrode location (using electrode tracks) in primary regions and from broad tonotopy observed. The recording depths targeted for L4 and SPN were based on known depths of L4 and SPNs in ACX; also, in a separate group of mice (P8-P11, n = 3, P12-P13, n = 7; P14-P16, n = 5), depth of SPNs was determined to be >800 µm based on two methods. The first by cell morphology in Nissl stains of coronal slices and the second by immunostaining coronal slices containing A1 with antibodies of SPN markers, Complexin3 and CTGF, at different ages and depth range of L4 from the expression of SCNN1 (B6.Cg-Tg(Scnn1a-cre)3Aibs/J mouse, Jax Labs, 09613 crossed with CAG-tdTomato mouse Jax Labs, 07909), an L4 specific marker. In a subset of mice, we confirmed the electrode recording site in the SP layer from electrode tracks along with immunostaining as above and electrodes in L4 by performing recordings in the SCNN1-cre-td-Tomato (n = 3, P12, P14, and P18) mice. Depths of other layers (L2/3 and L5/6) were determined based on the above results.
Exposure protocol
For another set of experiments, animal litters were exposed to sound sequences inside a soundproof chamber with a maintained 8 h light and 16 h dark cycle, run continuously for 5 d in 4 age groups (P0-P5/6, P6-P10/11, P11-P15/16, and P16-P21).
The litters were kept along with the breeders in a cage. Each exposure stimulus was synthesized online and presented through an electrostatic speaker, kept 5 cm above the cage, using custom-written codes on MATLAB. All the age groups were presented with a sequence of stimuli having 50 ms sound tokens with a 250 ms intertoken gap at 70 and 90 dB SPL. The sequence (SD) had a token of two frequencies: a standard (S) having an arrival probability of 90% and a deviant (D) arriving with a 10% probability. For the only standard exposure (SS), there was no D frequency. After sound exposure, animals were returned immediately to home cages in the animal house and reared up in a normal environment until experiments were performed. During recording sessions, mapping of the primary ACX was performed. Tone pips (50 ms, 5 ms up down ramps, 6-48 kHz, 0.25 octaves difference at 10-50 dB above threshold, 10 dB increments, 5 repetitions of each frequency-intensity combination) were presented to the right ear. Extracellular responses were recorded from the thalamo-recipient layer (deep layer, L3 and L4) at depth of 350-500 μm. A frequency response area was constructed for each unit for a minimum of three sound levels played. Single units from A1 having response peak latency within 9-31 ms were included in the analysis. This window was selected based on the population mean and variance of response peak latencies obtained across all sound levels.
Data analysis
Wide-field imaging analysis
Wide-field imaging analysis was performed using custom codes written in MATLAB (The MathWorks). For analysis of the wide-field data, raw images collected were binned at 4× and then smoothed with a moving window of 4 × 4. Further analysis was done on the smoothed images. First, a one-sided t test across all the iterations was performed between the 5 frames immediately preceding the stimulus onset, and a moving window of 5 frames after the stimulus onset in all 130 µm × 130 µm square areas to find the significant pixels. The pixels, which gave a p-value of <0.05 on at least one window after the stimulus onset, were considered for further analysis. The mean change in df/f with time traces was constructed for pixels showing significant positive responses. These traces were further smoothed with a moving average window of 5 frames, and a one-sided t test was performed again to find the first significant poststimulus frame and the frame with the most significant response.
Two-photon imaging analysis
Two-photon imaging analysis was performed using custom codes written in MATLAB (The MathWorks). Imaging sequences were aligned by performing X-Y drift correction. Cells were selected manually by selecting the center point of the cell on the motion-corrected mean image. ROIs (5 µm radius) were drawn based on the cell centers. Raw fluorescence signal over time (F) of the selected ROIs across all frames was extracted. Relative fluorescence for each trial was computed by using ΔF/F0 = (F – F0)/F0), where F0 corresponds to baseline fluorescence. Baseline fluorescence amplitude was estimated by calculating the mean fluorescence values over all the preceding stimulus frames, except the first 2 frames. Neurons on stimulation to tones and noise were considered to be significantly responsive as follows. One-sided paired t tests were performed between mean df/f over three of the successive frames before the stimulus and each one of 6 windows of three consecutive frames after stimulus onset (up to the eighth frame, ∼1.5-1.8 s). If any of the 6 moving windows showed a significant increase in mean df/f (40 repetitions) over baseline, the neuron was considered responsive. Multiple comparisons correction was not used as we use moving average data, which make successive frames dependent. Further presence of response introduces correlations indicated by significant correlation coefficient between frame number and mean responsive df/f and significant autocorrelations up to lags of four frames.
For significant responses in single trials to compute reliability, a trial was deemed responsive if the mean df/f of 4 frames (∼1 s) after the stimulus after stimulus onset was higher than 1 × STD (STD obtained from 40 repetitions) of the mean of the four baseline frames preceding the stimulus. Population analysis was performed on only those neurons which evoked significant responses to >25% of the trials (>25% reliability). Reliability was computed as the proportion of trials that showed significant responses of 40 trials. For comparing reliability across age, we have considered all the neurons recorded at each age regardless of the significance.
Identification of SP layer
The identification of the SP layer was based on separation drawn by the sparse cell zone above and the horizontally oriented cytoarchitecture of the SP neurons. Based on the above features, manual points (perpendicular to lamina) on slice images were selected to define the depth and width of the SP layer using MATLAB. A similar analysis was performed on the immunolabelled brain slice images.
LFP analysis
For LFP analysis, baseline shifting was performed to have a mean of baseline 200 ms preceding the stimulus at 0. To obtain LFP, acquired wideband signals were notch filtered (50 Hz, Butterworth eighth order, to remove AC supply line noise) and then bandpass filtered (between 5 and 300 Hz, Butterworth second order). LFPs were smoothed further using a Gaussian window, SD 150 ms. LFP responses crossing the ±200 mV threshold throughout the recorded time window were excluded from population analysis. A spontaneous window of 400 ms before the onset and a response window of 500 ms after the onset were considered for testing the significance of LFP response. The entire stretch of 900 ms (spontaneous + response) was divided into 9 bins of 100 ms each, making 4 bins in the spontaneous window and 5 bins in the response window. A one-sided paired t test was performed between the mean of the four spontaneous bins and each of the five response bins, and only those traces with a p-value <0.05 in at least one of the five response bins were taken for further analysis. The normalized population root mean square (RMS) plots were plotted for these traces, where each of the post-onset RMS values was normalized with the mean of the four spontaneous RMS values. Slopes of RMS values for a period after stimulus onset (bin size 100 ms) obtained from significant responding sites from the L4 and SP layer were used to evaluate long-lasting oscillatory activity.
Rate calculation
The neuronal spike rate was computed by calculating the mean response within stimulus duration or token. Only single units that responded (compared with spontaneous activity 400 ms preceding stimulus, two-tailed t test, p < 0.05) to at least one of the stimuli (sound token f or N as standard or deviant) used were considered for analysis.
Latency analysis
For each neuron, tuning parameters (best frequency [BF] and latency) were defined using custom-written codes in MATLAB. After ECO, the BF for a neuron is the frequency that elicits a maximum response within stimulus duration (50 ms) to a given frequency at a given sound level. At ages after ECO, latency to stimulus onset was defined as the time bin (5 ms resolution) at which peak response was observed for a BF. For calculating latency at ages before ECO, we first considered all peristimulus time histograms (PSTHs) of neurons binned at 50 ms with z score >2 within 500 ms after stimulus onset. After that, a peak response within 300 ms after stimulus onset was selected as the latency to the presented stimulus.
Adaptation time constants
A time course analysis was performed to fit the response rates for SSS….SDS….SS stimuli. Across all age groups and cortical layers, we used exponential fitting to only those responsive units with significant tone responses as standards. The average spike rate in each token before the deviant was fitted with a first-degree exponential curve to compute exponential decay time constants, based on the least square error method. Fitting was performed using the 'fit' function in MATLAB.
Construction of reference frequency (RF) plots
To investigate whether the effect of exposure on spike rate was frequency-specific, the RF plots were constructed to show the rate for the chosen RF (S or D) as a function of its distance from the BF for each tuning curve (see Fig. 7C). All the tuning curves obtained were divided into groups based on the difference between BF and RF in octaves (in steps of 0.25 octave; see Fig. 7C, different colored tuning curves). The mean rate at RF obtained from these tuning curves was used to make the RF plot. For example, in Figure 7C, we look at the blue tuning curves where the BF (black arrow) is 0.5 octaves higher than the RF (pink arrow). We collected the rate at RF (depicted by ♦) and plotted its mean value (depicted by ♦) with its X coordinate as the difference between BF and RF in octaves (i.e., 0.5). This was repeated for all the tuning curves, and the RF plot was constructed.
Absolute and normalized spike rates at BF
The absolute spike rates at BF were obtained from the center of the RF plot, where the difference between BF and RF was zero (BF = RF; see Fig. 7E). The RF plot was constructed 13 times for the control data, considering all 13 frequencies as RF. Thus, we got 13 different RF plots. First, the mean of the RF rates from all these plots at the center (where BF = RF) was taken as the absolute firing rate at BF for control. Further, the maximum of all these values was taken as the maximum boundary of the control (see Fig. 7E, gray dashed line). The normalized spike rate at BF was calculated by dividing the RF rate at the center of the RF plot (where BF = RF) with the mean of the RF rates where the absolute difference between BF and RF was >0.5 octaves (|BF-RF|>0.5 octaves).
Connectivity between pairs
Cross-correlation between pairs of PSTHs (20-30 repetitions of SD stimulus and swap, bin size = 5 ms) was used to detect connections between simultaneously recorded pairs of neurons (SP-L4 pairs). Bootstrap analysis was done to get variability in the cross-correlograms (MATLAB 'xcorr' function, up to lag of 20 ms) by randomly resampling (same number of repeats from the existing 20-30 repeats) with replacement, 100 times. Thus, a mean and SD (dark line with error bar) of the correlograms were obtained. To compare with spurious cross-correlations, all repeats from both neurons were mixed to give 2× number of repetitions spike trains, and randomly a number of reps of them were assigned to each of the neurons; and from these PSTHs, cross-correlograms were obtained with mean and SD by repeating the process 100 times (gray line with error bars). Nonoverlap of error bars was considered to be significant connections between the pair of neurons. We used 1.65 SD for data in P12 and above groups and 1 SD in before ECO group because of the large difference in spike rates of single units in the 2 cases.
The cross-correlation at best provides a linear time-shifted dependence between the activities of L4 and SP; although convincing enough for our result, it is unable to capture the changing temporal covariability between the two signals. To address this issue, we also verified the connectivity proportion results using Granger causality (GC), which provides more stringent criteria to find the direction of the information flow (causation). We performed this between the mean PSTH in 5 ms binning of the SP and L4 responses.
Details of binary network model of L4 and SP with TC inputs
We built a binary neural network consisting of two separate thalamic input units for standard and deviant. The thalamic units project to a SP neuron and further to a cortical neuron. The SP neuron also projects to the cortical neuron. A similar model was used to demonstrate ocular dominance in the visual cortex (Kanold and Shatz, 2006; Kanold and Luhmann, 2010). The spike rate of the two thalamic inputs is given by the following:
The synaptic weight of the thalamo-SP projections was kept constant at a value of 0.2. The projections received by the cortical neuron were kept plastic, and learning was implemented following the below plasticity rule.
Where
We performed simulations for two regimens: one for the early development phase and one for the later. For the early phase, we kept the synaptic weight of the SP projections at a value of 0.22 and thalamocortical projections at a smaller value of 0.02. It was done to mimic the early stages of cortical development where the thalamic projections are unable to drive the cortical activity alone and are aided by the SP for driving the cortical activity (Kanold and Shatz, 2006; Wess et al., 2017).
In the later phase, both of these weights were kept at an intermediate value of 0.04 and 0.12, respectively. This reflects the intermediate stage of development where the thalamocortical projections are strengthening, and the SP-cortical connections are weakening (Wess et al., 2017).
All the synapses were kept depressing throughout the simulation with the following equations governing their dynamics (Mill et al., 2011):
Where M = 1 whenever a spike occurs.
For thalamus,
For SP,
The value of τir was chosen per our electrophysiology data, which shows a decrease in adaptation in later ages.
The SP and the cortical neuron were modeled on the principles of integrate-and-fire neuron. The membrane potential can be written as follows:
Where g(t) is the conductance and can be written as follows:
The threshold of the L4 neuron was kept constant at 0.1 for both the regimens, whereas that of the SPN was kept at 0.05 for the early regimen and 0.15 for the late regimen.
The refractory period dynamics after each spike were modeled with a separate exponential given by the following (Kanold and Shatz, 2006):
The voltage drops by the above value after every spike. The refractory period lasts for 20 ms after the spike. A leak of 10% was also incorporated (Kanold and Shatz, 2006).
The model gets a drive from the thalamic activity, modeled by uncorrelated Poisson spikes in the two thalamic units. The stimulus train consisted of standard and deviant tokens occurring with a probability of 0.9 and 0.1, respectively, in the standard deviant (SD) protocol and consisted of only standard tokens in case of only standard protocol. Each token is 50 ms long, separated by a 250 ms intertoken interval with a sampling rate of 1000/s.
Results
Mouse ACX and inferior colliculus (IC) have auditory responses before ECO
We performed wide-field fluorescence imaging of the ACX in Thy1-GCaMP6f (Jax Labs) at early ages (Fig. 1A, left) before (N = 10 mice; P7-P8 = 3, P09 = 4, P10 = 4 mice) and after ECO (P13, Fig. 1A, right), to investigate the presence of sound-driven Ca2+ responses in the ACX. After ECO, by P13, mice show strong Ca2+ responses and defined rostro-caudal gradient of high to low frequencies (Stiebler et al., 1997), marking out the primary ACX (A1) and other auditory responsive regions (Fig. 1A, right). Since the auditory pathway is immature yet functional before ECO, we next investigate the presence of auditory responses to tones in the ACX of mice before ECO (De Villers-Sidani et al., 2007; Polley et al., 2013) without surgically opening the ear canals. We found that sound-evoked Ca2+ responses are present in the mouse ACX at high (90 dB SPL) to moderate intensities (70 dB SPL, in awake state) at ages before ECO (Fig. 1B and Fig. 1D, respectively). At very young ages, the percentage of the responding area is <5%. However, only a positive change in response is considered; hence, the significance is being tested at 2.5%. With increasing age, before ECO, a progressively higher percentage of areas of the cortical surface showed significant responses to multiple tone frequencies (Fig. 1B; Table 1), from 3% in P7 to 48.2% in P10 and then 84.2% in P13 after ECO (Pearson correlation, r = 0.96, p = 0.007, Fig. 1C). The percentage of frequencies of the tone stimuli tested that showed significant wide-field imaging responses also progressively increased (Table 1) from 20% in P7 to 71% in P13. Both of the above reflect the gradual maturation of the ACX, starting from the onset of sound-responsive activity at P7 to achieving tonotopy in A1 by P13. For further confirmation of sound-evoked responses in the auditory pathway before ECO wide-field fluorescence imaging in the IC of Thy1-GCaMP6f (Jax Labs) mice at P9-10 was performed. As in the ACX, we found significant sound-evoked responses in wide-field fluorescence on the IC surface (Fig. 1E, N = 2 mice, P9: 31.32%, P10: 38.06%).
Percentage of responding frequencies and areas over age
Wide-field fluorescence imaging-based responses to tones in the ACX and IC of Gcamp6f mouse before ECO. A, Left, Experimental timeline. A, Right, Wide-field fluorescence imaging performed at P13 Gcamp6f mouse shows defined rostro-caudal gradient of high to low frequencies. Smoothed df/f traces with error bars indicating SEMs of selected regions (square) responding to selected frequencies (color) are shown to the right of each image. B, Wide-field fluorescence imaging-based significant response regions (white contours) in the ACX to 90 dB SPL tone in 4 different ages (P07-P10) before ECO. Black horizontal bar represents the stimulus duration. C, Percentage responding area at respective ages (Table 1). D, Same as in B, wide-field fluorescence imaging in the ACX based significant response to 70 and 80 dB SPL tone at P09 and P10 awake mouse. Respective df/f traces with SEMs of the selected region are shown at the bottom of each figure. E, Same as in A, wide-field fluorescence responses before ECO in the IC at P9 and P10 to 90 dB SPL tone. Purple “.” represents the first significant window. Sound-evoked Ca2+ responses are present in the mouse ACX at high (90 dB SPL) to moderate intensities (70 dB SPL) at ages before ECO (one-sided paired t test): *p < 0.05; **p < 0.01; ***p < 0.001; black, unless the p-value is specified).
L2/3 and L4 ACX single neurons have responses before ECO
The presence of significant auditory responses in Ca2+-dependent fluorescence in wide-field imaging (Fig. 1) could potentially indicate either the activity of input axons or spiking responses of neurons in the ACX, or both. To rule out only input axon activity in the ACX wide-field imaging, we performed 2-photon Ca2+ imaging of single neurons in L2/3 and L4 of the ACX of mice (N = 14 mice) before ECO and after ECO (N = 2 mice; P12-P13) (Fig. 2A) with high to moderate intensities of tones. Each 50-ms-long tone stimulus was repeated 40 times and consisted of 3-7 frequencies from 6 to 48 kHz, half octave apart, presented at 60-90 dB SPL, with a 5-7 s gap between repetitions. Single neurons in L2/3 in partially anesthetized mice showed Ca2+ transients in response to tones at ages before ECO (P7-P9, Fig. 2B, top). The population mean df/f in response to different frequency tones (Fig. 2B, bottom) was investigated, considering all the neurons that responded with at least 25% reliability (fraction of repetitions, showing a significant response; see Materials and Methods). We found an increase in the mean population response strength before ECO (N = 14 mice; P07-P08 = 3, P09 = 5, P10-P11 = 6, ANOVA with Tukey post hoc analysis, F(2,576) = 33.5, p <10−4, Fig. 2C). The above indicated a gradual maturation of responses up to ECO (P12-P13) in L2/3 ACX neurons. Along with an increase in tone response strength over age, response reliability increased significantly with age (Fig. 2C, bottom, N = 14 mice; P07-P08 = 3, P09 = 5, P10-P11 = 6, ANOVA with Tukey post hoc analysis, F(2,6955) = 375.84, p < 0.0001) before ECO, also showing gradual response development. The percentage of overall cases (imaged neuron–tone stimulus pairs) that showed significant responses also showed a gradual increase (Pearson correlation, r = 0.97, n = 6, p < 10−3) with age from 2.32% at P7 to 20.2% at P12/13 (Table 1). Furthermore, neuron–tone stimuli pairs also showed higher significant responses to the presented different frequencies (Table 2) as age progressed. All the above observations indicate the presence of sound-driven Ca2+ activity before ECO in single neurons and that such activity varies with age in a manner expected from increasing maturity of the neural pathway.
Sound-evoked activity of ACX L2/3 and L4 neurons in the Gcamp6f mouse before ECO. A, Experimental timeline. B, Top, Example traces of df/f of single L2/3 neurons in response to 0 dB attenuation tones, respectively, at three different ages (P08, P09, and P13). Black vertical line on the bottom of each trace indicates stimulus onset. Corresponding green vertical shading bar in each example trace represents a significant moving window (3 frames), which shows the earliest significant p-value among all the frames (corresponding p-value shown above, one-sided paired t test, *p < 0.05). Bottom, Population mean df/f of the age groups (P07 = 35, P08 = 51, P09 = 299, P10 = 110, P11 = 84, P12-P13 = 707, neuron–tone stimulus pairs; Table 2) of L2/3 neurons shown above. C, Top, Box plot showing response peak strength of all L2/3 neurons imaged at different ages (14 mice; P07-P08 = 3, P09 = 5, P10-P11 = 6, ANOVA with Tukey post hoc analysis, corresponding significance value shown to the right). C, Bottom, Box plot showing response reliability of all L2/3 neurons imaged at different ages, as shown above (ANOVA with Tukey post hoc analysis, ***p < 0.001, corresponding significance value shown to the right). D, Similarly, as shown in B, example traces of df/f of single L2/3 neurons of ACX in response to 60 dB-90 dB SPL tones at age group P9-P10 of an awake mouse, with corresponding significance values of the earliest significant window (3 frames) obtained. Bottom, Population mean df/f of L2/3 neurons across different sound intensity level (60-90 dB SPL, 4 mice; 60 dB: 107, 70 dB: 556, 80 dB: 348, 90 dB: 359 neuron–tone stimulus pairs; Table 3). E, Extended 2-photon Ca2+ imaging data, showing population mean df/f (below) at two ages (P09, 2 mice, 666 neuron–tone pairs and P10, 138 neuron–tone pairs) before ECO, indicating long (>2 s) spontaneous window. F, Population mean df/f of the same age groups in response to sound (left) and the same neurons without sound stimulation (right), with 40 repetitions in each case (P09-P10, 3mice, 461 neuron–tone pairs). G, Left, Example traces of df/f of 3 different L4 neurons (cyan) in response to 0 dB attenuation tones at two different frequencies (top row, 10 Khz and bottom row, 17 kHz). Green patch represents the earliest significant moving window obtained with respective significance values. Right, Population mean df/f of L4 neurons which showed significant responses (P9-10, 2 mice, 64 neuron- tone pairs). L2/3 and L4 ACX single neurons respond to sound before ECO (one-sided paired t test, *p < 0.05; **p < 0.01; ***p < 0.001, black asterisk, unless the p-value is specified).
Percentage of significantly responding neurons at different frequencies and overall cases
Fraction of cases (neurons in response to tones) at different intensities showing significant responses
Details of all animals, litters, number of units, and number of tuning curves in exposure and control experiments
In a subset of mice with stable recordings from neurons at all the different frequencies tested (6-48 kHz, 0.5 octave steps, in N = 7 mice, P7-P08 = 2, P09-P10 = 3, P12-P13 = 2, of 14 mice reported above), we investigated the tone peak response strength and response reliability to the BF of the neuron. The above allows ruling out any bias of the effect of possible same few neurons responding to different frequencies. We found a significant increase in strength (ANOVA with Tukey post hoc analysis, F(720) = 82.02, p < 10−3) and reliability (F(720) = 215.98, p < 10−3) of the response of neurons to tones at their BF as a function of age. Furthermore, the percentage of responding neurons also increases with age from P7-P8 (18.75%) to P12-P13 (78.40%).
The wide-field Ca2+ responses at moderate intensities (Fig. 1D) were also ruled out to reflect only input axon activity as single neurons of L2/3 showed clear Ca2+ transients in response to tones down to 60 dB SPL (Fig. 2D, top) in awake young animals. The population mean df/f traces of all neurons (N = 4 mice; 90 dB: 359, 80 dB: 348, 70 dB: 556, 60 dB: n = 144 neuron–tone stimulus pairs) that responded significantly (with 25% reliability; see Materials and Methods) to tones of different frequencies (8.5 kHz: 28.52%, 12 kHz: 52.15%, 17 kHz: 62.03%) at the four intensities used, showed an increase in response strength with increasing intensity, except 80 dB SPL (Fig. 2D, bottom). The percentage of cases of neuron–tone stimulus pairs that showed significant responses also showed a similar trend, increasing from 14% at 60 dB SPL to 69.57% at 90 dB SPL (Table 2).
The previous set of data were collected with usually a little over 1 s of baseline preceding the stimulus. In order to rule out a contribution from any coincident spontaneous oscillating activity (Blankenship and Feller, 2010) to the response, we obtained extended 2-photon imaging data before ECO (N = 3 mice, P9-P10, 6 ROIs) with >2 s of baseline (Fig. 2E) and also with and without sound stimulation (Fig. 2F). The population mean df/f traces of all neurons in L2/3, showing significant activity above baseline in response to tones at P9 (n = 666 neuron–tone pairs) and P10 (n = 138 neuron–tone stimulus pairs), portray the same nature of responses as in Figure 2B. The long duration mean baseline activity from 40 repetitions fluctuated at ∼0, as expected. We confirmed the presence of sound-evoked responses before ECO, considering any coincident ongoing spontaneous oscillations by comparing mean df/f traces of all neurons imaged in with and without sound cases (N = 3 mice, n = 461 neurons-tone pairs, P9-P10, Fig. 2F). No significant activity was observed under the no-sound condition.
As in L2/3, putative L4 single neurons based on the depth of 2-photon imaging (300-350 μm from pia) also showed Ca2+ responses. Mean df/f-based responses of three representative L4 neurons to 2 frequencies are shown in Figure 2G (left). As in L2/3, the population mean of all significantly responding cases in L4 (N = 2 mice, n = 64 neuron–tone stimulus pairs) showed responses to sound of different frequencies (8.5 kHz: 10.34%, 12 kHz: 11%, 17 kHz: 10.94%, 24 kHz: 12.80%, 34 kHz: 7.14%) with >2 s of baseline activity showing no spontaneous oscillatory activity (Fig. 2G, right). Thus, based on 2-photon Ca2+ imaging of single neurons, we find both L2/3 and L4 to be tone sound-responsive before ECO, showing responses as early as P7 and down to moderate intensities of 60 dB SPL. The above data indicate that wide-field fluorescence imaging-based sound responses in ACX are not because of only input axon-related activity.
SPNs in mice are driven by sounds before ECO
Two-photon Ca2+ imaging of single neurons, simultaneously from L4 and SPNs, was beyond our scope (Yildirim et al., 2019). Although SPNs are known to be the first to respond to sounds in ferrets, equivalent properties of SPNs in the mouse at the corresponding age of P6 (Wess et al., 2017) are unknown. Further, our Ca2+ responses do not show the nature of spiking activity in L4, thus requiring extracellular electrophysiology in mice before ECO. To reliably target SPNs with electrodes at different developmental ages, we first characterized the depth of SPNs based on SPN-specific markers (Hoerder-Suabedissen et al., 2009; Osheroff and Hatten, 2009) CTGF and Cplx3 (Fig. 3A, B, green and red, respectively) using IHC. Furthermore, cell morphology from Nissl stains (Viswanathan et al., 2012) (Fig. 3D) was also used to characterize the SPNs depth. Electrode tracks were also routinely confirmed to be in the SPN layer (P12, P13, and P18, Fig. 3C, top). In a subset of experiments in SCNN1-td-tomato mice (Jax 9613, P14, Fig. 3C, bottom) with staggered electrodes, confirmation of electrodes being simultaneously in L4 (L4 neurons expressing td-tomato) and SPN (from IHC, Fig. 3C) was done (N = 3 mice, P12, P14, and P17). The thickness of the SPN layer reduced with age, as known (Wang et al., 2010; Hoerder-Suabedissen and Molnár, 2013). At multiple ages, in subsets of experiments, confirmation of electrode location in ACX was done with DiI crystal insertion in the recording site and observing labeling of MGBv (Fig. 3E). Also, in higher age groups, A1 recording was confirmed with the presence of general tonotopy (Fig. 3F) based on the spatial location of BFs of units from multiple electrode penetrations.
Histologic confirmation of recording depth for L4 and SP. A, IHC showing labeling of SP neurons with antibodies of SPN markers, Cplx3 (red) and CTGF (green). The location of SPN is marked by an arrow (magenta). B, SP depth determined with IHC by labeling SPNs with SPN markers Cplx3 (red) and CTGF (green). C, Slices were cut from brains (fixed) in which single-unit recordings were performed in the ACX (representative examples at 3 ages, P12-P14, top), and IHC was performed. Electrode tracks (dotted white lines) show recordings performed in SPN and, in the case of dual recordings in the SCNN1-td-tomato mouse (P14), show electrodes in L4 and SPN simultaneously. The location of SP (magenta arrow) and L4 (cyan) neurons is marked with respective colored arrows. D, Depth and thickness of SP as a percentage of cortical thickness over age determined from cell morphology with Nissl stains (top). Colored arrows on Nissl images indicate the respective location of L4 (cyan) and SP (magenta) layers. E, Representative confirmation of recording in A1 in lower age groups P12, P13, P18 with DiI injections in the recording site with labeling in MGBv (inset). F, Sample locations of recordings of single units mapped onto the cortical surface showing broad clearer tonotopy in P29 than in P15. Black arrow indicates the location of a branch of the rhinal vein, representing a landmark for ACX.
We next performed extracellular recordings in the mouse before ECO (P7-P12) from the ACX SPNs and L4 simultaneously with custom-made staggered electrodes arrays (4 × 4 arrays x-y spacing 125 μm, 8 in SPN and 8 in L4, 400 or 600 μm apart in-depth, Microprobes for Life Sciences). Raw LFPs have been used extensively in the literature to support responses (Liu et al., 2015; De Assis et al., 2016), especially to refer to local synaptic activity in multiple species (Kafaligonul et al., 2018; Sołyga and Barkat, 2019) and multiple brain areas (Galindo-Leon and Liu, 2010; Kafaligonul et al., 2018). Sound-evoked LFPs (see Materials and Methods) simultaneously obtained from SP (≥800 μm; P8-P9, N = 3 mice, responsive/total sites: 28 of 44 (63.63%) and P10-P11, N = 4 mice, responsive/total sites: 44 of 72 (61.11%) and L4 (300-350 μm, P8-P9, N = 3 mice: responsive/total sites: 27 of 44 (61.36%) and P10-P11, N = 4 mice: responsive/total sites: 52 of 72 (72.22%) locations) before ECO showed significant sound-driven activity (Fig. 4A,B). Longer-lasting oscillatory LFPs (mean slope difference; see Materials and Methods, L4: F(2,390) = 62.07, p < 0.001; SP: F(2,743) = 121.97, p < 0.001) were observed at ages before ECO (Fig. 4A,B) than after ECO (P12-P13, Fig. 4C). The above results are in accordance with the pattern of LFP activity observed in ferrets at ages before ECO (Wess et al., 2017). The population mean LFPs (population data, Fig. 4A,B) were quantified with the RMS values in 50 ms bins normalized by the mean RMS values of the baseline (200 ms preceding the stimulus onset). Significant responses in both L4 (P8-P9; one-sided paired t test, significant responding site-tone pairs (n), n = 65, t(64) = 8.18, p < 0.0001, P10-P11; n = 140, t(139) = 11.42 p < 0.0001) and SP (P8-P9; n = 72, t(71) = 9.377, p < 0.0001, P10-P11; n = 138, t(137) = 10.43 p < 0.0001) were observed before ECO that last for >1 s (population data: Fig. 4A,B, top and bottom) unlike after ECO (population data: Fig. 4C, top and bottom, L4 and SP, respectively). Moreover, the peak RMS change (within 500 ms after stimulus onset), a reflection of response strength, increases with age (L4: F(2,390) = 80.87, p < 0.001; SP: F(2,743) = 138.36, p < 0.001).
L4 and SPNs in the mouse ACX respond to tones before ECO. A, B, Example traces LFP in a single channel in L4 (cyan, top) and SP (magenta, bottom), in response to tone at age, P09 and P10. Green patch represents the window across which the significance was calculated (L4 and SP ACX single neurons respond to sound before ECO; one-sided paired t test: *p < 0.05; **p < 0.01; ***p < 0.001; see Materials and Methods). Horizontal red bar represents the stimulus onset. A, B, Population data: Population average of all the significant recording sites and SEM of RMS values of LFP with corresponding p-values (one-sided paired t test: ***p < 0.001). Red vertical line indicates stimulus onset. C, Similar plots of single-site and mean population LFP activity at ages after ECO (P12-P13), as shown in A and B, respectively. D, E, Example traces: Dot raster and z score of an L4 (cyan, top two plots) and SP (magenta, bottom two plots) neuron, in response to tone at age, P9 and P10. Red vertical line indicates stimulus onset. Right, Inset, Spike shape. D, E, Population data: Population mean PSTHs of all the recorded SP (P8-P9, 4 mice; P10-P11, 7mice) and L4 (P8-P9, 4 mice; P10-P11, 9 mice) neuron–tone pairs across different age groups which had a z score > 2 in at least 1 time bin within 500 ms after stimulus onset, with respective significance values (one-sided paired t test). Red vertical line indicates the stimulus onset. F, Similar plots of single-unit and mean population spike activity at ages after ECO (P12-P13) as shown in D and E, respectively. G, Latencies of L4 and SP neurons recorded simultaneously in P8-P11 (before ECO), P12-P13 and P14-P15 age groups. H, The population mean latencies revealed lowering of latencies with age and SPNs having lower latency than L4 neurons at ages before ECO and P12-P13 (one-sided unpaired t test: *p < 0.05; **p < 0.01).
Single units in L4 (P8-P9, N = 4 mice; P10-P11, N = 9 mice, cyan) and SPN (P8-P9, N = 4 mice; P10-P11, N = 7 mice, magenta) layer showed the presence of spiking responses (z > 2) to tones before ECO (Fig. 4D,E, top and bottom rows, respectively) and after ECO (Fig. 4F). Sample dot raster (top) and z scores (bottom) of spiking responses in 50 ms time bins are shown in Figure 4D, E, indicating the presence of significant sound-evoked spiking before ECO. Population PSTHs of all significantly responding neuron–tone pairs (n) in SP (P8-P9; one-sided paired t test, n = 30, 30 of 82 (36.59%), t(29) = 5.17, p < 0.001; P10-P11, n = 107, 107 of 487 (21.97%), t(106) = 11.48, p < 0.0001) and L4 (P8-P9; n = 30, 30 of 112 (26.79%), t(29) = 6.46, p < 0.001; P10-P11, n = 261, 261 of 678 (38.5%), t(260) = 15.90, p < 0.001) show significant sound-driven activity before ECO, which get stronger with age (Fig. 4D–F).
The response peak latencies (see Materials and Methods) of L4 and SP single units were obtained at ages before (P09 example trace, Fig. 4G) and after ECO (P12 example trace, Fig. 4G). Before ECO (neuron–tone pairs; nSP = 107 and nL4 = 193) and at P12-P13 (nSP = 108 and nL4 = 146), the mean peak latencies of SPNs were found to be significantly lower than L4 neurons (one-sided t test, before ECO, t(298) = −1.86, p = 0.031; P12-P13, t(252) = −2.935, p = 0.002, Fig. 4H). The above results suggest that SPNs respond at shorter latency relative to L4 neurons before ECO, and the response properties mature with age. Furthermore, the L4 (ANOVA with Tukey post hoc analysis, F(394) = 166.63, p < 0.001) and SP (F(233) = 62.65, p < 0.001) responding latencies decreased with age. Thus, based on the data from multiple techniques, we conclude that neurons in the different cortical layers and SP layer respond to sounds before ECO without surgically opening the ear canal. Auditory responses are present as early as P7 in mice and respond down to 60 dB SPL tones.
SPNs, unlike L4 neurons, show oddball selectivity before ECO
Having established the presence of auditory responses before ECO, we investigate the response properties of SPNs and L4 neurons in the context of stimulus-specific adaptation (SSA) (Ulanovsky et al., 2003; Malmierca et al., 2014; Pérez-González and Malmierca, 2014). We use oddball stimuli with a stream of standard tokens (S) with an embedded deviant token (D) to mimic the natural situation of a continuous auditory environment with important sounds occurring with low probability (Ulanovsky et al., 2003; Pérez-González and Malmierca, 2014) (Fig. 5A). The stimuli consisted of 50-ms-long sound tokens presented at 3.3 or 4 Hz of the pattern SSS…SDS…S (D at eighth or seventh position of 15 or 10 total tokens) for the oddball response characterization. One of the S and D tokens was a tone (f, range 6-34 kHz) and the other a broadband noise (N, bandwidth 6-48 kHz with equivalent sound level). Responses were collected in pairs with the S and D swapped to normalize for a neuron's inherent selectivity for a particular S or D (Fig. 5A). Each SD set of a pair was presented with 10-40 repetitions and 5-7 s silence duration between each iteration. A common selectivity index (CSI) was used to quantify a neuron's deviant detection strength as follows:
SPNs, unlike L4 neurons, show higher oddball selectivity before ECO. A, Schematic represents the tone-noise (f-N) oddball stimulation protocol with the swap on the right. B, PSTHs of a single SP neuron, showing higher response at deviant tone (f, 8.5 kHz) token (swap, 250 ms response window gray patch, right). C, Dot raster and PSTH of the same SP neuron showing response to the same tone (8.5 kHz), when presented as tone-pip stimuli. D, First two columns, Example PSTH and z score of 2 L4 neurons (top rows) and 2 SP neurons (bottom rows), showing response to tone (f, standard in green and deviant in red, top) and noise (N, standard in green and deviant in red, bottom). Light gray patch represents the mean rate window (250 ms) across which CSI(F) was computed. E, Mean population spike rate and mean z score of all L4 and SP neurons in response to tone (f, standard in green and deviant in red, top) and noise (standard in green and deviant in red, bottom) presented at 3.33 Hz. E, Last 2 columns, Mean population spike rate and z score of L4 and SP neurons in responses to tone and noise, delivered at 4 Hz. Gray vertical line indicates a z score of 2. F, Population (3.3 and 4 Hz combined) cumulative distribution of CSI(F) and CSI for SP and L4 (magenta and cyan trace, respectively) shows significantly higher oddball selectivity in SP (one-sided unpaired t test, ***p < 0.001). Similar cumulative distributions with mean CSI calculated for each of 1000 random pair of SD response windows selections for S response and D response either before deviant (light dashed lines, F, left) or after deviant (light dashed lines, F, right). No significant difference between L4 and SP in >90% of the random window sets. Triangles: Magenta and cyan represent mean CSI(F) or CSI for SP and L4 neurons. G, Top and middle rows represent the mean CSI values of the population of L4 and SP neurons, respectively, for different choices of response latencies and response window sizes. *Response latency and response window size combination used in F. The two columns are for the two different CSIs used CSI and CSI(F). Bottom row represents which combinations of response attributes have significantly higher CSI in SP than in L4 (brown, blue indicates NS).
ACX L4 (300-350 μm, 11 mice, 84 neurons) and SP (>800 μm, 11 mice, 34 neurons) single-unit responses were obtained before ECO in response to the presentation of the SD stimulus for f-N and its swap. Sample PSTHs of SPN before ECO show the oddball's preference or to the deviant tone (f) token (Fig. 5B). The tone of a particular frequency was chosen based on the frequency that evoked a significant response (z > 2, Fig. 5C). The SD stimulus–response generally showed peaks following the D-token response, mainly in SPN and the first S-token or onset, primarily in L4 (Fig. 5D, L4 top row, SP bottom row). The earliest response peaks of L4 and SPN at ages before ECO (P9-P12) with the SD stimuli occur in a 250 ms window with a latency of 50-100 ms from the first token and in a similar window following the deviant (Fig. 5D). Comparing the mean response to the S-token and that of the D-token (Fig. 5D; S: green, D: red) shows the above. In the overall population, the mean response before ECO (P9-P12) for all the L4 neurons and SPNs recorded indicates the presence of deviant detection in SP and not L4 (Fig. 5E). Further, based on the spontaneous activity (100 ms preceding the stimulus), z scores of the example neurons (Fig. 5D) and population of neurons (Fig. 5E) demonstrate a significant response to D and S and that in the population, SP has significant activity at the deviant and not L4. To quantify the oddball's selectivity, we obtained spike rates in 250 ms windows because of the width of the peaks and calculated CSI(F) and CSI for all SPNs and L4 neurons. We observed higher deviant detection in SP (n = 34) than in L4 (n = 84) for both CSI(F) (one-sided unpaired t test, t(116) = 3.954, p < 0.001, K-S test, p = 0.007, Fig. 5F, left). and CSI (t(116) = 3.666, p < 0.001, K-S test, p = 0.002, Fig. 5F, right).
To rule out spurious deviant detection because of coincident spontaneous oscillating activity during the stimulus, we also considered CSIs expected by chance. Two nonoverlapping windows 250 ms long were taken randomly in the standard region before the deviant or after the deviant, not considering the deviant response. The mean CSIs were not significantly different between L4 and SP in >90% of the random window sets. More importantly, the distributions of mean random CSIs for SPN and L4 showed that the means were not significantly different from 0 (Fig. 5F, light dotted lines). Further, to ensure that our results were not sensitive to the choice of latency and window size, we also varied the response window from 50 to 300 ms (in steps of 50 ms) and latency from 0 to 350 ms (in steps of 50 ms). We obtained the CSIs for all neurons, and we found for what combinations SPNs had significantly higher oddball selectivity than L4 neurons (Fig. 5G). The same results were obtained for a substantial range of values around the chosen latency and window size, indicating deviant detection in SPN and not in L4 before ECO.
Unlike before ECO, L4 shows higher oddball selectivity than SP following ECO
Oddball selectivity in adults is a common phenomenon in all cortical layers of ACX (Szymanski et al., 2009). Given the observed absence of oddball selectivity in populations of L4 neurons before ECO, we hypothesize that such selectivity in L4 starts after ECO. Thus, we next investigated how the oddball selectivity changes in L4 and SPNs after ECO. Single-unit responses to the same SD paradigm with f-N sound tokens (schematic: Fig. 6A) were collected simultaneously from L4 (P12-P13, N = 10 mice, n = 45 units, 57 cases; P14-P15, N = 8 mice, n = 66 units, 115 cases; P16-P21, N = 4 mice, n = 20 units, 31 cases; P22-P28, N = 6 mice, n = 38 units, 97 cases; >P28, N = 7 mice, n = 44 units, 63 cases) and SPNs after ECO from P12 to the juvenile/adult at >P28 (P12-P13, N = 10 mice, n = 54 SPN units, 101 SPN cases; P14-P15, N = 8 mice, n = 59 SPN units, 67 SPN cases). We again quantify the selectivity based on CSI(F) and CSI. Comparison of population PSTH segments in response to tone (Fig. 6A, top two rows) or the noise (Fig. 6A, bottom two rows) as the first S token (green), all S tokens (blue), and the deviant (red) show how selectivity to the oddball changes in L4 and SP over age after ECO starting from P12 to P13. Since the inherent selectivity to the f or N confounds direct comparison, we compare CSIs over age (Fig. 6B), including the CSIs before ECO. L4 neurons start showing oddball selectivity immediately after ECO, which gradually declines to stable values with age until adulthood (CSI(F): ANOVA with Tukey post hoc analysis, F(4,358) = 37.68, p < 0.0001; CSI: F(4,358) = 12.74, p < 0.0001). On the other hand, SPNs with a significantly lower number at P12-P13 compared with peak at P8 (Wang et al., 2010) show significantly lower CSIs than L4 after ECO (CSI(F): one-sided unpaired t test, P12-13L4 vs P12-13SP, t(156) = −1.944, p = 0.027; P14-P15L4 vs P14-P15SP, t(180) = −1.10, p = 0.13; CSI: P12-13L4 vs P12-13SP, t(156) = −1.6213, p = 0.050; P14-P15L4 vs P14-P15SP, t(180) = −1.41, p = 0.07). We considered both CSI(F) and CSI since at early ages because of high adaptation (decrease in time constant with age: FL4(4,455) = 11.92, p < 10−4f) in L4 neurons with robust responses at the first S token (responding tokens increases: FL4(4,455) = 6.34, p < 0.0001) and not any others. The same trend was observed in both CSIs, showing that before ECO, SPNs are stronger deviant detectors than L4, while the relative deviant detection strength switches after ECO.
Sound-evoked activity of ACX and MGB neurons after ECO. A, Schematic showing the tone-noise (f-N) oddball stimulation protocol with the swap. Mean population PSTH of all significant responding neurons (top, L4 neurons; bottom, SP neurons), indicating the deviant response (red), the first standard response (green), and the mean of responses at all the standards (blue) to tone (top 2 rows) and noise (bottom 2 rows) for different age groups after ECO. B, Overall mean oddball selectivity for noise and tone tokens measured by CSI(F) (top) and CSI (bottom), respectively, as a function of age. C, Similar mean population PSTH for tone-tone (f1-f2) oddball stimulation protocol. D, Same as in B, overall mean oddball selectivity for tone stimuli measured by CSI(F) and CSI, respectively, as a function of age. E, Left, Example of electrode tracks (white arrowheads) shown in Nissl stains of brain slices showing recording locations in MGB of the thalamus. E, Right, Magnified view of the relevant part of a coronal section with MGB, in which single-unit recordings from MGBv were performed (P14). F, Mean and SEs of CSI(F) (top) and CSI (bottom) for the thalamic neurons, by age groups. *p < 0.05. **p < 0.01. ***p < 0.001.
Above, we used N-f SD stimuli to mimic a repeating broadband auditory environment with the tone as a low probability stimulus (like a pup or adult mouse vocalizations in a normal auditory environment) (Holy and Guo, 2005). Since deviant selectivity has been studied primarily with two tones (Ulanovsky et al., 2003; Pérez-González and Malmierca, 2014) (f1 and f2), we also used an f1-f2 SD stimulus set to investigate whether our results also hold in such a case. Further, using tones as both tokens allows more direct comparisons between responses. f1 and f2 were spaced 1/4, 1/2, or 1 octave apart. Data were obtained from multiple L4 (P12-P13, N = 3 mice, n = 28 units, 47 cases; P14-P15, N = 3 mice, n = 41 units, 63 cases; P16-P21, N = 3 mice, n = 11 units, 15 cases; P22-P28, N = 4 mice, n = 24 units, 53 cases; >P28, N = 3 mice, n = 26 units, 74 cases) and SP (P12-P13, N = 3 mice, n = 9 SPN units, 14 SPN cases; P14-P15, N = 3 mice, n = 34 SPN units, 51 SPN cases) single units across different developmental windows. Data were collected from single units simultaneously, and hence f1 and f2 were at different distances from the BF of each of the units. One of the frequencies (f1 or f2) was chosen to be at or near the BF of the majority of units. The SD paradigm responses were used to compare mean PSTH segments in response to the first S token, all other S tokens, and the D token (Fig. 6C) as previously. A similar trend of decrease in CSIs in L4 over age was observed, as with N-f stimuli (CSI(F): ANOVA with Tukey post hoc analysis, F(4,247) = 30.42, p < 0.0001; CSI: F(4,247) = 13.83, p < 0.0001) (Fig. 6D). From CSI(F) and CSI, we found that L4 neurons are more selective (CSI(F): one-sided unpaired t test, P12-13L4 vs P12-13SP, t(59) = 1.8731, p = 0.033; P14-P15L4 vs P14-P15SP, t(112) = 2.866, p = 0.002; CSI: P12-13L4 vs P12-13SP, t(59) = 3.343, p < 0.001; P14-P15L4 vs P14-P15SP, t(112) = 2.163, p = 0.016) to oddballs than SPNs after ECO.
The observed high oddball selectivity in SPN or L4 at different developmental ages could reflect subcortical deviant selectivity (Antunes et al., 2010; Parras et al., 2017). The MGBv is the main thalamic nucleus from which A1, L4, and SPN receive inputs (Kanold and Luhmann, 2010; Holzhauer et al., 2017) (Fig. 6E). Thus, we quantified CSIs from MGBv single-unit responses to the N-f SD stimuli (Fig. 6F). In both age groups considered (P14-P15 and P24-P26), we find that CSI(F) and CSI were lower than that of L4 (CSI(F): P14-P15L4 vs P14-P15MGBv, t(180) = −4.373, p < 0.0001; P22-P28L4 vs P24-P26MGBv, t(163) = −0.031, p = 0.488; CSI: P14-P15L4 vs P14- P15MGBv, t(180) = −7.370, p < 0.0001; P22-P28L4 vs P24-P26MGBv, t(163) = −6.209, p < 0.0001) and SPN (CSI(F): P14-P15SP vs P14-P15MGBv: t(132) = −2.881, p = 0.002; CSI: P14-P15SP vs P14-P15MGBv: t(132) = −4.006, p < 0.0001) in developing mice and also lower than L4 in the adult. Thus, even at the early ages, the observed oddball selectivity is emergent in the cortex and is, expectedly, a network phenomenon (Yarden and Nelken, 2017).
Exposure in a SD acoustic environment before and after ECO shows differential developmental plasticity for the standard and the deviant frequencies
Having found higher deviant detection in ACX SP than in L4 before ECO (P7-P11) and the opposite after ECO (P12-P15) during the known critical period, we next investigated the functional implications of our observations. Previous studies (De Villers-Sidani et al., 2007; Barkat et al., 2011) have shown no change in the ACX organization in adult rodents following exposure to a continuous stream of a single frequency tone before ECO. Our results indicate that while L4 and SPNs are responsive before ECO, the above observation is expected given the strong SSA observed in SPNs before ECO (Fig. 5D), responding to the D stimulus and not the S. We hypothesize that, because of higher deviant detection by SPNs before ECO (Fig. 5D–F), and because SPNs drive L4 neurons (Holzhauer et al., 2017) at early ages, to an SD auditory environment before ECO could influence the activity-driven plasticity of the ACX.
Adult ACX frequency representation of normally reared mice (EX0, control, N = 6 animals, n = 400 units/815 tuning curves) was compared with mice exposed to an oddball sound sequence consisting of a continuous stream of sound tokens (pure tones), with each token either being a standard (S) or a deviant (D) (S occurring with probability 0.9 and D with probability 0.1) at four different developmental stages between P6 and P21 (Fig. 6A, EX1-EX4). We exposed each of the groups to an f1-f2 combination of S = 12 kHz and D = 17 kHz. The before ECO group (EX2, N = 5, n = 256/504) and the after ECO group (EX3, N = 5, n = 367/700) were the most relevant age groups for our study; hence, we performed additional experiments with another f1-f2 combination (EX2: S = 20 kHz and D = 14 kHz, N = 3, n = 197/525; EX3: S = 14 kHz and D = 10 kHz, N = 3, n = 233/553). In all the above cases, stimuli at 90 dB SPL were used to ensure sound-driven activity in ACX before ECO. Since we found that with ear closed responses in the ACX existed for as low as 60 dB SPL, for our exposure study to be functionally relevant, we did additional experiments by exposing the P6-P11 group at 70 dB SPL also (EX5, N = 5, n = 158/104). Exposure at P0-5 (EX1, N = 4, n = 183/370) and P16-P21 (EX4, N = 4, n = 130/255) was performed as age controls. Finally, as stimulus control, we performed exposures of P6-P11 (EX6, N = 3, n = 194/487) and P11-P16 (EX7, N = 3, n = 327/642) groups with only standard SS paradigm (S occurring with probability 1) in which S = 12 kHz was used (EX6 and EX7, respectively).
Single-unit recordings in L4 (depth 350-450 m) in response to pure tones (13 frequencies, 6-48 kHz, 1/eighth octaves apart, 50-90 dB SPL, 10 dB steps) from A1 and AAF were performed in the exposed (EX1-EX7) and normally reared (EX0) mice at P28-P42 (Fig. 7A). With our limited resolution BF mapping, we did not find any apparent over-representation of any particular frequency in BF maps showing broad tonotopy (A1 and AAF) on the cortical surface in any of the exposure cases (before and after ECO), unlike other studies (De Villers-Sidani et al., 2007) with the equivalent of the EX3 group (Fig. 7B). Other than the lower spatial resolution of single-unit sampling, the above may also be because of exposure parameters (sound frequency, duration, and repetition rate). We considered only mice where recordings based on BF gradients were in A1/AAF. All significant response spike rates from the tuning curves (cartoon tuning curves, Fig. 7C) for each of EX0 to EX7 were obtained. EX1, EX4, and control (EX0) had the same median spike rate (rank-sum test, 12 sp/s for every case, pEX1 = 0.8475, pEX4 = 0.1875). All other groups, EX2-3 and EX5-7, show increased median spike rates (rank-sum test, 16-24 sp/s, pEX2,3,5,6,7, <10−4) compared with EX0. Thus, the overall median spike rates changed with auditory environment manipulations in the most relevant groups, P6-P11 and P11-P16 (the known critical period).
Exposure to an SD stimulus protocol before ECO induces long-term plastic changes. A, Different exposure paradigms by age and type of exposure stimulus with details (Table 4). For the SD case, green represents standard (S) and red represents deviant (D). For the SS case, blue represents standard (S). B, Bar plot represents the % change in an area of each of the preferred frequencies in A1 and AAF between control (EX0 group of mice) and the only standard P10-P15 group (EX6). C, Schematic explaining the construction of RF plots. Tuning curves (shown with different colors) with the octave difference between the BF (black arrow) and the RF (pink arrow) written on top (♦ represents rate at RF on different tuning curves, with respective color). The mean RF plot (solid black line, below) showing the mean rate at RF (represented by ♦ with respective color). D, Bar plots of all the exposure groups showing RF rates at BF normalized by the RF rates at all the other frequencies 0.5 octaves away on either side (see Materials and Methods). Striped bars in the P6-P11 group represent the SD exposure protocol at 70 dB SPL (EX5). Gray bar represents the control. *p < 0.05. *p < 0.01. Mean of the 13 control cases shown in gray bar in the center (see Materials and Methods). E, Bar plot summarizes population mean absolute response to RF for neurons with BF at RF of all 6 EX cases (EX2-7). Mean of the 13 control cases shown in gray bar in the center. Gray dashed line indicates the maximum boundary of control (see Materials and Methods). F, Bar plot showing population mean absolute response to RF for neurons with BF at RF when presented with loud sound levels (90-70 dB SPL, left) and soft sound levels (50-60 dB SPL, right).
The absence of over-representation of any particular frequency and increased spike rates in both the groups exposed during the known critical period as well as groups exposed during the period immediately preceding the critical period suggested plasticity in the strength of responses. Thus, we next investigated whether the effect of exposure was on spike rate responses to tones and whether changes were frequency-specific. From the tuning curves, the population mean response rates to an RF were plotted as a function of the difference between BF and RF in octaves (Fig. 7C; see Materials and Methods). We termed these plots as RF plots. The RF could be the frequency of S or D or that of SS. For the SD exposure protocol (EX1-5), the RF was taken as the standard frequency (S, Fig. 7D–F, green) and the deviant frequency (D, Fig. 7D–F, red); and for the only standard exposure protocol (SS), the RF was taken as the standard frequency (SS, Fig. 7D,E, blue). We obtained two such plots for SD exposures (EX1-5), with S as RF and D as RF; for SS exposures (EX6-7), S was the RF. For the control group C (gray), since there was no actual RF, each of the 13 frequencies presented was considered as RFs separately, and the RF plot was constructed (EX0, Fig. 7D–F, gray; for details, see Materials and Methods).
Although we did not observe any over-representation, we found a higher median rate for the relevant exposed groups. We investigated the effect of the exposure paradigm on the spike rates of all the neurons with respect to their BFs. We took the RF rates at the center of the RF plot (BF = RF, Fig. 7C, see Materials and Methods) and normalized them with the mean of the RF rates where the absolute difference between BF and RF was >0.5 octaves (|BF-RF|>0.5 octaves). A value >1 will signify an increase in the spike rate at the BF for the neurons with the BF as RF, whereas a value <1 will indicate an increase in spike rate at BF for neurons with BF away from the RF (Fig. 7D, normalized spike rate at BF; see Materials and Methods). We found that, in the SD exposure, before ECO, the normalized spike rate at BF for the neurons where BF and RF is the D frequency is significantly higher than the neurons with BF and RF as S (Fig. 7D, P6-P11; green bar vs red bar, one-sided unpaired t test, t(120) = 2.15, p = 0.016; pKS-test = 0.007 at 90 dB [filled bar] and t(57) = 1.76, p = 0.041; pKS-test = 0.006 at 70 dB SPL [striped bar]). The opposite is true after ECO (Fig. 7D, P11-P15; green bar vs red bar, one-sided unpaired t test, t(107) = 2.71, p = 0.004; pKS-test < 10−3). The normalized spike rate of neurons with BF and RF as S before ECO was significantly less than 1 (Fig. 7D, P6-P11, green bar, one-sample t test, t(78) = −2.75, p = 0.0078), but does not show any effect after ECO (Fig. 7D, P11-P15, green bar, one-sample t test, t(75) = −0.83 p = 0.39). In the SS exposure, the normalized spike rate at BF for neurons with BF and RF as S was not significantly different from 1 both before ECO (Fig. 7D, P6-P11, blue bar, one-sample t test, t(6) = −0.37 p = 0.72) and after ECO (t(17) = −1.23 p = 0.23). Comparisons (significant or not) with a population of 13 control rate profiles are marked on respective bars (gray '*' or NS). We also looked into the frequency pairs (12/17 kHz and 20/14 kHz) individually and got similar results for both before ECO (Fig. 7D, P6-P11, one-sided unpaired t test; 20/14 kHz: t(90) = 2.16, p = 0.016 and 12/17 kHz: t(28) = 0.95, p = 0.17. 12/17 kHz had comparatively lesser number of tuning curves, both 90 dB SPL [filled bar] and t(57) = 1.71, p = 0.041 12/17, 70 dB SPL [striped bar]) and after ECO (Fig. 7D, P11-P15, 14/10 kHz: t(51) = 1.61 p = 0.056 and 12/17 kHz: t(54) = 2.19 p = 0.016). Thus, there is a differential long-term plasticity of responses to the low probability sound frequency depending on the period of development during which animals were exposed to the SD environment. It coincides with the differential oddball selectivity before and after ECO observed in the ACX L4 and SPNs involved early in development in setting up the ACX circuitry.
The population mean absolute spike rate at BF as a function of the difference between BF and RF in octaves for EX2&6, EX3&7, and EX5 (Fig. 7A) showed differential effects on response strength at BF to the S and D for SD exposure before ECO and after ECO. Exposure after ECO in the known critical period of P11-P15, the response to S frequency in neurons with BF at S is strengthened compared with the response to D frequency in neurons with BF at D (Fig. 7E, P11-P15; green bar vs red bar, one-sided unpaired t test, t(107) = 2.65, p = 0.005; pKS-test = 0.008). However, before ECO, the opposite effect is observed (Fig. 7E, P6-P11; green bar vs red bar, one-sided unpaired t test, t(120) = 3.08, p = 0.0013; pKS-test = 0.0013). No such changes are observed in exposure before P6 (Fig. 7E, P0-6; green bar vs red bar, one-sided unpaired t test, t(81) = 0.15, p = 0.55; pKS-test = 0.81) and after P15 (Fig. 7E, P16-P21; green bar vs red bar, one-sided unpaired t test, t(58) = 0.88, p = 0.81; pKS-test = 0.33), similarly at 70 dB SPL exposure (Fig. 7E, P6-P11; striped green bar vs red bar, t(57) = 1.507, p = 0.06; pKS-test = 0.011). In the SS exposure in P11-P15, there is an increase in response to S frequency in neurons with S as BF beyond the boundary of control (Fig. 7E, P11-P15, blue bar and gray dashed line; for details, see Materials and Methods; one-sample t test, t(17) = 2.68, p = 0.015). A higher response than the boundary of control (for details, see Materials and Methods) for D frequency in neurons with BF as D was found before ECO in SD exposure at both 90 (Fig. 7E, P6-P11, red bar and gray dashed line, one-sample t test, t(42) = 2.80, p = 0.007) and 70 dB SPL (Fig. 7E, P6-P11, striped red bar and gray dashed line, one-sample t test, t(38) = 5.06, p < 10−4). As observed in normalized spike rate, when considering each set of exposure frequencies, for the merged groups of SD exposure with different sets of S and D frequencies, the response to D in neurons with BF as D was greater than that of S in neurons with BF as S before ECO (one-sided unpaired t test, 20/14 kHz: t(90) = 3.07, p = 0.0014 and 12/17 kHz: t(28) = 0.95, p = 0.17, 12/17 kHz had comparatively lesser number of curves, both 90 dB SPL and t(57) = 1.5, p = 0.06 12/17, 70 dB SPL). After ECO, the response for D in neurons with BF as D was found to be less than that of S in neurons with BF as S (14/10 kHz: t(51) = 1.53 p = 0.06 and 12/17 kHz: t(54) = 2.11 p = 0.019). Tuning curves collected after the presentation of pure tones at louder (70-90 dB SPL) and softer (50-60 dB SPL) sound levels were also analyzed separately to probe the effect of intensities (Fig. 7F). We found the effect to be more consistent at softer sound levels (EX2: t(20) = 2.03, p = 0.023, pKS-test = 0.01; EX3: t(42) = 2.71, p = 0.004, pKS-test = 0.016; EX5: t(5) = 5.47, p = 0.001, pKS-test = 0.01) compared with the louder sound level (EX2: t(58) = 2.64, p = 0.005, pKS-test = 0.008, EX3: t(63) = 0.95, p = 0.173, pKS-test = 0.055, EX5: t(50) = 0.66, p = 0.254, pKS-test = 0.107).
Model binary network with observed oddball selectivity of SPNs and L4 shows a stronger representation of deviant frequency before ECO and standard frequency after ECO
Having found long-term plastic changes specific to the deviant in the ACX that had exposure to an SD environment before ECO (P6-P11), we next explain our observations based on known circuitry of SPN, L4, and thalamic inputs (Kanold and Luhmann, 2010; Holzhauer et al., 2017). SPNs receive stronger thalamic inputs than L4 early on, and SPNs project to L4 neurons (Holzhauer et al., 2017) (Fig. 8A). In our simultaneous SP-L4 recordings with staggered electrodes before ECO, 42 pairs of neurons (between L4 and SP) were found to be connected (Fig. 8B,C; see Materials and Methods) of 136 simultaneously recorded L4-SP unit pairs. After ECO (P12-P15), we found 176 of 244 L4-SP unit pairs to be connected. The direction of connections was obtained based on significant cross-correlations at positive or negative time lags and confirmed via the GC test (SP->L4, magenta or L4->SP, cyan, corresponding examples in Fig. 8B and population data in Fig. 8C). Assuming columnar sizes of ∼100 µm, we grouped the pairs as columnar (≤125 µm) or not (>125 µm), based on the lateral distance between the electrodes from which connected pairs were obtained (Fig. 8C). Observed directionality of the connected pairs before ECO suggests the existence of connections from SPN->L4 (12 of 16 pairs showed connections from SPN->L4 within a spatial distance of ≤125 µm, while 13 of 26 pairs were found to be connected at a distance of >125 µm, Fig. 8C, first column, top). However, the L4->SP connection probability within ≤125 µm (4 of 16 connected pairs) was found to be significantly lower than SPN->L4 (one-tailed z test, z = −2.828, p = 0.002, Fig. 8C, first column, top). Moreover, L4->SP connections were found to be significantly higher than SP->L4 at ages after ECO (one-tailed z test, z = 4.746, p < 10−4, Fig. 8C, first column, bottom). The summary of connectivity between L4 neurons and SPNs (Fig. 8C) shows that SP->L4 connections are significantly reduced (one-tailed z test, z = −2.600, p = 0.004) from before ECO to after ECO. GC analysis (Granger, 1969; Francis et al., 2018) also revealed similar connectivity profiles between SP and L4 at ages before and after ECO, as observed using cross-correlation (Fig. 8C, second column, top and bottom). It is also evident from the spike rates of L4 neurons that the thalamic drive increases after ECO, as also observed in slice recordings (Holzhauer et al., 2017). We also found that recurrent connections, obtained with cross-correlation as well as GC, within SP (SP->SP) and within L4 (L4->L4) were differentially altered from before ECO to after ECO (Fig. 8D). Significantly more recurrent connections were present within SP than within L4 before ECO (z = 5.437, p < 10−4, Fig. 8D, top) and which switched after ECO (z = 9.655, p < 10−4, Fig. 8D, bottom). SP to SP recurrent connections decreased with ECO while those in L4 increased, very much like the observed CSI values in SPN and L4 before and after ECO, expected from the role played by recurrent connections in deviant detection (Yarden and Nelken, 2017). Our observations on deviant detection strength measured by CSIs are thus further corroborated by the lateral connection changes observed in the two stages in the two layers.
The binary network model of a developing ACX with SP and TC inputs. A binary model is shown for both before ECO and after ECO, with the size of the blobs representing synaptic weights, which changes across time. A, A network schematic showing standard (S, as green in SD and as blue in SS paradigm) and deviant (D, as red in SD paradigm) TC inputs to both SP and L4 and an SP input to L4 (top), SD and only standard (SS) stimulation protocol (bottom). B, Example pair of correlograms for SP to L4 (left) and L4 to SP (right) connections. Gray plot represents the bootstrapped shuffled data (control). *Significant time windows (95% CI) for both before ECO and after ECO. An example EPSP profile showing synaptic depression. C, Population bar plots showing connection probability between SP and L4 for different distances according to cross-correlation (left) and GC (right) before and after ECO. D, Connection probability for SP-SP and L4-L4 connections. E, Top, Example EPSP profile showing synaptic depression. Bottom, The asymmetric spike time-dependent plasticity learning followed by all the plastic synapses, showing a larger LTD window. The synaptic weight modifications occurred over −80 to 40 ms of post-pre spike times. F, Example PSTHs showing higher oddball selectivity in SP (magenta) than L4 (cyan) before ECO and the opposite after ECO. G, Temporal evolution of the relevant synaptic strengths (S) in the two exposure periods (before ECO, SD: EX2; SS: EX6; and after ECO, SD: EX3; SS: EX7) for simulations of exposure with SD or SS protocol. Inset, Schematic represents the final point of the simulation with the blob sizes representing relative synaptic weights. H, Plots represent the mean spike time difference (post-pre) across time. I, Mean relative change in synaptic weight (Δw/w). *p < 0.05. **p < 0.01. ***p < 0.001. Theoretical derivations explaining results are provided in Extended Data Figure 8-1.
Figure 8-1
Theoretical analysis for optimization based on mutual information maximization and sparseness. Download Figure 8-1, PDF file.
Thus, at the earliest stages, stimuli through thalamic inputs that drive SPNs would drive the L4 neurons. Such activation of L4 neurons with coincident activity in the thalamus to L4 synapses would strengthen the synapses following Hebbian plasticity. Thus, the observed deviant detection of the SPNs responding to D and not the S would strengthen the synapses on L4 conveying the D information. On the other hand, after ECO, the thalamus to L4 synapses are already stronger than before ECO, and the L4 neuron responds to both the S and D stimulus, preferring S more. The SP to L4 synapse is weaker than before ECO and is not as strong in evoking spikes in L4 by itself. Thus, on exposure to SD after ECO, synapses conveying D information from the thalamus weaken while conveying S information strengthens.
We test the above explanation for the observed exposure-dependent developmental plasticity based on L4 and SPN deviant detection through a binary network model (Fig. 8A) of early developing TC circuitry of the ACX (Holzhauer et al., 2017) similar to that considered in the visual cortex (Kanold and Shatz, 2006). To have a minimal model that could potentially explain the results, we did not consider lateral connections, although the presence of recurrence will, in principle, only improve the results. Thalamic inputs selective for either of two inputs, S or D (green and red, Fig. 8A), are considered to make synapses on an SPN, sending collateral inputs to L4 neuron (Luhmann et al., 2018). The SP and L4 neurons are modeled as integrate-and-fire neurons with absolute and relative refraction (Kanold and Shatz, 2006). The SPN makes a synapse onto the L4 neuron, as known (Luhmann et al., 2018). All synapses were considered depressing in nature (Mill et al., 2011; Yarden and Nelken, 2017) such that L4 and SPN responses show similar adaptation observed early in development (Fig. 8E, top). Also, all synapses, except the thalamic inputs to SPN, are plastic following an asymmetric Hebbian spike time-dependent plasticity (Kanold and Shatz, 2006) rule (Fig. 8E, bottom). The abscissa in Figure 8E, below represents the difference in presynaptic and postsynaptic spike times (δτ, post – pre), with positive δτ leading to LTP and negative δτ leading to LTD. The ordinate represents the percentage relative change in synaptic weight (Δw/w) for each presynaptic and postsynaptic spike pair.
We consider the model with two different starting points: one representing the period before ECO (Fig. 8A, left) and another after ECO (Fig. 8A, right). In the earlier developmental period (before ECO), the SP CSI was 0.41, and that of L4 was −0.02. In the later developmental period, SP CSI was −0.17, and that of L4 was 0.4 (Fig. 8F), as in our observations (Fig. 6B). The differences in the initial point of the network (see Materials and Methods) in the 2 cases (Fig. 8A, left and right) are supported by other data (Holzhauer et al., 2017) as well as our observations of larger time constants later in development, increased thalamic synaptic weights on L4 because of normal development, lower SPN to L4 connectivity (Fig. 8B,C) and reduced SPN rates through higher thresholds. We use the same exposure protocols, SD and SS, for the two periods to show the nature of how the thalamic inputs to L4 develop (Fig. 8G). Before ECO with higher oddball selectivity in SP than in L4, with an SD exposure, the D thalamic input to L4 strengthens faster than the S thalamic input (Fig. 8G, before ECO).
On the other hand, after ECO, with the oddball selectivity reversed in SP and L4, D to L4 synapse weakens, and S to L4 strengthens, as L4 spikes more because of thalamic input spikes in that of S and not for that of D (Fig. 8G, after ECO). The above is evident when considering the relative presynaptic and postsynaptic spike times (δτ, post – pre) driving LTP or LTD of the S->L4 and D->L4 synapses in the 2 cases, before and after ECO (Fig. 8H; see Materials and Methods). The mean strength of weight change occurring in the synapses based on all Δw/w for each presynaptic and postsynaptic spike pair separated by δτ also shows the same effect on synaptic strength change (Fig. 8I). Mean δτ(D->L4) is positive before ECO and negative after ECO, with a significant difference. Thus, the effect of SD exposure before ECO is overall long-term potentiation of the D->L4 synapse. After ECO, the same synapse weakens as observed in the model results (Fig. 8G). The opposite is true for the S->L4 synapse. Before ECO with an SS exposure, the S thalamic input does not show strengthening at a fast rate (as D, above), leading to the S input's normal representation (Fig. 7D, bar plot P6-P11) equivalent to S in the SD case. After ECO, with the only S stimulus, the S inputs expectedly strengthen, leading to the normal response of S, as observed (Fig. 7D, bar plot).
Sparse coding and information maximization principles imply a strengthening of response to low probability stimuli
While a network model mechanistically addresses the observations, we theoretically address the observations of early exposure to a low probability stimulus leading to the strengthening of its responses and its weakening in later exposure case. Our observations and model results are contrary to the general ideas of long-term plasticity; a repeated presentation of a stimulus in the critical period leads to the strengthening of the representation of that particular stimulus in the long-term (Zhang et al., 2001; De Villers-Sidani et al., 2007; Barkat et al., 2011; Schreiner and Polley, 2014) as theoretically expected from standard Hebbian plasticity. Thus, it is important to reconcile our observations with previous observations from a theoretical perspective as well. Auditory cortical spiking activity is known to be sparse, and sparse coding principles theoretically explain several known receptive field types in the auditory pathway (Smith and Lewicki, 2006; Hromádka et al., 2008). We assumed that a minimal activity-based coding in single-neurons underlying sparse representation in populations and that such representation is one of the developing auditory system's goals while maximizing information about the stimulus in the responses. We use an objective function (Eq. 1) as the desired optimization performed by activity-driven plasticity with these principles (Extended Data Fig. 8-1). The objective function has a mutual information term between the response (R) and the stimuli token (T) that will increase as successive stimuli tokens come. A sparseness constraint term containing the mean rate limits the increase in mutual information. We call this metric as the Constrained Information (CI). This function was optimized in two separate regimens corresponding to the two separate regimens (i.e., before ECO and after ECO) as follows:
To keep the solution mathematically tangible and straightforward to understand, we optimized a single neuron receiving tokens of stimuli that are either standard (S, occurring with a higher probability) or deviant (D, occurring with a lower probability). We also took the liberty of considering the probability distribution of the responses for standard and deviant as Gaussian with a constant variance but variable mean. A detailed mathematical optimization is provided in Extended Data Figure 8-1. However, the basis is as follows. We focus on the distribution of a single neuron's response to standard and deviant for both before ECO and after ECO regime. For the before ECO case, we have kept the mean response for deviant higher than that of standard and the opposite for the after ECO case, as we got in our experiments (Fig. 5). We see that the mutual information would increase as the separation between the mean response of the two distributions would increase (a will increase; Extended Data Fig. 8-1). However, since the sparse activity requires a minimal response by single neurons to code the two stimuli, α must be positive and small to minimize the summed activity. As such, the response to the deviant needs to be larger than that of the standard. However, the relative weightage of MI and the summed activity (controlled by ρ) can allow a to be increasing or decreasing and still cause CI to be increasing.
We perform this optimization separately for the two regimens and analytically find out the upper bound and lower bound for the Lagrange multiplier ρ and hence the relative weightage of the sparseness, which will keep increasing the mutual information with successive tokens (Extended Data Fig. 8-1). Based on these results, we show that sparseness in later stages (after ECO), when spike rates are higher, needs to be weighted more than earlier stages (before ECO) to explain the differential effect observed before and after ECO in response to the low probability stimulus.
Discussion
Earliest sound-driven activity in the mouse auditory pathway
We show that the mouse auditory pathway, ACX and IC, is driven by sounds with ear canals closed as early as P7 and down to 60 dB SPL intensities (Figs. 1 and 2). The above result is perhaps not surprising as auditory brainstem responses in mice have been shown before ECO (A. Chang et al., 2018) and that there are long-term changes in Prestin expression in mice when exposed to loud sounds at these early ages (A. Chang et al., 2018) (P6-P11). Studies based on sound-evoked (A. Chang et al., 2018) and spontaneous activity (Babola et al., 2018) show that the central auditory pathway is functional before ECO. SPN circuits are different in mice with sound exposure and no sound exposure at ages before ECO (Meng et al., 2021). However, the peripheral auditory system may be immature at those ages. The external auditory meatus in mice reopens at P7, and the outer ear is fully mature by that time (Anthwal and Thompson, 2016). Further, lever arm lengths of middle ear bones are at 70% of mature values but have normal lever ratios by P7 (Huangfu and Saunders, 1983). The oval window is also of normal size by P7 (Huangfu and Saunders, 1983). Most importantly, mechano-transduction currents from Inner hair cells and Outer hair cells (OHCs) become maximal in the cochlear base by P2 and at the apex by P5 (Lelli et al., 2009; Kim and Fettiplace, 2013); thus, inner hair cells are fully mature by P5. However, OHCs are immature at P5 with nonlinear capacitance (NC) of OHCs starting at P7 and maturing fully at P18 (Abe et al., 2007). The development of OHC NC from P7 is correlated with the expression of Prestin, the motor protein responsible for the OHC NC (Abe et al., 2007) and hence hearing sensitivity. Thus, barring OHC NC, the key to low threshold (0 dB SPL) hearing and high auditory nerve fiber sensitivity, all evidence suggests that the peripheral auditory system is mature. Thus, moderate- to high-intensity sound-evoked activity in the auditory pathway is possible at ages before ECO in mice. However, to the best of our knowledge, no study has shown auditory responses in mice before ECO. So far, all studies considering sound-evoked activity and early critical period in the auditory system have considered hearing onset to coincide with ECO. Our results open up an earlier time window of activity-driven plasticity to occur in the auditory pathway. Previous work on ferrets (Wess et al., 2017) also shows such a possibility, where the authors show that SPNs are the first to respond to sounds at an equivalent developmental stage of ferrets as we find in mice.
Auditory development and deviant detection
Post-ECO development of basic response properties, such as tuning, tuning width, thresholds to tonal sounds, or broadband noise and other basic stimuli, have been studied (E. F. Chang et al., 2005; De Villers-Sidani et al., 2007). Our study shows that the starting point of auditory activity-driven development is before ECO, and provides data on how deviant detection changes over development into adulthood (Figs. 5 and 6). We show that the well-known deviant detection property of the auditory pathway (Ulanovsky et al., 2003; Yarden and Nelken, 2017) is present in the early stages and is stronger in cortical neurons than in the thalamus (Fig. 6). We also show that deviant detection is strongest at early ages and reduces to the values observed in the adult. More importantly, we find that SPNs are deviant-selective at the earliest stages of sound-driven activity before ECO, while cortical L4 neuron populations are not deviant-selective at those ages (Fig. 5). Since SSA underlying deviant selectivity is primarily a network phenomenon (Yarden and Nelken, 2017), it may be surprising that SPNs are deviant-selective at ages when thalamocortical and corticocortical connections are forming. However, it is well known that SPNs form synapses with each other (Hirsch and Luhmann, 2008; Kanold, 2009; Kanold and Luhmann, 2010). Correlated with our observations of higher oddball selectivity in SPNs, SPNs are found to have more recurrent connections before ECO than after ECO and more than that of L4 neurons before ECO (Fig. 8B–D). After ECO, L4 neurons have more recurrent connections than before. The differential nature of recurrent connectivity in L4 and SPN before and after ECO also corroborates our observations of differential oddball selectivity in the two populations before and after ECO. Thus, the nature of recurrent connectivity change we observe during early development supports the changes in the strength of deviant detection as measured by CSIs.
Early auditory experience-based plasticity, SPNs, and critical period
A broad tonotopic organization in the primary ACX (A1) (Bandyopadhyay et al., 2010; Issa et al., 2014) is known to be altered with over-representation of a particular frequency in the adult rats/mice, which were exposed to that frequency tone during the critical period (P12 to P15, after ECO) (De Villers-Sidani et al., 2007; Barkat et al., 2011). In light of our present results with auditory responses present before the above period, we revisit exposure to a single frequency tone before the critical period. The previous studies (De Villers-Sidani et al., 2007; Barkat et al., 2011) did not find any evidence of adult cortical over-representation of the exposure frequency when exposing mice to that frequency before ECO. Thus, the window of P12-P15 in mice and a similar window in rats was defined as the critical period (De Villers-Sidani et al., 2007; Barkat et al., 2011). Our data explain the lack of over-representation with exposure before ECO despite auditory responses during that period (Fig. 7B). The SPNs driving thalamocortical synapses to mature are found to have deviant detection at these early stages; thus, SPNs, and consequently, L4 neurons do not respond to the repeated tone presentations. Therefore, in our SS exposure results before ECO (P6-P11, EX6), we do not find any plastic strengthening responses to the S frequency as expected from above. However, as hypothesized, the unique deviant detection properties of the SPNs before ECO show strengthening of response to the deviant tone with an SD exposure before ECO. It is expected to be because of the lack of or weak responses to the repeating S frequency and responses to the D frequency. However, SD exposure following ECO shows the opposite since the SPNs are less deviant-selective and thalamic inputs are now stronger, driving L4 responses directly. Similarly, the SS exposure after ECO also shows expected results. The mechanism underlying our observations is summarized in our model results. Of course, adaptation time constants play a role in the observed deviant detection, hence our exposure results (Fig. 7D–F).
Since SSA is primarily a network phenomenon with recurrent connections playing a key role, the observed changes in recurrent connections (Fig. 8B–D) cannot be ruled out in understanding our results. In our current model (Fig. 8A), we did not consider the recurrent connections. Extending our minimal binary network model to a recurrent network is expected to produce the same results as the effects of deviant detection and SSA. Our underlying observations would remain unchanged. Our model is only a minimal neural structure capturing the key mechanisms explaining the observations.
Further, the model currently does not include inhibition, an element crucial in development, as synaptic plasticity rules of inputs/outputs of inhibitory neurons at such ages are not available. It likely plays a pivotal role as the maturation of inhibition is controlled by SPNs (Kanold and Shatz, 2006). Also, the effect of the inhibitory switch, occurring before ECO, on the activity-driven plasticity observed before ECO needs to be investigated. The closure of the critical period, around P16, also coincides with the observed near absence of SPNs in mice from P15. Thus, mechanisms implicated in critical period closure, such as inhibition, adenosine, cholinergic inputs (Blundon et al., 2017; Takesian et al., 2018; Vickers et al., 2018; Yaeger et al., 2019), could also be tied to SPNs, since SPNs guide development of subcortical projections. Finally, exposures with selective inhibition of SPN activity during relevant development periods are required to establish SPN-driven cortical development further.
Thus, we suggest that to understand better experience-dependent developmental plasticity, the period before ECO needs to be considered and possibly revise timelines of the critical period. The concept of the auditory critical period has been modified by showing that it can be stimulus-specific (Bhumika et al., 2020). Similarly, we suggest the presence of a critical period that is deviant-selective before ECO.
Implications of the SPN deviant detection critical period
In mice, we show that relatively low probability salient stimuli before ECO at ages equivalent to human gestational week 25 (Clancy et al., 2001) cause long-term plasticity (Fig. 7C–F). The above potentially explains observations in humans of maternal voice recognition in infants (Mehler et al., 1988; Voegtline et al., 2013). The results also suggest that prenatal sensory environment can influence cortical development and may also generalize to other sensory systems given the unique response properties of the SPNs. The generalization to other systems or other stimuli, in this case, is supported by our theoretical predictions. Although our results are derived based on tones as stimuli, the possibility of the coding of auditory objects by the neurons in the ACX can also be addressed through similar experiments. For example, specific response selectivity to species-specific vocalizations in mice and other species could also be explained based on the low probability salient occurrence of such sounds during early development. Corresponding experiments for other sensory systems can lead to a better understanding of cortical development.
Footnotes
This work was supported by the DBT/Wellcome Trust India Alliance Fellowship/Grant IA/I/11/2500270 to S.B. M.M. was supported by CSIR Fellowship. A.M. was supported by MHRD PMRF. S.B. was supported by India Alliance, IIT Kharagpur, MHRD and SRIC Cell, and IIT KGP.
The authors declare no competing financial interests.
- Correspondence should be addressed Sharba Bandyopadhyay at sharba{at}ece.iitkgp.ac.in