Abstract
Familiarity of the environment changes the way we perceive and encode incoming information. However, the neural substrates underlying this phenomenon are poorly understood. Here we describe a new form of experience-dependent low-frequency oscillations in the primary visual cortex (V1) of awake adult male mice. The oscillations emerged in visually evoked potentials and single-unit activity following repeated visual stimulation. The oscillations were sensitive to the spatial frequency content of a visual stimulus and required the mAChRs for their induction and expression. Finally, ongoing visually evoked θ (4–8 Hz) oscillations boost the visually evoked potential amplitude of incoming visual stimuli if the stimuli are presented at the high excitability phase of the oscillations. Our results demonstrate that an oscillatory code can be used to encode familiarity and serves as a gate for oncoming sensory inputs.
SIGNIFICANCE STATEMENT Previous experience can influence the processing of incoming sensory information by the brain and alter perception. However, the mechanistic understanding of how this process takes place is lacking. We have discovered that persistent low-frequency oscillations in the primary visual cortex encode information about familiarity and the spatial frequency of the stimulus. These familiarity evoked oscillations influence neuronal responses to the oncoming stimuli in a way that depends on the oscillation phase. Our work demonstrates a new mechanism of visual stimulus feature detection and learning.
Introduction
Perception and processing of sensory information are critical for animal survival. Efficient strategies are necessary to recognize and differentiate among various visual contexts and stimuli. This recognition and differentiation of information require the segregation of familiar and novel sensory inputs. However, the neural substrates underlying these processes are poorly understood. Visual familiarity arises as animals experience particular visual stimuli, and manifests as changes in neural activity from the naive state within the primary visual cortex (V1). Prior research in this area has primarily focused on the analysis of neuronal responses time-locked to the visual stimuli. Previous visual experience can increase the stimulus selectivity of time-locked neural activity in V1 of awake mice (Poort et al., 2015). Similarly, repetitive presentation of phase-reversing gratings to awake mice over several days results in significant potentiation of visually evoked potentials (VEPs) in V1 specifically to the trained stimulus (Frenkel et al., 2006; Cooke et al., 2015). VEP amplitude has also been shown to report learned sequences of visual stimuli, where presentation of a familiar sequence elicits larger amplitude time-locked VEPs compared with a novel sequence (Gavornik and Bear, 2014).
While less well characterized, persistent oscillatory activity lasting beyond visual stimulation may also arise in an experience-dependent manner and potentially reflect stimulus familiarity. Several functions have been attributed to oscillatory activity of neurons in the brain, including information processing, information propagation, and synchronization between brain areas (Salazar et al., 2012; Karalis et al., 2016; Chan et al., 2017; Durkin et al., 2017). Oscillatory activity within particular frequency bands correlates with cognitive load, attention, and other higher-order brain functions (Jensen and Tesche, 2002; Womelsdorf et al., 2006; Voloh et al., 2015; Friese et al., 2016; Mussel et al., 2016). Low-frequency oscillations are pronounced during sleeping states (Besedovsky et al., 2017; Durkin et al., 2017; Kim et al., 2017). However, recent evidence has also demonstrated the prominence of low-frequency oscillations in awake states during the perception of visual stimulation (Lee et al., 2005; Einstein et al., 2017). Moreover, persistent spiking in the θ oscillation range (4–8 Hz) emerges in V1 after the presentation of a visual cue to report the time of a reward delivery (Shuler and Bear, 2006; Zold and Hussain Shuler, 2015) and during the delayed part of visual cue-reward working memory tasks (Lee et al., 2005). Thus far, persistent oscillatory activity in V1 has been hypothesized to encode information other than the characteristic features of a visual stimulus, potentially reflecting experience-dependent changes in neural population activity related to the task. It is also plausible that the persistent oscillatory activity induced by previous visual stimulus may interact with new oncoming stimuli. Currently, we do not understand how this oscillatory activity is generated or encoded, nor do we understand how it affects the visual processing of oncoming stimuli.
Importantly, the possibility that oscillations can encode visual stimulus features has not been formally excluded. For example, it remains unclear whether novel versus familiar visual stimuli can differentially evoke oscillations in V1. Whether stimulus-associated oscillations occur in the absence of reward-induced plasticity also remains to be determined. Finally, the necessary and sufficient visual stimulus features that can drive oscillatory activity are unknown. Here, we find that perceptual training leads to the emergence and potentiation of persistent low-frequency oscillations in both local field potentials (LFPs) and single-unit recordings. Consistent with other familiarity-based rhythms in V1 (Chubykin et al., 2013; Zold and Hussain Shuler, 2015), the acquisition and expression of these oscillations depend on the cholinergic muscarinic receptors. The spatial frequency (SF) content of a visual stimulus appears to be the critical factor governing the generation of oscillatory activity. Finally, we demonstrate that familiar stimuli can influence responses to oncoming stimuli, based on the excitability phase at which the subsequent stimuli are presented.
Materials and Methods
Mice.
All animal use was approved by the Purdue University animal care and use committee. Mice were housed on a 12 h light/dark cycle, with full access to food and water. Male C57BL/6 mice (The Jackson Laboratory; age postnatal day 60–65) were used for all experiments.
Surgical procedures.
At ~P55, littermate-matched C57BL/6 mice were selected for surgery. Anesthesia was induced at 5% inhaled isoflurane (in oxygen) and maintained at 1.5% during surgery. The skin above the skull was shaved and sterilized with Dynarex ethanol wipes before incision. Ophthalmic ointment was applied to the eyes, and animals were head fixed (with ear bars) in a motorized stereotaxic apparatus (Neurostar) before the surgery. A small incision was made along the midline, then expanded laterally to expose the lambda and bregma skull sutures; 3% H2O2 was applied gently with a cotton swab to sterilize and expose the remaining connective tissue on the skull. The underlying connective tissue was removed, and the skull was dried with sterile cotton swabs. Neurostar stereodrive software was used to mark coordinates to target the binocular visual cortex (from lambda: anteroposterior 0.8 mm, mediolateral ±3.2 mm). Shallow lines were scratched into the surface of the skull in a checkboard pattern to increase the surface area and facilitate strong bonding of metabond bone cement. A headpost 9.5 mm in length was glued with cyanoacrylate 3.5 mm anterior to bregma along the midline. A 1.5 mm tungsten reference wire soldered to a 0.79-mm-diameter gold-plated pin was inserted 0.2 mm forward of the bregma skull suture as a reference and glued in place with cyanoacrylate. Zip-kicker accelerant was used to reduce the drying time of the cyanoacrylate. A small drop of Kwik-Cast Silicone Elastomer was placed above the target coordinates marked on the skull to seal the area until the recording day. Metabond was then used to seal the remaining exposed skull and form a head cap. After a day of recovery, awake mice began habituation to the head-fixation apparatus for a minimum of 4 d (90 min/d). The apparatus consists of an immobile tube that secures the mouse on a raised platform 16.51 cm directly in front of and centered on a 47.63 cm × 26.99 cm monitor screen and a bar to hold the surgically implanted headpost. On the first day of habituation, some mice attempted to run to escape head fixation. By the third or fourth day of habituation and during recording sessions, we did not observe any attempts to run. Mice also exhibited grooming behavior by the third or fourth day of habituation. On the fifth day (~P60), a craniotomy was made above the visual cortex of a single hemisphere during 5 min of inhaled anesthesia (1.5% isoflurane) in the stereotaxic apparatus. Mice were then head fixed, and a 64 channel silicon electrode was inserted normal to the surface of the binocular area of the primary visual cortex (anteroposterior 0.8 mm, mediolateral ±3.2 mm, dorsoventral 1.0 mm from lambda). Before acquiring data, additional time (30 min) after insertion was allowed for the probe to settle and for animals to fully awaken from anesthesia. ACSF was added dropwise above the craniotomy every 20 min to prevent cortical drying. After a recording session, the recording hemisphere was resealed with Kwik-Cast Silicone Elastomer.
Perfusions and histology.
Mice received intraperitoneal injections of a 100 mg/kg ketamine and 16 mg/kg xylazine solution for anesthesia. A shallow incision was made below the rib cage to expose the inner cavity of the peritoneum. The lateral sides of the rib cage were cut, and the diaphragm was removed. The rib cage was peeled back, and any remaining connective tissue was removed to expose the heart. A 25 gauge needle was inserted into the left ventricle, and a small incision was made into the right atrium. Blood was forced out of the animal via gravity fed 1× PBS through the needle. The animal was then perfused with 4% paraformaldehyde (PFA) to fix the tissue. The animal was decapitated, and the Metabond cap was removed by carefully cutting the cap border and peeling it away from the skull. Shallow cuts were made along the midline, the lambda suture, and the bregma suture to peel back the skull and remove the brain. After extraction, the brain was allowed to sit in 4% PFA for 24 h before making 100-μm-thick coronal sections. The electrode track was then visualized by light microscopy to verify the electrode placement in V1, according to landmarks shown in a mouse brain reference atlas (Neurostar stereotaxic mouse brain atlas, Allen mouse brain atlas).
Visual stimulation.
Open source Python-based psychology software (PsychoPy) was used to present visual stimuli. Control gray screen was created using the color space “gray.” The mean luminance of the monitor was 73 cd/m2. After a day of recovery, mice began habituation to the head-fixation apparatus for a minimum of 4 d. During habituation, mice viewed a control gray screen for 90 min per day. Our standard recording strategy (see Fig. 1) was to show mice single 0.2 s sinusoidal drifting gratings (SF = 0.03 cycles per degree of visual angle, temporal frequency = 3 Hz, speed = 100 deg/s, oriented and drifting at an angle of 150 degrees) for 20 trials in pretraining recordings. Gray screen was presented for 0.5 s before stimulus onset to serve as a baseline with a total recording time of 2.5 s or 4.0 s for each trial, with an intertrial interval of 8 s. Mice were then trained to a pair of stimuli, where the first stimulus (same as described above) is followed by a second stimulus with the same spatial and temporal frequency, but oriented and drifting at an angle of 210 degrees. Animals were trained to this pair of stimuli 200 times in 30 min each day for 4 d. Post-training recordings included the same visual stimulation paradigm as pretraining. In some mice, we also made recordings of the training stimulus after training (12 mice from Fig. 1 trained to 1.0 s interval; 6 mice from Fig. 1 trained to 2.0 s interval). For SF experiments (see Figs. 9, 10), five visual stimuli were generated by bandpass filtering (PsychoPy filters) random noise with highest power in 0.01, 0.02, 0.04, 0.08, and 0.14 cpd. The following bandpass cutin and cutoff values were used to generate stimuli: SF 0.01 = (0.00083, 0.00086), SF 0.02 = (0.00167, 0.00170), SF 0.04 = (0.00333, 0.00336), SF 0.08 = (0.00667,0.00670), and SF 0.14 = (0.01167, 0.01170).
Data acquisition and Python packages.
Recordings were made using 64 channel silicon probes, 1.05 mm in length, with channels separated 25–50 μm vertically and 16–20 μm horizontally (Shobe et al., 2015). Recordings were made in sets of 20 trials, 2.5 or 4.0 s in duration. Raw traces and bandpass-filtered units (300–6000 Hz) were digitized at 30 kHz and acquired with OpenEphys acquisition hardware and software. LFPs were filtered (1–300 Hz) from raw traces post hoc. All data were plotted and analyzed with jupyter notebook using custom Python code written in our laboratory. Open source data analysis libraries, including Pandas, Scipy, Matplotlib, Seaborn, and sklearn, were used to analyze and plot the data. Pupillometry recordings were acquired with a Thorlabs DCC1545M camera positioned ~28.5 cm away from the mouse eye, while the pupil was illuminated with infrared light. The videos were analyzed post hoc.
LFP analysis.
Raw traces were down-sampled to 1 kHz and manually inspected for artifacts before further analysis. A notch filter was applied to remove 60 Hz noise. LFPs were compared between animals by taking the first and strongest trial averaged (20 trials) VEP elicited after visual stimulation (putative layer 4 VEPs) from each of the 3 channel columns of the silicon probe. Because the probes were inserted normal to the surface of the cortex, we could ascertain the current source density (CSD) profile of visually evoked responses. CSD analysis was performed on the trial averaged VEPs across the cortical depth using the spline iCSD method in python (Mitzdorf, 1985; Aizenman et al., 1996; Pettersen et al., 2006; Leski et al., 2007). To determine the average amplitudes of these trial averaged VEPs between mice, 6 windows of time were used to capture local minima (VEP1, 0.53–0.63 s; VEP2, 0.73–0.83 s; VEP3, 0.93–1.03 s; VEP4, 1.15–1.25 s; VEP5, 1.38–1.48 s; VEP6, 1.65–1.75 s) corresponding to the VEP timings observed in our recordings. Time-frequency analysis was performed by using complex wavelet convolution on trial averaged layer 4 VEPs across mice (Cohen, 2014). We used series of complex wavelets to extract power and phase at each sample point. We used 40 frequencies across a logarithmic range from 2 to 80 Hz, with the number of cycles of the wavelet ranging from 3 to 10 for an optimal time-frequency precision tradeoff. Power was dB normalized to the baseline period. The mean power was then calculated across θ (4–8 Hz), α (8–12 Hz), β (12–30 Hz), and γ (30–70 Hz) frequency ranges.
Spike detection and sorting.
We used KiloSort, a template-based clustering algorithm implemented in MATLAB, to detect and sort spikes from raw binary data (Pachitariu et al., 2016). We used the default KiloSort parameters but set a threshold of 6 SD for spike detection and initialized the templates from data. Clusters were further manually inspected using the Phy template GUI, and several criteria were used to determine the quality of units to be used for further analysis (Rossant et al., 2016). (1) Units with similarity scores of 1.0 were merged, so long as the shape of the average waveforms and their positions on the channel map matched. (2) In rare cases, units with clearly separated clusters in feature view were split. (3) Any unit with waveforms on all channels was removed. (4) Any unit with <100 spikes over an 80 s period (20 trials at 4 s each) or 60 spikes over a 50 s period (20 trials at 2.5 s each) was removed as reliable autocorrelograms could not be constructed. (5) Any unit with aberrant waveform shapes was removed. (6) Units without a sufficiently clear gap in the absolute refractory period of auto correlograms were removed. (7) After manual sorting, any units with >5% of their interspike intervals violating the absolute refractory period were excluded from analysis. This typically resulted in ≤3% of manually sorting units being eliminated from the analysis. (8) Any units with instantaneous firing rates >200 Hz were also removed. Klusta Suite/phy clustering software, a more established clustering algorithm, was also used to validate our results (Rossant et al., 2016). Using Klusta Suite/phy gave the same qualitative results.
Single-unit analysis.
Peristimulus time histograms (PSTHs) of single-unit activity were computed using 10 ms bins and smoothed with a Gaussian kernel (width = 100 ms). For heat maps, z scores were calculated by normalizing to the mean firing rate (FR) across all time (z = (FR − mean FR)/SD FR). For population time course line plots, z scores were calculated by normalizing FR to the baseline period (0–0.5 s) (z = (FR − mean baseline FR)/SD baseline FR). To quantify the duration of oscillations in single units, we applied a peak detection algorithm on the z-transformed PSTH with the following criteria: (1) the minimum peak height must be at least 1.5 SD from baseline; (2) the first peak must be within 100 ms of the stimulus onset; and (3) peaks must be within 200 ms from one another. Units were grouped into different clusters using K-means, an unsupervised clustering algorithm from the sklearn Python package. The input matrix was the PSTH z score of single units across 0–0.25 s after stimulus onset. We tried initializing clustering both randomly and with the first principal components, but both methods gave qualitatively the same results. Therefore, we used random initialization on all k-means clustered data shown in this report. In the majority of cases, we used simple 2-cluster unbiased k-means clustering to separate units into visually excited and visually inhibited units. For Figure 10 scatter plots, only units that significantly responded to at least two different stimuli were included to probe single-cell specificity to familiar versus novel stimuli.
Pupillometry analysis.
All recordings of mouse pupil size were analyzed post hoc using custom programs written in Python and using the open source computer vision library OpenCV. We imaged the mouse eye at 400 × 300 pixels at 20 Hz with an infrared (IR) camera and lens (Thorlabs). The eye was illuminated with an 850 nm IR LED (CMVision IR30). Synchronization with electrophysiology recordings was achieved by Arduino and a custom-written Python script in Psychopy. Acquired videos were analyzed using the OpenCV library in Python. For each video, the ROI that only included eye boundaries was selected. For each frame, we first performed image histogram equalization to improve the contrast of the images followed by a Gaussian blur. To perform image segmentation and separate the pupil from the rest of the eye, we applied binary image thresholding. The morphological transformation function morphologyEx was used to remove noise. This was achieved by first using erosion that removes white noise, followed by dilation to restore the original object boundaries, effectively removing white noise. Then we identified the pupil contours by using the function 'findContours' with a mode (RETR_TREE) and method (CHAIN_APPROX_SIMPLE). We used a minimum enclosing circle to define the pupil and to remove edge artifacts caused by whiskers and the IR illumination. In the rare cases when the artifacts could not be removed, the images were removed from the analysis. Automatic pupil detection was manually verified for each video. Following these procedures, pupil area was extracted. If the eye was not sufficiently illuminated, we could not properly track the pupil and had to exclude those recordings from analysis. All recordings of pupil diameter were baseline normalized and reported as a percentage change from baseline.
Statistical analysis.
Statistical analysis was performed in Python and Prism. We used two-sample Kolmogorov–Smirnov tests in Python to compare the distributions of oscillation durations among populations of single units. To account for unequal variance and unequal numbers of units between comparisons, we used a Welch's two-sample t test in Python to compare the mean oscillation durations between two unit populations. A Welch's two-sample t test was also used to compare the mean power, pretraining versus post-training, at each frequency band from time-frequency analysis. A two-way ANOVA was used to compare the main effects of training (pre vs post) and oscillation cycle (1–6) and the interaction effect between training and oscillation cycle on either VEP amplitude or firing rate as shown in Figures 1 and 2. A two-way ANOVA was used to compare the main effects of stimulus orientation (150° vs 300°) and oscillation cycle (1–6) and the interaction effect between stimulus orientation and oscillation cycle on VEP amplitude as shown in Figure 6. A two-way ANOVA was used to study the effect of training and SF along with comparing amplitude and power of oscillations within SF = 0.04 and SF = 0.14 training sets (see Figs. 9, 10). A paired t test was used in scatter plots to compare single-cell oscillatory activity to familiar versus novel stimuli. A two-way ANOVA was used to compare the main effects of visual stimulus condition (Ctrl, in phase, out of phase) and oscillation cycle and the interaction effect between visual stimulus condition and oscillation cycle on VEP amplitude or firing rate as shown in Figure 11. A Tukey-HSD test was used to adjust for multiple comparisons and obtain p values for each comparison after each ANOVA. We attempted the ANOVA and corresponding Tukey-HSD test in python, R, Prism, and MATLAB. However, each of these programs set a limit for how small of a p value could be reported. Therefore, we reported the exact p values when possible but otherwise reported them as p < 0.001.
Results
Visually evoked oscillations in V1 emerge with experience to visual stimuli
To explore the neural correlates of visual familiarity in V1, we performed head-fixed electrophysiological recordings in awake mice before and after perceptual training (Fig. 1A). Acute extracellular recordings were performed with 64 channel silicon probes spanning the cortical depth (Shobe et al., 2015). Using open source data acquisition software (OpenEphys), we simultaneously recorded LFPs and putative single-unit activity to study visual processing at the level of both population responses and individual neurons (Siegle et al., 2017). We first explored the influences of perceptual training to pairs of stimuli, given the evidence that oscillatory activity in V1 can encode the timing of events predicted by visual cues (Shuler and Bear, 2006; Chubykin et al., 2013; Zold and Hussain Shuler, 2015). A sinusoidal drifting grating (0.2 s duration, SF = 0.03 cpd, temporal frequency = 3 Hz, speed = 100 deg/s) was presented as a cue to predict a visually distinct second drifting grating after a short delay (1 or 2 s delay, second stimulus-oriented 60 degrees from the first) (Fig. 1B). Pretraining recordings in response to the cue were primarily characterized by stimulus-locked responses in layer 4 VEPs, although some low-power oscillatory activity could be seen in some cases (Fig. 1C, top, blue trace). After repetitively exposing the animals (perceptual training) to the stimulus pairs for 4 d (200 presentations/d), high amplitude oscillations emerged in the layer 4 LFPs in response to the visual cue that persisted beyond the timing of visual stimulation (Fig. 1C, top, cyan trace). In extreme cases, up to 6 distinct cycles could be seen that lasted up to 1.0 s beyond the end of visual stimulation. Averaging the maximum amplitudes of these 6 oscillation cycles across mice revealed that the first cycle corresponding to the time-locked VEP did not significantly change in amplitude after training, similar to the first response seen during sequential learning paradigms (Gavornik and Bear, 2014) (Fig. 1C, bottom). The second through fourth cycles were significantly potentiated compared with pretraining recordings. After the large increase in the amplitude seen at the second cycle, a decay in amplitude occurred across time until the responses returned to baseline. The oscillations observed in LFP recordings were at the canonical θ frequency (5.46 Hz, SEM = 0.07 Hz, averaged across 18 mice). There were no significant differences in the recording depth of the layer 4 VEPs before versus after training, indicating consistent insertions under both conditions (pretraining depth: mean ± SEM, 380.5 ± 14.4 μm; post-training depth: 374.1 ± 14.6 μm; 2-sample Kolmogorov–Smirnov test pretraining versus post-training: p = 0.9, Welch's t test: p = 0.75). To quantify the relative changes in the power of persistent (occurring after the stimulus, 0.7–1.2 s) θ (4–8 Hz) oscillations, as well as other common neural oscillation frequency bands, we performed complex wavelet convolution on trial-averaged VEPs across mice (Cohen, 2014). This analysis revealed a significant increase in the power of the θ (4–8 Hz) band after training, consistent with the prominent amplitude increases seen in LFPs (Fig. 1D,E). The power of other frequency bands also increased after training, including α (8–12 Hz), β (12–30 Hz), and γ (30–70 Hz) relative to the baseline period. Time-locked responses to visual stimulation displayed clear increases in β power compared with baseline in both pretraining and post-training recordings, consistent with other reports of β increases in response to grating stimulation (Chen et al., 2015). We observed a dramatic non–stimulus-locked increase in β corresponding to the timing of the second cycle that emerged only after training, an interesting result given that the β band has been suggested to represent top-down cortical processing (Fig. 1D) (Buschman and Miller, 2007; Schmiedt et al., 2014).
Such prominent oscillations in LFPs suggest a synchronous engagement of a large population of neurons in the local circuit. To examine the firing properties of individual units excited during the oscillation, we compared all visually responsive units before and after training. Individual units displayed stimulus-locked responses before training but displayed sustained periods of spiking activity at θ frequency only after training (Fig. 2A,B). Firing rates did not significantly change at the timing of the second cycle but were significantly increased at the third cycle after training (Fig. 2C). We quantified the duration of oscillatory unit activity based on their mean z score, revealing a significant increase in oscillation duration after training (Fig. 2D,E). These oscillatory units were found across the whole cortical depth. Only mild differences between the cortical depth distributions of all the recorded units could be found pretraining versus post-training, which did not lead to sampling bias (Fig. 2F,G). Given the intriguing evidence that θ oscillations may report the time of predicted events in V1, we used our oscillation duration analysis to determine whether training to a pair of salient stimuli influenced oscillation duration. We found that mice trained to a single stimulus also displayed oscillations of comparable duration to those trained to a pair of stimuli (Fig. 3A). Additionally, subsets of animals trained to either 1.0 s or 2.0 s interstimulus intervals (ISIs) displayed oscillations after both the first (the “cue”) and the second stimulus (Fig. 3B). There was no significant difference in oscillation duration after the first stimulus whether a 1.0 s or 2.0 s ISI was presented. Similarly, no significant difference in duration was observed when only the first stimulus (the “cue”) was presented between mice trained to these intervals (Fig. 3C). These results indicate that visually evoked θ oscillations emerge in an experience-dependent manner, but do not explicitly encode interval timing.
The cholinergic system is required for visual experience-dependent oscillations in V1
V1 receives extensive cholinergic input from the basal forebrain in both rodents and primates (Bigl et al., 1982; Mesulam et al., 1983). Acetylcholine (ACh) is thought to enhance perceptual learning, attention, experience-dependent synaptic plasticity, and has been shown to enhance LTP in L2/3 of binocular visual cortex (Gu, 2003; Wilson et al., 2004; McCoy et al., 2008; Kang et al., 2014). To explore the role of mAChRs in the acquisition of visually evoked θ oscillations, mice received intraperitoneal injections of 3 mg/kg scopolamine, a nonspecific mAChR antagonist, 30 min before each training session for 4 d. After 4 d of drug treatment with training, we did not observe oscillations in these animals. Visual responses were transient and time-locked to the stimulus, similar to those typically seen in pretraining recordings in controls (Fig. 4A). After allowing 24 h for drug washout followed by 4 additional days of training without scopolamine, oscillatory activity recovered. To test whether mAChRs are necessary for the expression of θ oscillations, we trained a separate group of mice for 4 d, followed by a 3 mg/kg scopolamine injection on the final day 30 min before the recording session only. Similarly, θ oscillations were blocked but recovered after 4 additional days of training (Fig. 4B). These results suggest that the cholinergic system, through the activity of muscarinic receptors, is required for both the acquisition and the expression of visually evoked θ oscillations in V1. This is consistent with other experience-dependent phenomena reported in V1, which have shown that both persistent unit activity and the potentiation of VEPs to familiar visual stimuli are dependent on the cholinergic system (Chubykin et al., 2013; Gavornik and Bear, 2014).
Experience-dependent oscillations report familiarity to visual stimuli in V1
Next, we hypothesized that the motion of the gratings might directly influence the emergent oscillations in V1. To test this notion, mice were trained to 200 ms static stimuli at the same SF (0.03 cpd) and orientation (150 degrees) as the sinusoidal drifting stimulation seen in Figure 1. Oscillatory activity was significantly increased in duration after training despite the visual stimulus remaining stationary (Fig. 5A). In the same animals, we presented an extended-duration static stimulus (800 ms) to determine whether the oscillations were sensitive to the duration of the training stimulus (200 ms). This manipulation extended stimulus-evoked persistent firing in units before training but failed to elicit low-frequency oscillations. After training, however, persistent low-frequency oscillations were evoked throughout the entire duration of stimulus presentation and beyond (Fig. 5B). Next, we tested the sensitivity of these oscillations to stimulus novelty. The first parameter we changed was stimulus orientation, considering that other experience-dependent changes in visual responsiveness, such as stimulus-specific response potentiation, are sensitive to orientation (Frenkel et al., 2006; Cooke et al., 2015). Presentation of a novel stimulus orientation (300 degrees vs 150 degrees) to trained mice yielded persistent responses that did not oscillate strongly (Fig. 6A,B). Oscillations in response to the novel stimulus were more evident among VEPs but had decreased magnitude compared with those evoked by the trained stimulus (Fig. 6C,D). However, the oscillations were still present in response to the novel orientation, suggesting that their amplitudes, but not their emergence, are orientation selective.
Next, we tested whether drifting gratings of different spatial frequencies could promote the emergence of these oscillations. Mice trained to drifting gratings of high SF (0.3 cpd) also displayed these oscillations (Fig. 7A). Interestingly, when these mice were presented with an intermediate SF (0.03 cpd) drifting grating, we noticed a distinct lack of oscillatory activity. This could be replicated in mice trained to low SF (0.003 cpd) stimuli and presented with a novel stimulus with an intermediate SF (0.03 cpd) (Fig. 7B). To further test whether the presence of the oscillations is specific to a particular type of visual stimuli, we trained mice to a drifting grating (200 ms, SF = 0.03 cpd) and displayed a complex grayscale checkerboard stimulus as a novel stimulus (Fig. 7C). Again, oscillations emerged in response to the trained stimulus, but not the novel checkerboard stimulus. Yet, mice could also be trained to the grayscale checkerboard stimulus to elicit oscillations (Fig. 7D). Surprisingly, when we presented an intermediate SF (0.03 cpd) grating as a novel stimulus to these mice, prominent oscillatory activity was elicited, similar to that evoked by the familiar checkerboard stimulus. This is likely because the checkerboard stimulus contains a broad range of spatial features, leading to the cross-activation of the trained circuitry when a simpler grating stimulus was presented. Thus, the spatial complexity of a familiar stimulus may be sufficient to drive oscillatory responses to novel “component” stimuli.
Among VEPs, we consistently noticed that the second cycle of the oscillation was the most potentiated (see Fig. 1D). This observation may reflect the strengthening of an “off” response to the 200 ms stimulus after training, as it occurs shortly after the stimulus ends. If so, then longer presentations of familiar stimuli should extend the oscillatory period and delay the “off” response accordingly. To explicitly test this prediction, we presented visual stimuli of extended durations (400, 800, and 1600 ms) after training to an intermediate SF grating (200 ms, 0.03 cpd). In this experiment, a low SF grating (0.003 cpd) was presented as a novel event. Oscillatory activity in both individual units and LFPs persisted across all durations of the stimulus in mice trained to the intermediate SF stimulus (Fig. 8A). The oscillations were abolished when the low SF stimulus was presented as a novel event. We then did the opposite experiment with another group of mice, which were trained to a low SF grating with the intermediate SF grating presented as a novel event (Fig. 8B). Similarly, the oscillations were present only in response to the familiar low SF grating stimulus. The stimulus-locked oscillations were clearly different in frequency from the persistent oscillations previously observed in response to 200 ms stimuli at this SF (compare with Fig. 7B). Given that the oscillations can persist throughout the presentation of familiar long-duration static stimuli (Fig. 5B), we suggest that the mechanism of the familiarity-evoked persistent oscillations is different from that of the visual stimulus-locked oscillations. The second cycle of the oscillation remained highly potentiated regardless of stimulus duration, suggesting that it represents a conserved property of the oscillation that is separate from the delayed “off” response seen at the end of each extended duration. Overall, these results suggest that visually evoked oscillations in V1 emerge with visual experience and may be representative of visual familiarity in a SF-dependent manner.
SF is the major stimulus feature driving the emergence of the oscillations
After the observation that oscillatory activity was not induced by drifting grating stimuli with novel SFs, we decided to further investigate the SF tuning of the oscillations without the confounding factors of orientation or temporal frequency. We generated static stimuli of nonoverlapping SF bands by bandpass filtering sparse noise in a range of frequency bands (0.01, 0.02, 0.04, 0.08, or 0.14 cpd) (Fig. 9A). Separate groups of animals were trained to specific SF bands (either 0.2 s SF = 0.04 cpd, n = 4 mice; or SF = 0.14 cpd, n = 6 mice) the same way as described above (4 d, 200 presentations/d), with all five stimuli presented both pretraining and post-training. Pretraining recordings showed that all five stimuli induce stimulus-locked firing of single units and stimulus-locked VEPs, consistent with our pretraining results to grating stimulation. After training, we observed the emergence of oscillations in both groups of mice (Fig. 9B). In both cases, however, the amplitude of the oscillations was highest for the trained SF compared with the novel SFs, particularly at the second oscillation cycle (Fig. 9B,C). Time-frequency analysis revealed potentiation of low-frequency oscillations that persisted beyond the stimulus duration in both training groups, especially to the trained SF (Fig. 9D, green and purple boxes). Overall, oscillatory activity varied based on how close in SF a stimulus was to the experienced one. We found that low-frequency oscillations were significantly modulated by the trained stimulus compared with novel stimuli within each training group (Fig. 9E).
Next, we analyzed single-unit activity to assess how individual neurons responded to the experienced versus novel stimuli by quantifying the duration of stimulus-induced oscillatory activity. Unit analysis revealed that both single-cell and population level activity could be used to discriminate between the experienced SF band (oscillatory response) and novel stimuli (transient response). On a population level, neural activity reflected our observations from LFP analysis showing that oscillatory activity was specific to the trained SF band (Fig. 10A). We found that the duration of oscillatory activity was significantly longer for the trained stimulus compared with the novel one within each training group (for mice trained to 0.04 cpd and for mice trained to 0.14 cpd) (Fig. 10B). We then sought to determine how single cells respond to the trained versus novel stimuli. To do this, we created scatter plots that summarize how the duration of single-unit oscillatory activity varied depending on the stimulus SF (Fig. 10C, after SF = 0.04 training; and Fig. 10D, after SF = 0.14 training). The duration of oscillations was quantified for each individual cell in response to the different stimuli, with pairwise relationships of familiar versus novel stimuli shown in the scatter plots. Each dot indicates a single unit, with the x-coordinate being the oscillation duration to familiar stimuli and the y-coordinate being the duration to novel stimuli of a particular SF. The red dashed line indicates a perfect correlation, corresponding to units responding the same way to the familiar or novel stimuli. If the majority of dots fall below the red line, single neurons responded with longer oscillatory activity to the familiar stimulus, whereas dots above line show that single units preferred novel stimuli. Black and green contours outline the density of the data points. This analysis demonstrated that single cells differentiated between distinct SF bands as evidenced by clustering toward the familiar stimulus (Fig. 10C). Among animals trained to 0.14 cpd, the difference between durations elicited by the trained versus novel stimuli was not as significant as after SF = 0.04 training (Fig. 10D). However, in both conditions, the strength of the familiarity-evoked oscillations was dependent on how close in SF the novel stimulus was to the trained one.
Ongoing oscillations influence incoming visually evoked responses
To determine the role of oscillatory activity in sensory information processing, we tested how an ongoing oscillation might influence the processing of subsequent visual stimuli. LFPs represent summed synchronized synaptic potentials in the local circuitry. Therefore, an ongoing oscillation may alter an incoming visually evoked response via summation of graded potentials, depending on the phase of the oscillation. To test this hypothesis, we presented visual stimuli at the high excitability θ phase (referred to here as “in phase”) or the low excitability θ phase (referred to here as “out of phase”) relative to an ongoing visually evoked oscillation. We first recorded visually evoked responses to a single familiar stimulus after training to confirm the presence of θ oscillations. A second stimulus was then presented to evoke a response in phase with the third cycle of the oscillation, or to evoke a response out of phase with the third cycle. Interestingly, presentation of a visual stimulus in phase with the third cycle elicited a response that was approximately twice the amplitude of the first or third cycles elicited by a single stimulus. This increase did not occur when the second stimulus was presented out of phase (Fig. 11A,B). To estimate the translaminar currents underlying this phenomenon, we performed CSD analysis using trial averaged LFPs recorded across the cortical depth (Mitzdorf, 1985; Aizenman et al., 1996; Pettersen et al., 2006; Leski et al., 2007). Both current sinks and sources were stronger when the second stimulus was presented in phase with the third cycle compared with out of phase (Fig. 11C). This CSD analysis confirmed our VEP potentiation results.
Oscillatory activity and pupil dynamics
Pupil dilations are well known to report surprise, arousal, or cognitive load. Increases in pupil size have also been linked to a decrease in the power of spontaneous low-frequency oscillations in LFP recordings (Reimer et al., 2014; Vinck et al., 2015). To explore pupil dynamics and their relationship to visually evoked LFP oscillations, we simultaneously recorded changes in pupil size before and after training in response to grating stimuli (SF = 0.03 cpd) or to checkerboard stimuli (novel stimuli) together with electrophysiological recordings. Time-frequency analysis of layer 4 VEPs recorded under these conditions revealed that novel stimuli elicit lower-power oscillatory activity beyond the stimulus presentation, similar to the pretraining condition (Fig. 12A,B). Pupil tracking of these animals revealed sustained increases in pupil size when the presented stimulus was novel (Fig. 12C,D). Importantly, this effect occurred both when an animal viewed a visual stimulus for the first time (the pretraining condition) and when a novel stimulus was presented after familiarity to a different stimulus was already established (the post-training novel condition). This response was not observed when animals were familiar with the presented stimulus, suggesting that these sustained pupil increases are representative of a surprise response. To explore the relationship between the power of oscillatory activity in LFPs and this surprise response, mean low-frequency (4–30 Hz) persistent (after stimulus presentation, 0.7–1.2 s) oscillatory power was plotted against the mean pupil area percentage change of the surprise response for each individual mouse for which we successfully recorded both VEPs and pupil size. This analysis revealed that recordings with low oscillatory power (pretraining and novel conditions) correlated with higher surprise responses. The opposite was true for trained animals, where higher-power oscillations negatively correlated with surprise responses (Fig. 12E). Interestingly, similarly to the low-frequency oscillations, the power of high-frequency (30–70 Hz) persistent oscillatory activity also negatively correlated with the surprise response (Fig. 12F). These results suggest that these stimulus-evoked sustained pupil dilations depend on the perceptual novelty of the stimulus and are not solely mediated by the brain state changes traditionally associated with these types of pupil dynamics.
Discussion
Here, we determined the conditions and underlying mechanisms through which visually evoked low-frequency oscillations manifest in rodent primary visual cortex (V1). We have discovered that, after exposing mice daily to repetitively presented salient visual stimuli, persistent low-frequency oscillations emerge in both VEPs and individual units in V1. Principally, we show that (1) these oscillations arise in an experience-dependent manner; (2) they are sensitive to the familiarity of an animal to the SF content of a visual stimulus; (3) muscarinic receptors are necessary for both the induction and expression of these oscillations; and (4) ongoing oscillations directly influence the VEP amplitudes of oncoming stimuli depending on the θ excitability phase of the oscillation at which the oncoming stimuli are presented.
Feature specificity of visual familiarity
The brain continuously receives visual information and can automatically detect even minor changes in the visual environment. These statistical irregularities violate expectations generated by previous familiar stimuli. Indeed, object-based irregularities are automatically identified by the visual system, including changes in the geometric patterns or in the complex attributes of the natural stimuli (Stefanics et al., 2011; Stefanics and Czigler, 2012; Müller et al., 2013). Identification of these “surprising” stimuli may be based on both the long-term and short-term history of previous events. Surprising events carry novel information that can potentially be critical for survival. This process is necessary to continuously update the internal model of the world, which represents what the brain expects to see as “normal.” However, “surprise” can only be detected whether familiarity has been established toward at least some visual stimuli, and the corresponding predictions based on these stimuli have been generated. Extraction of the common statistical features of the repeating visual stimuli may serve as an automatic “perceptual filter” (Stefanics et al., 2014). Our results are consistent with this theory. Repeated presentation of the visual stimuli of a particular SF leads to the development of familiarity to its SF mediated by the low-frequency oscillations. Although we did not observe the all-or-none oscillatory response to the familiar vs novel stimuli, this is not surprising given that individual neurons respond to a range of different SF. The power of the oscillatory activity in LFP and the response duration of single units dynamically varied based on how close the presented SF band was to the trained one. Significant differences were observed in both LFP and single units' activity when comparing the responses to the trained and the novel distant SF bands. Furthermore, we have demonstrated that the oscillations are not dependent on the choice of the stimulus by training to two different low and high SF bands. Using pure SF bands without confounding elements, such as orientation, we have demonstrated that the oscillatory activity was modulated as a function of the SF intrinsic to the familiar visual stimulus. We propose that this oscillatory code is a common strategy primary visual cortex employs for segregating familiar versus novel visual inputs.
Oscillatory activity and arousal
Locomotion exerts an excitatory influence on V1, pushing most neurons into a more depolarized state and causing significant increases in visually evoked firing rates (Niell and Stryker, 2010; Polack et al., 2013; Reimer et al., 2014). This effect is thought to be a gain modulation, as firing rates are increased without a loss in feature selectivity (Cardin et al., 2008; Niell and Stryker, 2010; Polack et al., 2013). Increases in pupil size are also a robust measure of arousal and are highly correlated with periods of locomotion. Interestingly, aroused states (characterized by periods of pupil dilation or locomotion) are correlated with a reduction in the power of low-frequency oscillations in LFPs or membrane potential recordings in V1 (Niell and Stryker, 2010; Reimer et al., 2014; Vinck et al., 2015). Consistent with these reports, we found that stimulus-evoked increases in pupil size were correlated with decreased low-frequency oscillatory (4–30 Hz) activity. These pupil size increases occurred under conditions when visual stimuli were novel to the animal, indicating a surprise-based increase in arousal levels. We also observed significantly higher firing rates of stimulus-locked responses to novel stimuli compared with familiar stimuli, which conversely showed a decrease in stimulus-evoked firing rates following perceptual training. This observation is consistent with the reports of an arousal-based gain modulation of visually evoked responses. It is also consistent with the previously reported stimulus-specific adaptation leading to a decrease in the peak stimulus-evoked firing rate following repeated exposure to the same stimulus (Hamm and Yuste, 2016; Vinken et al., 2017). On the other hand, we observed a sustained poststimulus decrease (~1%) in pupil size correlated with higher-power low-frequency oscillations. This observation is consistent with the previous reports that the peak amplitude of low-frequency oscillatory activity is correlated with pupil constriction in awake mice (Reimer et al., 2014). However, the increase in low-frequency oscillatory activity we observed only occurred after visual stimulus presentation and was not spontaneous. Furthermore, we have also observed an increase in the power of high-frequency γ oscillations following training, which are traditionally associated with the aroused cortical state and increased pupil size. This finding of higher-power γ oscillations correlated with pupil constriction after training suggests that the low- and high-frequency sustained oscillations are representative of learning-related changes in the cortex and not simply brain state alterations. The changes in pupil size may represent the influence of neuromodulators known to mediate arousal. Transient changes in pupil dilation are preceded by increases in noradrenergic and cholinergic activity, even in the absence of locomotion (Reimer et al., 2016). The gain modulatory effects often attributed to locomotion can also be influenced by direct cholinergic or noradrenergic manipulation, demonstrating that neuromodulation alone can exert some of these effects (Goard and Dan, 2009; Pinto et al., 2013; Polack et al., 2013). However, both noradrenergic and cholinergic systems are also involved in circuit and synaptic plasticity, which makes it difficult to disambiguate the role of neuromodulators in arousal and learning (Seol et al., 2007). Our results show that muscarinic receptors are necessary for both the induction and expression familiarity-evoked oscillations, consistent with other work highlighting the importance of the cholinergic system in the regulation of persistent activity or learning and memory (Chubykin et al., 2013; Gavornik and Bear, 2014; Vandecasteele et al., 2014). A detailed study of how neuromodulators can regulate arousal and oscillatory rhythms in V1 in the context of learning is one of the important future directions of our research.
Oscillations and neural processing of visual familiarity
Animals need to effectively segregate familiar from novel stimuli to successfully interpret the environment around them. There is a significant body of evidence demonstrating the differential processing of stimuli in sensory cortices based on previous experience (Gavornik and Bear, 2014; Cooke et al., 2015; Kato et al., 2015; Makino and Komiyama, 2015; Poort et al., 2015), which in some cases results in persistent activity beyond the stimulus presentation (Shuler and Bear, 2006; Zold and Hussain Shuler, 2015). However, neither the stimulus nor the visual feature specificity of experience-dependent persistent activity has been demonstrated. Here, we show, for the first time, a neural signature of visual familiarity in V1 that manifests as persistent low-frequency (θ, α, β) oscillations after perceptual experience. Furthermore, we have shown that ongoing visually evoked oscillatory activity can influence the processing of incoming visual stimuli. Visual information arriving at the peak of the oscillation cycle can be amplified while information arriving at the trough may be unchanged or attenuated. There is evidence for a similar phenomenon in the auditory cortex, whereby a corticothalamic circuit can reset the phase of low-frequency (δ, θ) rhythms to regulate the salience of auditory stimuli (Guo et al., 2017). These data and the work by others suggest that oscillations can filter or gate information flow in the brain (Schroeder and Lakatos, 2009). Our findings that familiarity is signaled by low-frequency oscillatory activity are well aligned with studies reporting the existence of different frequency channels supporting bottom-up and top-down modulation. It has been suggested by several studies that γ oscillations are involved in feedforward information flow, whereas α and β bands are involved in feedback (Buffalo et al., 2011; van Kerkoerle et al., 2014; Bastos et al., 2015; Michalareas et al., 2016). There is still some debate about the θ band, with different groups differentially reporting its influence on the directionality of information flow (Bastos et al., 2015; Michalareas et al., 2016). However, oscillations may also be important for information exchange and synchronization between different brain areas (Buzsáki and Draguhn, 2004; Fries, 2005). Consistent with this theory, θ synchronization between visual area V4 and prefrontal cortex has been reported to be necessary for the maintenance of visual short-term memories (Liebe et al., 2012). Although visual stimulus familiarity significantly upregulated all frequency bands tested beyond the time of visual stimulation, the most striking difference pretraining versus post-training lies in the strengthening of the α and β bands. Interestingly, these increases are aligned with the potentiation of the second peak seen in the LFP, which is the most prominent signature of familiarity in our data. Furthermore, our experiment with pure SF bands demonstrated that α and β (and to a lesser extent θ) power are significantly modulated depending on how close the presented (SF) is from the trained stimulus. Top-down signals reporting familiarity might manifest through the emergence of these α and β oscillations, although we also see an increase in the power of bottom-up γ frequency band after training. Future studies are necessary to explore how these oscillations propagate across different cortical areas, and to dissect the molecular, cellular, and circuit mechanisms that drive them.
Footnotes
This work was supported by Whitehall Foundation Research Grant, Showalter Trust Young Investigator Grant, and National Institute of Mental Health Grant R01 MH116500 to A.A.C. S.C.M. was supported by a 2014 McKnight Technical Innovations in Neuroscience Award. We thank Mang Gao and Yevgenia Kozorovitskiy for useful discussions and comments.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Alexander A. Chubykin, Department of Biological Sciences, Purdue Institute for Integrative Neuroscience, Purdue University, West Lafayette, IN 47907. chubykin{at}purdue.edu