Abstract
The trial-to-trial response variability of nearby cortical neurons is correlated. These correlations may strongly influence population coding performance. Numerous studies have shown that correlations can be dynamically modified by attention, adaptation, learning, and potent stimulus drive. However, the mechanisms that influence correlation strength remain poorly understood. Here we test whether correlations are influenced by presenting stimuli outside the classical receptive field (RF) of visual neurons, where they recruit a normalization signal termed surround suppression. We recorded simultaneously the activity of dozens of cells using microelectrode arrays implanted in the superficial layers of V1 in anesthetized, paralyzed macaque monkeys. We presented annular stimuli that encircled—but did not impinge upon—the RFs of the recorded cells. We found that these “extra-classical” stimuli reduced correlations in the absence of stimulation of the RF, closely resembling the decorrelating effects of stimulating the RFs directly. Our results suggest that normalization signals may be an important mechanism for modulating correlations.
- electrophysiology
- extra-classical receptive field
- population coding
- spike count correlation
- surround suppression
- V1
Introduction
Repeated presentations of a sensory stimulus produce neuronal spiking responses that differ substantially from trial to trial. This response variability is correlated between nearby neurons with similar functional properties (Cohen and Kohn, 2011): a stronger than average response in one cell on a particular trial is likely to be accompanied by a similar enhancement in other nearby cells. These correlations in trial-to-trial responses may strongly influence population coding performance (Averbeck et al., 2006).
Recent work has shown that correlations are not fixed in strength; instead they depend on context (Kohn et al., 2009). For instance, in primary visual cortex (V1), the presentation of a high-contrast stimulus in the receptive fields (RFs) of a pair of neurons reduces the strength of long timescale correlations of their spontaneous activity (Kohn and Smith, 2005; Smith and Kohn, 2008; Huang and Lisberger, 2009; Tan et al., 2014). In V4, attention has been shown to weaken correlations, which may underlie its beneficial effects on performance (Cohen and Maunsell, 2009; Mitchell et al., 2009). Learning (Gu et al., 2011; Jeanne et al., 2013) and adaptation (Gutnisky and Dragoi, 2008; Adibi et al., 2013) have also been shown to alter correlations. Indeed, changes in correlation strength and structure may be an important component of how sensory processing adjusts to meet the demands of a particular environment or situation.
While there is good evidence that correlations are flexible, the mechanisms that control their strength remain unclear. To date, demonstrations of altered correlations have focused on manipulations that activate the recorded neurons. Here we test the possibility that stimuli that do not directly drive a pair of neurons can also alter their correlation. We focus on the modulatory influence of visual stimuli placed outside the classical RF—in the surround. The influence of the surround has been studied extensively in primary visual cortex (Angelucci and Bressloff, 2006), and its effects are suggested to be a form of divisive normalization (Heeger, 1992; Cavanaugh et al., 2002; Coen-Cagli et al., 2012), a canonical cortical computation (Carandini and Heeger, 2012). We find that surround stimulation can suppress correlations, much like stimuli within the RF. This effect was observed for surround stimulation even in the absence of stimulation of the RF. This suggests that the mechanisms underlying divisive normalization influence the strength of pairwise correlations, and we propose that this may account for at least some previous findings of flexible correlations.
Materials and Methods
Subjects and electrophysiological recordings.
The basic experimental procedures have been described previously (Smith and Kohn, 2008). Anesthesia was induced with ketamine (10 mg/kg) and maintained during preparatory surgery with isoflurane (1.0–2.5% in 95% O2). Sufentanil citrate (6–18 μg/kg/h, adjusted as needed for each animal) was injected intravenously to maintain anesthesia during recordings. Vecuronium bromide (0.15 mg/kg/h) was used to suppress eye movements. All procedures were approved by the Albert Einstein College of Medicine at Yeshiva University and followed the guidelines in the United States Public Health Service Guide for the Care and Use of Laboratory Animals.
We made five array implants in V1 of five hemispheres of four adult male cynomolgus (Macaca fascicularis) monkeys. Implants were made on the operculum of V1 ∼8 mm posterior to the lunate sulcus and 10 mm lateral to midline, where neuronal RFs are 2–3° eccentric in the lower visual field and ∼1° in diameter (Cavanaugh et al., 2002). We recorded neural activity with “Utah” arrays (Blackrock Microsystems), using methods reported previously (Kelly et al., 2007). Signals from each microelectrode were amplified and bandpass filtered (250 Hz to 7.5 kHz). Waveform segments that exceeded a threshold were digitized (30 kHz).
Waveform segments were sorted off-line using an automated algorithm with manual refinement (Smith and Kohn, 2008). We computed the signal-to-noise ratio (SNR) of each candidate unit as the ratio of the average waveform amplitude to the SD of the waveform noise (Kelly et al., 2007). Candidates with an SNR <2.75 were discarded as multiunit activity. We also eliminated neurons whose best grating stimulus did not evoke a response of at least 2 spikes/s (92 units removed). This procedure yielded a total of 375 cells recorded from 299 distinct electrodes (55–91 neurons/implant). Each neuron was paired with all other simultaneously recorded neurons, excluding pairs from the same electrode, providing 14,360 pairs. Changing our selection criteria for SNR and response rate did not alter meaningfully the reported results.
Stimuli.
We presented visual stimuli on a calibrated CRT monitor (1024 × 768 pixels, 100 Hz refresh) placed 110 cm from the animal, using custom software (EXPO). We presented sinusoidal gratings centered on the aggregate RF of the array implants. We used gratings of 4° diameter (Fig. 1) and two annular sinusoidal gratings, both with 12° outer diameters but one with a 4° inner diameter (“near annulus”) and the other an 8° inner diameter (“far annulus”). Post hoc analysis revealed that units on only 1 of the 299 channels in the analysis had response fields that slightly overlapped the near annulus. Gratings were full contrast, and presented at four equally spaced orientations (45° increments) for 1.28 s with a 10 s interstimulus interval. The spatial frequency (1.3 cpd) and temporal frequency (6.25 Hz) values were chosen to drive well parafoveal V1 neurons (Smith et al., 2002). Stimuli were presented monocularly. In a subset of experiments (three array implants), we included a 12° diameter grating, with otherwise identical properties, to measure surround suppression.
For experiments involving gratings presented to the ipsilateral field, we used 1.28 s stimulus durations separated by 1.5 s blank intervals. This shorter interstimulus interval allowed us to maximize the number of stimulus presentations. To avoid contamination due to offset responses from the preceding trial in these experiments, we restricted our measurements of spontaneous activity to the 600 ms preceding each stimulus. To maintain time bins of equal length, we also restricted the analysis of the stimulus period to the first 600 ms. Note that while spike count correlation (rsc) is dependent on the length of the time period over which spikes are counted, we have previously shown that its magnitude asymptotes for time periods exceeding 200 ms (Smith and Kohn, 2008).
Analysis.
The rsc is the Pearson correlation coefficient of the evoked spike counts of two cells to repeated presentations of a particular stimulus: it captures shared trial-to-trial variability. We calculated rsc separately for each stimulus orientation and then averaged these values to derive a single rsc value for each stimulus type for each pair of neurons. We used Fisher's r-to-Z transformation before performing statistical evaluation of rsc values.
We calculated the average rsc across all pairs within each array implant, and then averaged across the implants, weighted by the number of neurons on each implant. We weighted in this fashion rather than by the number of pairs for two reasons. First, the contribution of each array to the total number of pairs is nonlinearly related to the number of individual neurons from each array. Weighting by neuron number enables a consistent relative weight of each array for single neuron measurements (e.g., firing rate) and correlation data. Second, since the many correlations derived from all of the pairings of neurons are not independent (Olkin, 1981), treating each pair as an independent degree of freedom could violate assumptions of the statistical tests. It is more conservative to define the degrees of freedom using the number of individual neurons.
We compared firing rate and rsc across three stimulus conditions (4° RF grating, near annulus and far annulus; Fig. 1), and across the two time periods (baseline vs stimulation) using a two-way ANOVA. For the factor of “stimulus,” significant F tests (p < 0.05) were followed by post hoc pairwise repeated-measures t tests (two-tailed, α = 0.017, reflecting Bonferroni correction). The mean squared deviations and degrees of freedom for the F tests reflected weighting by the number of neurons, rather than the number of pairs.
To quantify the latency of changes in correlation strength following stimulus onset, we used a bootstrap resampling approach. First, we calculated rsc throughout the trial in 100 ms windows shifted by 2 ms. Next, we measured the mean and SD for rsc and firing rate across all prestimulus time windows. We estimated the onset latency of a change in rsc as the midpoint of the first 100 ms time window that exceeded three SDs above or below the mean of the baseline. We then created a surrogate dataset of equal size by resampling from our original 14,360 pairs of neurons with replacement, and estimated latencies for these data. The entire procedure was repeated 5000 times to create an estimate for the sampling distribution of latencies for each condition (the “bootstrap distributions”). The means of these bootstrap distributions are our estimates of the onset latencies, and the 0.5 and 99.5 percentiles of these distributions form 99% confidence intervals (CIs) for the latencies. To perform pairwise comparisons of onset latencies, we took the difference between the onset latencies for the two conditions of interest for each surrogate dataset; the distribution of these differences can be used to derive a p value by finding the widest confidence interval for the mean that does not include zero.
Results
We recorded from 375 V1 neurons with five microelectrode arrays implanted in four anesthetized, paralyzed macaque monkeys. This provided 14,360 neuronal pairs. We measured responses to three sets of visual stimuli: 4° sinusoidal gratings centered on the aggregate RF (termed RF gratings), and two sets of grating annuli that did not impinge on the RF (outer diameter of 12°, inner diameters of 4 and 8°; Fig. 1). Each stimulus was presented at four orientations. We measured firing rate and rsc during the 1.28 s of visual stimulation, and in 1.28 s epochs preceding stimulus onset.
We found that rsc was reduced by the presentation of a stimulus (Fig. 2). Across all conditions and pairs, the average rsc was 0.184 ± 0.007 (±SEM), significantly lower than during the baseline period immediately preceding stimulus onset (0.212 ± 0.007. F(1,2244) = 16.28, p ≪ 0.001). Critically, this decrease was observed not only for stimuli inside the RFs of the recorded neurons, but also for the two sets of annuli (all ps ≪ 0.001). The RF gratings caused the strongest drop in correlation strength (Δrsc = −0.042 ± 0.005), followed by the far (Δrsc = −0.029 ± 0.004) and near annuli (Δrsc = −0.017 ± 0.004). The effect size was different across stimuli (interaction: F(2,2244) = 18.62, p ≪ 0.001), but none of the pairwise comparisons was significantly different when corrected for multiple comparisons. Thus, we conclude that stimulus drive both within and outside the RF reduced the strength of correlations.
It has previously been shown that the value of rsc may be underestimated at low firing rates (de la Rocha et al., 2007; Cohen and Kohn, 2011). Our grating stimulus in the RF caused firing rates to increase from 6.802 ± 0.029 to 8.774 ± 0.035 Hz; the relatively small increase is due to suppression by the 4° grating and averaging across four orientations. Thus a decrease in firing rates could not explain the drop in correlation strength we observed with RF gratings, but it could contribute to the measurement with annular gratings, which slightly suppressed responses below the spontaneous firing rate (to 6.061 ± 0.029 and 5.989 ± 0.028 Hz for the near and far annulus, respectively; p ≪ 0.001 for both comparisons to spontaneous rate; p = 0.5 for comparison between annuli; Fig. 3, bottom). To be sure that the decorrelation was not due to weak firing rates and an underestimation of correlations, we restricted our analysis to high rate pairs—those in which both cells fired at least 3 spikes/s (we tested a range of threshold values from 1 to 5 spikes/s, and the results held throughout). We found robust decorrelation in this subset of data, similar in magnitude to that in the full dataset(Δrsc RF: − 0.03; Δrsc near: − 0.01; Δrsc far: − 0.03; all ps ≪ 0.001). Thus, even in pairs with robust responses, decorrelation was evident, ruling out low firing rates as an explanation for the effect.
V1 surround suppression involves feedforward, horizontal, and feedback circuitry. Because of the size of our annular gratings, it is unlikely that surround effects in the retina or thalamus contribute strongly (Angelucci and Bressloff, 2006). The contribution of horizontal and feedback circuitry can be distinguished by measurements of onset latency and its dependence on stimulus size (Bair et al., 2003; Henry et al., 2013). The propagation velocity of horizontal connections has been estimated to be 0.1–0.2 m/s (Bringuier et al., 1999). Our annular gratings likely drive V1 neurons located ∼5 and 10 mm away from those recorded (Van Essen et al., 1984). Thus, we would expect delays of 25–50 and 50–100 ms relative to RF signals, for the near and far annulus, respectively, discounting synaptic delays for inputs from neurons too distant to reach the recorded cells directly. Feedback connections, on the other hand, are fast (∼3 m/s; Girard et al., 2001) and project extensively. Thus, there would be little delay for either annulus, and no difference between effects driven by near and far annuli.
To quantify the latency of decorrelation for each condition, we calculated rsc in sliding 100 ms windows shifted by 2 ms steps (Fig. 3, top). We defined the onset latency as the midpoint of the first bin with a significant difference from the spontaneous value, and we used a bootstrap procedure to quantify our confidence in this estimate (see Materials and Methods). The latency for the RF grating-driven decrease in rsc was 108 ms (99% CI: 76–116 ms; Fig. 3, top, blue plot). For the near and far annuli, the latencies for the decrease were 110 ms (99% CI: 72–124 ms; yellow) and 62 ms (99% CI: 36–92 ms; red), respectively. The 99% CIs for the pairwise differences between conditions were as follows: Near-RF: − 30 to 40 ms; Far-Near: − 82 to 4 ms; Far-RF: − 62 to 8 ms. Thus, none of the conditions differed significantly in their onset latencies (i.e., all the CIs contain zero). As mentioned above, if horizontal connections mediated the decorrelation, we would predict an onset latency delay of ∼25 ms relative to that of the RF grating in the case of the near annulus, and 50 ms in the case of the far annulus. While we cannot rule out the possibility that the decorrelation caused by the near annulus is delayed 25 ms relative to that caused by the RF grating (the CI for this comparison contains 25 ms), we can reject the hypothesis that the effect of the far annulus is delayed by 50 ms relative to the RF grating (the CI does not contain 50 ms). The lack of a significant latency difference between the RF grating and the annuli, and between the two annuli, suggests a more important contribution from feedback than horizontal connections.
Our quantification of onset latency is based on crossing a statistical significance threshold, and thus sensitive to the magnitude and variability of changes in correlations. We therefore also computed onset latency by fitting a piecewise-linear function to the correlation time series of each pair. The pattern of results was qualitatively unchanged.
We next asked whether the strength of surround suppression in each pair was related to the magnitude of the annulus-driven decorrelation. To measure surround suppression, we included a 12° grating in our stimulus ensemble in three of the five array implants comprising the main dataset. We calculated a surround suppression index (SI) as the difference in response between the RF grating and the 12° grating (Cavanaugh et al., 2002), divided by the response to the RF grating: When calculating SI, we considered only the grating orientation that maximally excited each neuron. We also only considered cells for which the firing rate to the RF grating increased by >2 SDs above baseline, to minimize potential floor effects. We then averaged the SI values across the two cells comprising each pair.
The average SI across the 185 qualifying neurons was 0.18 (±0.014 SEM). Previous studies have reported average SI values of ∼0.4 (Cavanaugh et al., 2002). The weaker suppression that we observed is likely due to our RF grating covering the aggregate RF of all the recorded neurons. Since this 4° grating was much larger than a typical ∼1° V1 RF at our recording eccentricity (Cavanaugh et al., 2002), the neurons were likely already suppressed by the RF grating, leading to an underestimate of the true strength of suppression. Nevertheless, we did observe clear surround suppression in our sample of neurons.
We examined the relationship between SI and the average change in rsc caused by the near annulus (across all orientations), reasoning that if decorrelation and surround suppression relied on shared mechanisms then there would be an inverse relationship in these values. We found that the magnitude of the decrease in rsc was not related to SI (Pearson's r = −0.06, t(183) = −0.76; p = 0.77; Fig. 4). We obtained similar results when we performed this analysis only on pairs of neurons with matched orientation preferences and considering only the near annulus having their preferred orientation. Restricting the analysis to pairs of cells with similar suppression indices also provided similar results. Thus, despite most pairs showing both a decrease in rsc and firing rate by stimulation of the surround, the magnitude of these two effects were largely independent. The lack of a clear relationship between surround suppression and the drop in correlations caused by an annulus suggests that the decorrelation may involve a more distributed effect than the suppression provided directly to the pair of neurons.
We showed that stimulation of the RF and the surround each reduces correlations in V1. The inclusion of the large grating condition positioned us to assess whether these effects might be combined. Consistent with this suggestion, we found the magnitude of decorrelation caused by the large grating (Δrsc = −0.046 ± 0.010, p ≪ 0.001) was significantly greater than that caused by the RF grating (Δrsc = −0.006 ± 0.011) for the three implants for which we collected both conditions (paired-samples t test: p = 0.002). Thus, providing drive to the surround decorrelates responses to the RF stimulus. It was possible that the decrease in firing rate caused by surround suppression to the large grating relative to the RF grating could have led to this reduction in correlations. As with our main analysis above, we also re-examined the data after excluding neurons with average firing rates <3 Hz. We found that the decorrelation induced by the large grating remained above that induced by the RF grating for this subset of high firing rate neurons (p = 0.011).
Finally, we investigated whether the decorrelation caused by stimulation beyond the RF was due to spatially specific network interactions or to a more global mechanism. To distinguish between these possibilities, we compared the effects of stimuli in the ipsilateral hemifield to gratings presented over the aggregate RF. If the ipsilateral stimuli, which provide drive to the contralateral hemisphere, also reduced correlations, this would suggest a global mechanism. For this experiment, we collected data from 64 neurons in one array implant, providing 2016 pairs.
As in the primary experiment, rsc was weaker during stimulation of the RF (baseline: rsc = 0.186 ± 0.023; stimulus: rsc = 0.164 ± 0.016; two-tailed paired-samples t test: p ≪0.001). In contrast, rsc did not change during stimulation of the ipsilateral hemifield (baseline: rsc = 0.172 ± 0.023; stimulus: rsc = 0.170 ± 0.023; p = 0.7). As expected, the firing rate increased during stimulation of the RF (prestimulus: 7.036 ± 0.782 Hz; stimulus: 10.278 ± 1.158 Hz; p = 0.022), but did not change during ipsilateral stimulation (prestimulus: 6.967 ± 0.784 Hz; stimulus: 6.894 ± 0.780 Hz; p = 0.9). Thus, the suppression of correlations by stimuli delivered outside the RF only occurs for inputs provided to the contralateral visual field.
Discussion
It has long been known that the firing rate of many V1 neurons is suppressed by stimuli placed outside the classical RF (Hubel and Wiesel, 1968), a phenomenon now commonly known as surround suppression (Angelucci and Bressloff, 2006). Here we have shown that correlations are also suppressed by stimuli in the RF surround. Remarkably, this suppression is similar to that caused by stimulating the RF directly, despite having opposite effects on neuronal firing rates.
V1 surround suppression is thought to be inherited in part from precortical structures, with important additional contributions from horizontal and feedback circuitry in cortex (Angelucci and Bressloff, 2006). The effects we report likely depend most strongly on feedback connections. First, the spatial extents of the annuli we use fall beyond the region where precortical mechanisms contribute most strongly (Angelucci and Bressloff, 2006). Second, the onset latency for decorrelation caused by surround stimulation was similar to that caused by RF stimulation, and we observed no difference in the latency of signals from the near and far annulus. This is difficult to explain with the propagation of signals via slowly conducting horizontal connections.
Surround suppression has been shown to involve a withdrawal of excitatory drive (Ozeki et al., 2009), the recruitment of inhibition (Haider et al., 2010), or both (Anderson et al., 2001). There are several possible scenarios for how these mechanisms could reduce correlations. The change in the ratio of excitation to inhibition could lead to more balanced networks (Renart et al., 2010), in which spiking correlations are reduced by the coordination between inhibitory and excitatory inputs. Stimulation of the surround, like the RF, may also cause decorrelation by “stabilizing” spontaneous fluctuations in the activity of clustered groups of neurons (Litwin-Kumar and Doiron, 2012). Alternatively, the withdrawal of excitation may involve a loss of inputs shared with other nearby cells, resulting in weaker correlations. Finally, the recruitment of suppression may make cells intrinsically less sensitive to shared fluctuations in their input. The slow timescale correlations we measured here likely arise from low-frequency fluctuations in shared inputs; if surround-mediated inputs lower the input resistance of cells, this might make them less sensitive to these low-frequency signals (Kohn and Smith, 2005). However, we observed no relationship between the strength of suppression and decorrelation, on a pair-by-pair basis, arguing against this final interpretation.
Surround suppression is a form of normalization (Heeger, 1992; Schwartz and Simoncelli, 2001; Cavanaugh et al., 2002; Carandini and Heeger, 2012; Coen-Cagli et al., 2012). In this framework, the response of a neuron is proportional to the excitatory drive it receives, divided by the summed activity of a broader population of neurons. For surround suppression, the relevant pool involves neurons with offset (or larger) spatial RFs. Interestingly, other signals that modulate correlations also involve normalization. For instance, increasing stimulus contrast in the RF recruits more normalization (Carandini et al., 1997) and reduces long timescale correlations (Kohn and Smith, 2005; Huang and Lisberger, 2009; Tan et al., 2014). Increasing the size of a high-contrast grating, like presenting annular gratings, recruits additional normalization from the surround and weakens long timescale correlations further, as shown here and in data of Jia et al., 2013 (unpublished analysis). Attentional modulation also involves normalization (Reynolds and Heeger, 2009; Sundberg et al., 2009; Lee and Maunsell, 2010; Ni et al., 2012; Schwartz and Coen-Cagli, 2013) and it too weakens correlations (Cohen and Maunsell, 2009; Mitchell et al., 2009). Finally, adaptation may affect normalization signals (Wainwright et al., 2002; Wissig and Kohn, 2012), and it also alters correlations (Gutnisky and Dragoi, 2008; Adibi et al., 2013). Note that although these many different forms of modulation have been cast as involving normalization, they may involve distinct cellular and circuit processes. Nevertheless, together these findings suggest a framework for understanding the mechanisms that control the strength of correlations, namely that they may be tightly related to those involved in normalization.
Surround suppression is a well known form of spatial contextual modulation in vision, which has been linked to a number of perceptual and computational phenomena, like figure-ground segregation (Lamme, 1995; Zhou et al., 2000), contour completion (Field et al., 1993; Li et al., 2008), and salience (Itti and Koch, 2000; Li, 2002; Coen-Cagli et al., 2012). This literature has focused on the effects of surround stimuli on the firing rate of individual neurons. Our results show that surround stimuli can also alter the coordination of activity within a neuronal population. This may influence the coding performance of that population (Averbeck et al., 2006), or the strength of the input it provides to downstream circuits. Thus, a full understanding of how surround contextual effects contribute to visual processing will likely require a greater appreciation and understanding of their effect on population coordination.
Footnotes
A.C.S. was supported by an institutional National Institutes of Health (NIH) postdoctoral training fellowship through the University of Pittsburgh (T32NS007391) and an individual NIH fellowship (F32EY023456). M.J.M. was supported by an NIH undergraduate research training fellowship through the University of Pittsburgh's Program in Neural Computation (R90DA023428). A.K. was supported by NIH Grant R01EY016774 and Research to Prevent Blindness. M.A.S. was supported by NIH grants R00EY018894, R01EY022928, and P30EY008098; a career development grant and an unrestricted award from Research to Prevent Blindness; and the Eye and Ear Foundation of Pittsburgh.
The authors declare no competing financial interests.
- Correspondence should be addressed to Matthew A. Smith, Department of Ophthalmology, University of Pittsburgh, Eye and Ear Institute, 203 Lothrop Street, 9th Floor, Pittsburgh, PA 15213. smithma{at}pitt.edu