Abstract
Although several lines of evidence suggest that stimulus selectivity in somatosensory and visual cortices is critically dependent on unselective inhibition, particularly in the thalamorecipient layer 4, no comprehensive comparison of the responses of excitatory and inhibitory cells has been conducted. Here, we recorded intracellularly from a large population of regular spiking (RS; presumed excitatory) and fast spiking (FS; presumed inhibitory) cells in layers 2–6 of primary visual cortex. In layer 4, where selectivity for orientation and spatial frequency first emerges, we found no untuned FS cells. Instead, the tuning of the spike output of layer 4 FS cells was significantly but moderately broader than that of RS cells. However, the tuning of the underlying synaptic responses was not different, indicating that the difference in spike-output selectivity resulted from differences in the transformation of synaptic input into firing rate. Layer 4 FS cells exhibited significantly lower input resistance and faster time constants than layer 4 RS cells, leading to larger and faster membrane potential (Vm) fluctuations. FS cell Vm fluctuations were more broadly tuned than those of RS cells and matched spike-output tuning, suggesting that the broader spike tuning of these cells was driven by visually evoked synaptic noise. These differences were not observed outside of layer 4. Thus, cell type-specific differences in stimulus feature selectivity at the first level of cortical sensory processing may arise as a result of distinct biophysical properties that determine the dynamics of synaptic integration.
Introduction
The responses of cortical cells to sensory stimuli depend critically on the interactions between excitatory and inhibitory inputs. Particularly in the thalamorecipient cortical layer 4, studies in vivo have shown that both the selectivity (Wehr and Zador, 2003; Wilent and Contreras, 2005b) and timing (Higley and Contreras, 2006) of cortical responses to sensory input are determined by the relative timing and amplitude of inhibitory and excitatory inputs. In the primary visual cortex, this interplay between excitation and inhibition is a critical element of the receptive field structure of simple cells (Anderson et al., 2000a; Monier et al., 2003; Marino et al., 2005) and, consequently, of stimulus selectivity (Jones et al., 1987; Lampl et al., 2001; Monier et al., 2003). Moreover, local inhibitory input in primary visual cortex has been implicated in maintaining constant orientation and spatial frequency selectivity during changes in stimulus contrast (Somers et al., 1995; Troyer et al., 1998; Lauritzen and Miller, 2003).
Intracellular studies in primary visual cortex in vivo have used the tuning properties of excitatory and inhibitory postsynaptic potentials (Ferster, 1986, 1987, 1988; Anderson et al., 2000a; Hirsch, 2003; Monier et al., 2003) to infer the tuning properties of presynaptic excitatory and inhibitory cells. Based on these inferences, a variety of strategies have been proposed by which excitatory and inhibitory inputs may contribute to the emergence of stimulus selectivity. However, the tuning of postsynaptic excitation and inhibition does not necessarily reflect the properties of individual presynaptic cells. Rather, it is strongly dependent on both patterns of synaptic convergence and the relative degree of recurrent activation of excitation and inhibition. Although there is some evidence for broadly tuned interneurons in layer 4 (Azouz et al., 1997; Hirsch et al., 2002, 2003; Usrey et al. 2003), it is not clear whether inhibitory and excitatory neurons comprise two populations with distinct degrees of stimulus selectivity. Furthermore, if systematic tuning differences exist between excitatory and inhibitory cells, they may derive either from differences in synaptic input or from differences in the intrinsic transformation of synaptic input to spike output.
Here we use intracellular recordings in cat primary visual cortex in vivo to compare the stimulus selectivity of two classes of cells: fast spiking (FS) inhibitory interneurons and regular spiking (RS) excitatory projection neurons (Connors et al., 1982; Kawaguchi and Kubota, 1993; Nowak et al., 2003). We find that layer 4 RS and FS cells demonstrate statistically indistinguishable tuning for stimulus orientation and spatial frequency at the level of synaptic responses, but FS cells show significantly broader spike-response tuning than RS cells. In contrast, we find no difference in the synaptic or spike tuning of RS and FS cells outside layer 4. Within layer 4, differences in visually evoked membrane potential (Vm) fluctuations and distinct biophysical properties account for these differences in stimulus selectivity. Together, our results suggest that the differing sensory response properties of RS and FS cells in layer 4 result, in part, from biophysical membrane properties. These results constrain previously proposed models of both the stimulus selectivity and the contrast invariance of cortical visual responses.
Materials and Methods
Surgical protocol.
Experiments were conducted in accordance with the guidelines of the National Institutes of Health and with the approval of the Institutional Animal Care and Use Committee of the University of Pennsylvania. Surgical and recording methods were as reported previously (Contreras and Palmer, 2003; Cardin et al., 2005). Briefly, adult cats (2.5–3.5 kg) were anesthetized with an initial intraperitoneal injection of thiopental (25 mg/kg) and supplementary halothane (2–4% in a 70:30 mixture of N2O and O2). Subsequently, inhalant anesthesia was discontinued, intravenous thiopental was administered, and the animal was paralyzed with gallamine triethiodide (Flaxedil). Anesthesia was maintained during surgery with intravenous thiopental as needed for the duration of the experiment (14–16 h) with a continuous infusion (3–10 mg/hr). Anesthesia level was monitored by continuous recordings of heart rate, blood pressure, and EEG. Because thiopental was infused continuously, we obtained a very stable level of anesthesia throughout the experiment. The end-tidal CO2 was kept at 3.7 ± 0.2% and the rectal temperature was kept at 37–38°C with a heating pad.
The surface of the visual cortex was exposed with a craniotomy centered at Horsley–Clarke coordinates posterior 4.0, lateral 2.0. The stability of the recordings was improved by performing a bilateral pneumothorax, draining the cisterna magna, suspending the hips, and filling the cranial defect with a solution of 4% agar. Intracellular recordings were performed with glass micropipettes (50–80 MΩ) filled with 3 m potassium acetate and 1–2% neurobiotin. All cells used in this study had a stable Vm more negative than −60 mV, had overshooting action potentials, and were recorded for at least 15 min. For each electrode penetration, we lowered the pipette perpendicular to the surface of the apex of the lateral gyrus. In hundreds of electrode penetrations, we consistently found layer 4 simple cells clustered between 600 and 950 μm, as measured from the microdrive position. Laminar position was then confirmed post hoc by staining filled cells, and these depths varied from the microdrive values by an average of ±30 μm. A plot of the depths of the simple and complex cells used in this study is shown in supplemental Figure 1 (available at www.jneurosci.org as supplemental material).
Visual stimulation.
The corneas were protected with contact lenses after dilating the pupils with 1% atropine and retracting the nictitating membranes with phenylephrine (Neosynephrine). Spectacle lenses were chosen by the tapetal reflection technique to optimize the focus of stimuli on the retina. The position of the monitor was adjusted with an x–y stage so that the area centralae were well centered on the screen.
Stimuli were presented on an Image Systems (Minnetonka, MN) model M09 LV monochrome monitor operating at 125 frames per second at a spatial resolution of 1024 × 786 pixels and a mean luminance of 47 cd/m2. Custom software allowed for stimulus control, on-line displays of acquired signals (Vm and spikes), and a graphical user interface for controlling all stimulus parameters. In addition to this online control, all data were stored on a Nicolet Vision (LDS, Middleton, WI) for off-line analyses. Vm and stimulus marks were sampled at 10 kHz with 16 bit analog-to-digital converters.
Computer-assisted hand plotting routines were used with every cell to provide initial estimates of the optimal orientation, direction of movement, and spatial and temporal frequencies and to determine the receptive field position and dimensions. Tuning curves for orientation and spatial frequency were then determined more precisely with several series of drifting sinusoidal gratings spanning the initial estimates. For broadly tuned cells, orientation was varied in steps of 22.5°. For most cells, orientation was varied in smaller steps around the optimal (±3, ±6, ±12, ±18, ±26, ±34, and ±44°), providing much finer resolution of the tuning curve. Similarly, spatial frequency tuning was assessed using stimuli whose spatial frequency varied in small steps around the initial estimates. All stimuli in a given series were presented in pseudorandom order. When orientation or spatial frequency was varied, all other parameters of the stimuli, including mean luminance, were held constant.
Cells were classified as simple or complex based on two criteria. First, the relative modulation of spike trains evoked by an optimized patch of drifting sinusoidal grating was measured. If the response at the fundamental temporal frequency of the stimulus (F1) exceeded the average (DC) response, the cell was classified as simple. Otherwise, the cell was classified as complex (Skottun et al., 1987). Second, we estimated the one-dimensional spatiotemporal weighting function by averaging membrane potential and spike responses to bright and dark bars (n = 16) distributed across the receptive field at the optimal orientation. Cells exhibiting nonoverlapping regions excited by bright and dark stimuli were classified as simple. Cells showing excitation to bright and dark stimuli throughout their receptive fields were classified as complex. These two measures yielded the same functional classification in every case.
Response quantification.
Orientation curves were characterized by the half-width at half height (HWHH) of the best fitted Gaussian function. Bandwidth (BW) of the spatial frequency curves was calculated in octaves as BW = log2(Fhigh/Flow), where Fhigh and Flow are the upper and lower frequency cutoffs at half-height on the tuning curve, respectively. Cells with low-pass spatial frequency characteristics were not included in population estimates of bandwidth. For Vm measurements, spikes were removed by first determining the time at which spike threshold was reached and then extrapolating the Vm values from that point to when the spike repolarized back the spike threshold level. This was followed by smoothing with a three-point running average. The number of spikes did not affect comparisons of Vm tuning estimates, as comparison of RS and FS cells with either similar or very different firing rates did not change the results of the analyses. Spike output in response to drifting gratings was measured as the F1 and the DC of the mean firing rate for simple and complex cells, respectively. The magnitude of the Vm fluctuations associated with spikes evoked by visual stimuli was measured as the SD of the mean Vm value (Vm SD) during the 10 ms preceding the threshold crossing of each spike. To normalize measurements across cells, all Vm SD measurements were taken as the increase relative to the Vm SD in the absence of visual stimulation. The Vm SD was only measured when the interspike interval ≥10 ms. This method of measurement avoided contamination of the Vm SD values with artifacts left by spike removal and focused on the Vm fluctuations most closely associated with visually evoked spike output. To compare Vm fluctuations at optimal and nonoptimal orientations, the visually evoked increase in prespike Vm SD was measured using drifting gratings at two orientations: one eliciting a maximal response and one eliciting a half-maximal response.
Overall mean rate of Vm change (dV/dt) was quantified by calculating the mean of the absolute value of the first derivative of a 10 to 30 s recording after spike removal (|dV/dt|). Prespike dV/dt was measured by calculating the Vm change over the 2 ms leading up to spike threshold. Prespike dV/dt was measured only for spikes with an interspike interval of ≥10 ms. For estimation of membrane time constant and input resistance in the absence of visual stimulation, we used a larger group of RS and FS cells that included those used in the rest of the study. Time constant and input resistance were measured from injection of 10–20 small square current pulses per cell.
Prediction of spike output.
Two models were used to capture the relationship between membrane potential (Vm) and firing rate (R):
The power-law model (Eq. 1) is identical to that used previously (Miller and Troyer, 2002; Murphy and Miller, 2003; Priebe et al., 2004; Priebe and Ferster, 2006) (see also Carandini, 2004). Sets of values of R and Vm were extracted from 20 ms bins in the responses to gratings drifting at all orientations and spatial frequencies. The scale factor (kp) and the exponent (p) were estimated from these sets of values using a nonlinear estimation procedure. The same sets of values were used to estimate the scale factor (kl) and threshold (Vth) in the linear model (Eq. 2).
Once the parameters were estimated for each cell, these equations were used to generate predicted orientation and spatial frequency tuning curves. Specifically, cyclegrams for stimuli presented at all orientations and spatial frequencies were binned at 20 ms and firing rates calculated from the Vm values according to Equations 1 and 2. The final result was the predicted F1 or DC components of the spike response as a function of orientation and spatial frequency.
Because, as we show below, the Vm SD varies systematically with stimulus parameters, we also used a variant of the linear model: where Vth and kl are the same as in Eq. 2, but synaptic noise (SN) is added to the Vm obtained from each 20 ms bin. SN was drawn randomly from Gaussian distributions of zero mean whose SD matched that measured for each orientation and spatial frequency. A Gaussian distribution was used because the distribution of Vm values during baseline activity at resting Vm was well fit by a Gaussian in all cells. Because this model can occasionally produce small negative firing rates, the output was truncated at 0 Hz (rectified).
Statistical analysis.
Statistics were used as described in Results. Unless otherwise noted, error bars denote SEM.
Histology.
At the end of each experiment, the animal was given a lethal dose of sodium pentobarbital and perfused intracardially with 0.9% saline followed by cold 4% paraformaldehyde in 0.1 m sodium phosphate buffer. The brain was post-fixed and cryoprotected, sectioned at 100 μm, and treated with Cy3-conjugated streptavidin (Jackson ImmunoResearch, West Grove, PA) to label cells filled with neurobiotin. To confirm recording locations and laminar positions, Cy3-labeled cells and laminar boundaries were visualized with an Olympus (Melville, NY) BX51 microscope. Examples of filled cells are shown in supplemental Figure 1 (available at www.jneurosci.org as supplemental material).
Results
The purpose of this study was to compare the orientation and spatial frequency selectivity of regular spiking and fast spiking neurons in primary visual cortex in vivo. Our initial database of cells included 671 neurons distributed throughout layers 2–6 of cat primary visual cortex (layers 2/3, n = 175; layer 4, n = 168; layers 5/6, n = 328). All cells included in the database had a stable resting Vm more negative than −60 mV for >15 min and an overshooting action potential. Of those, 463 were RS cells and 73 were FS cells. We chose a subset of RS cells at random for statistical comparison with the smaller FS cell population (n = 140 RS and 73 FS) (supplemental Table 1, available at www.jneurosci.org as supplemental material, for details). Within these groups, 63 cells (35 RS, 28 FS) were located in layer 4. Cells were classified electrophysiologically according to their spike duration at half height (RS, 0.73 ± 0.1 ms; FS, 0.36 ± 0.1 ms), their degree of firing rate adaptation, and the slopes of their frequency–intensity (F–I) relationships in response to square current pulses, as shown in Figure 1(McCormick et al., 1985; Nowak et al., 2003). In addition, RS and FS cells differed in their spike afterhyperpolarization (AHP; FS cells have short and pronounced AHPs) (Fig. 2) and their overall firing rates in response to current injection (FS, >300 Hz). Because the spontaneous activity of RS and FS cells varied widely across cells, the spontaneous firing rates of the two populations of cells were not significantly different.
Orientation and spatial frequency tuning
We measured orientation tuning with drifting gratings of optimal spatial frequency and quantified the responses by measuring the F1 and DC response components for simple and complex cells, respectively (see Materials and Methods). The orientation tuning curves for the Vm and spike responses of four representative cells are shown in Figure 2. A simple RS cell from layer 4 showed a characteristic, robustly modulated Vm response to optimally oriented drifting gratings with a peak to peak amplitude of 14.9 mV and generated a modulated spike response with firing rates around 9.8 Hz (Fig. 2A). The half-maximal response elicited by a nonoptimally oriented grating is shown for comparison. The DC responses of this cell were similarly tuned (data not shown), although of much smaller amplitude. A complex RS cell from layer 5 (Fig. 2B) demonstrated a sustained (DC) response of 9.5 mV to an optimally oriented drifting grating, leading to an unmodulated firing rate increase of 24.6 Hz, and showed tuning of both the Vm and spike responses. A simple FS cell in layer 4 showed broader spike tuning than the simple layer 4 RS cell, and less sharpening of the tuning of spike responses relative to Vm responses. In response to an optimally oriented grating, this cell showed peak to peak Vm fluctuations of 11.2 mV (Fig. 2C) and firing rates of 56.8 Hz. A complex FS cell in layer 5 had a Vm response to an optimally oriented grating of 10.1 mV and it showed a sustained firing rate of 78.1Hz (Fig. 2D). Additional example cells are shown in supplemental Figure 2 (available at www.jneurosci.org as supplemental material). Spatial frequency tuning was assessed with drifting gratings of optimal orientation and the responses were quantified as the F1 or DC components for simple and complex cells, respectively. Figure 3 shows the Vm and spike spatial frequency tuning curves for the four cells shown in Figure 2. The two simple cells shown in Figures 2 and 3 illustrate differences between layer 4 RS and FS cells that were consistent throughout the population. Layer 4 FS cells exhibited similar Vm tuning, but broader spike response tuning than layer 4 RS cells. This difference was not observed in nongranular layers.
In agreement with previous reports (Ferster et al., 1996; Chung and Ferster, 1998; Anderson et al., 2000a), orientation tuning of spike responses was narrower than that of the underlying Vm responses. This is shown by the plot of Vm HWHH versus spike HWHH in Figure 4A, in which most RS (n = 108) and FS (n = 61) cells fall below the unity line. Similarly, spatial frequency tuning of Vm responses was broader than that of spike responses for both RS (n = 76) and FS (n = 39) cells (Fig. 4B). As shown by the accompanying histograms of Vm and spike tuning widths, when cells in all layers were included there were no significant differences between RS and FS population means for either orientation or spatial frequency (unpaired t test; p > 0.05 in all cases).
Differences in stimulus selectivity between RS and FS cells in layer 4
When the comparison was limited to cells in layer 4, the orientation tuning of the spike response was significantly broader in FS than RS cells (RS, 14.5 ± 1.0°, n = 31; FS, 24.8 ± 1.8°, n = 26; Mann–Whitney test, p < 0.001) (Fig. 4C). Spatial frequency tuning was also broader for the spike response of layer 4 FS cells than layer 4 RS cells [FS, 2.30 ± 0.27 octaves (oct), n = 16; RS, 1.31 ± 0.16 oct, n = 21; p < 0.01] (Fig. 4D). Surprisingly, the underlying Vm responses were not significantly different for either orientation (RS, 29.9 ± 1.8°; FS, 32.8 ± 2.4°; p > 0.05) (Fig. 4C) or spatial frequency (RS, 2.61 ± 0.22 oct; FS, 2.87 ± 0.40 oct) (Fig. 4D). The Vm data suggest that the tuning of the summed synaptic input to layer 4 RS and FS cells is not significantly different for orientation or spatial frequency. However, cell type-specific intrinsic properties may differentially affect the conversion of synaptic input to spike output in layer 4 RS and FS cells.
Differences in stimulus selectivity between RS and FS cells in nongranular layers and comparison with layer 4
The spike output tuning differences between layer 4 FS and RS cells were not replicated in downstream cortical layers. No significant differences between FS and RS orientation tuning for either Vm or spike responses were observed in supragranular (n = 25 RS and 11 FS) or infragranular (n = 54 RS and 25 FS) layers (p > 0.05 in all cases) (supplemental Table 1, available at www.jneurosci.org as supplemental material). Similarly, no significant differences were observed between FS and RS spatial frequency tuning for the Vm or spike responses in supragranular (n = 25 RS and 4 FS) or infragranular (n = 52 RS and 18 FS) layers (p > 0.05 in all cases) (supplemental Table 1, available at www.jneurosci.org as supplemental material).
The lack of differences between RS and FS cells in nongranular layers, and particularly in layers 2/3, could be attributable to their common excitatory input from layer 4 RS cells and differences in local connectivity. However, the orientation tuning of the spike output of layer 4 RS cells (14.5 ± 1.0°; n = 28) was significantly narrower than both the Vm and spike tuning of their target RS (Vm, 47.5 ± 4.5°, p < 0.001; spikes, 23.6 ± 2.1°, p = 0.04; one-way ANOVA with Dunn's multiple comparison test) and FS (Vm, 42.6 ± 8.9°, p < 0.001; spikes, 20.3 ± 2.1°, p = 0.04) cells in supragranular layers. This significant decrease in selectivity suggests some degree of pooling of the sharply tuned output of layer 4 cells, either directly at their synaptic targets in layers 2/3 or indirectly via entrainment of extensive local recurrent excitation.
Across our population of recorded cells in all layers, we observed no RS or FS cells that were untuned for stimulus orientation or spatial frequency. In particular, none of the complex layer 4 FS cells we recorded were untuned (n = 6; orientation: Vm, 30.1 ± 8.3°; spikes, 26.8 ± 10.8°; spatial frequency: Vm, 2.58 ± 0.28 oct; spikes, 2.22 ± 0.42 oct). We observed very few cells (n = 1 of 116 RS, 2 of 67 FS) with extremely broad Vm tuning for orientation (HWHH >100°), and only one of those, a simple FS cell in layer 4, maintained such broad tuning in its spike output. We observed a number of cells that demonstrated low-pass spatial frequency tuning for Vm (12 of 108 RS cells; 11 of 41 FS cells) and spikes (4 of 108 RS cells; 2 of 41 FS cells). Interestingly, in layer 4, although 0 of 23 RS cells demonstrated low-pass spatial frequency tuning for Vm, 5 of 15 FS cells did. No differences were observed in tuning for stimulus orientation or spatial frequency between simple and complex cells at the Vm or spike level (n = 87 simple, 125 complex; p > 0.05 in all cases) (data not shown). The average population values for orientation tuning and spatial frequency tuning in all cortical layers are shown in supplemental Table 1 (available at www.jneurosci.org as supplemental material).
Vm responses predict spike tuning in RS, but not FS, cells in layer 4
The broader orientation and spatial frequency tuning of the spike, but not Vm, responses of layer 4 FS may derive from differences in the transformation of Vm activity into spike output. To test this possibility, we generated a predicted spike output from the Vm responses of each layer 4 cell and compared the orientation and spatial frequency tuning curves of the predicted spike responses with those obtained experimentally. Predicted spike responses were generated by first obtaining power law fits for the relationship between Vm and firing rate for each cell. The power law fit parameters were then applied to each cell's averaged Vm responses to drifting grating stimuli to give predicted firing rates for each orientation or spatial frequency (see Materials and Methods).
The mean exponent of the power law functions for layer 4 FS cells (2.5 ± 0.05; n = 24) was significantly higher than that for layer 4 RS cells (1.8 ± 0.08; n = 25; p < 0.001). A higher exponent should result in narrower FS spike response tuning, the opposite of the experimental observations described above. We thus expected that the power law predictions for FS cell spike tuning would underestimate the observed values. Indeed, as shown by the example cells in Figure 5, whereas the prediction was accurate for both the orientation and spatial frequency tuning of the layer 4 RS cell (Fig. 5A,C, top plots), in both cases it underestimated the observed values for the layer 4 FS cell (Fig. 5A,C, bottom plots). Across the population of cells, this was true for both orientation (Fig. 5B) and spatial frequency (Fig. 5D). In both cases, data from RS cells fell close to the unity line, indicating a good match between predicted and observed values, whereas data from the FS cells fell predominantly below the line, indicating broader observed than predicted tuning. The mean ratio of predicted to observed HWHH of orientation tuning was significantly <1 for FS cells (0.69 ± 0.04; p < 0.001, one-sample t test), indicating a poor match between predicted and observed tuning, but was close to 1 for RS cells (1.04 ± 0.03; p > 0.05). Similarly, the mean ratio of predicted to observed BW of spatial frequency was significantly <1 for FS cells (0.71 ± 0.05; p < 0.001), but was close to 1 for RS cells (1.06 ± 0.02; p > 0.05). Together, these results suggest that the broader tuning of FS cells is not a simple consequence of a static relationship between Vm and firing rate.
Vm fluctuations and spike tuning in layer 4 cells
One possible mechanism underlying both the different stimulus selectivity of RS and FS spike responses and the failure of Vm values to predict spike response tuning in FS cells is cell type-specific differences in Vm fluctuations (also called synaptic noise). Increased synaptically driven Vm fluctuations, quantified as an increase in Vm SD (see Materials and Methods), are likely to cause the Vm to cross the spike threshold more often, despite a similar level of mean Vm depolarization (Anderson et al., 2000b; Azouz and Gray, 2000). This impact of Vm fluctuations on spike output could lead to broader spike response tuning than that predicted from the Vm values, as was the case for layer 4 FS cells.
Indeed, the increase in Vm SD evoked by visual stimuli was higher in layer 4 FS than RS cells not only in response to the preferred orientation (FS, 3.2 ± 0.2 mV, n = 26; RS, 1.9 ± 0.1 mV, n = 31; p = 0.03), but also to nonoptimally oriented stimuli that generated a half-maximal response (FS, 1.7 ± 0.2 mV, n = 25; RS, 0.8 ± 0.1 mV, n = 32; p < 0.001). These differences were not observed in nongranular cells for either the optimal (FS, 1.8 ± 0.6 mV, n = 39; RS, 2.0 ± 0.5 mV, n = 83; p > 0.05) or the nonoptimal (FS, 1.1 ± 0.1 mV, n = 39; RS, 1.0 ± 0.1 mV, n = 83; p > 0.05) orientation. Across the population of cells, the mean baseline Vm SD in the absence of visual stimulation was ±2.3 mV.
As shown by the example traces and tuning curves from the layer 4 RS and FS cells in Figure 6A, the magnitude of the visually evoked synaptic noise (Vm SD) was tuned for stimulus orientation. In both example cells, the Vm SD tuning curve agreed well with the tuning curve of the spike response. Indeed, for the populations of layer 4 RS and FS cells, the widths of the tuning curves for orientation (Fig. 6B) and spatial frequency (Fig. 6E) of the Vm SD response plotted against those of the spike responses were closer to the unity line than were the plots of Vm response versus spike response. Vm response tuning was consistently broader than Vm SD tuning in RS and FS cells for both orientation and spatial frequency (Fig. 6C,F). These results are highlighted by HWHH (Fig. 6D) and BW (Fig. 6G) ratios. In each case, the ratio of Vm SD to spike tuning widths was close to 1, indicating a close match between the Vm SD and spike tuning curves. These results suggest that Vm fluctuations, rather than the overall level of Vm depolarization, may be a critical determinant in driving spike output in layer 4 cells.
A linear model with tuned Vm SD accurately predicts spike output in all layer 4 cells
Because the Vm SD response was well tuned for both orientation and spatial frequency, we tested the hypothesis that a linear model of spike generation, in combination with tuned Vm noise, is a better predictor of spike output tuning properties than the power law model. For each cell, we compared the accuracy of the tuning predictions generated by a linear threshold (LT) model alone with the accuracy of predictions generated by adding Vm noise matched to the experimentally observed Vm SD evoked by each stimulus condition. The observed Vm SD response was measured in short epochs (see Materials and Methods). For illustrative purposes, Figure 7A shows several long example traces of visually evoked Vm SD. As shown by the orientation (Fig. 7A) and spatial frequency (Fig. 7D) tuning curves for two example cells, the LT prediction was much narrower than the observed tuning in each case. However, the linear threshold model with tuned noise (LTN) resulted in consistently accurate predictions of spike response tuning for both the RS and FS cells.
Across the populations of layer 4 RS (n = 25) and FS (n = 24) cells, the widths of the LTN-predicted tuning curve widths for orientation (Fig. 7B) and spatial frequency (Fig. 7E) were a close match to the observed tuning widths. In contrast, the LT-predicted tuning curve width consistently underestimated the observed spike tuning widths. Mean ratios of predicted to observed tuning widths are shown for orientation and spatial frequency in Figure 7, C and F, respectively. In each case, the ratio of the LT prediction to the observed tuning was significantly smaller than 1 (p < 0.001 in each case). In contrast, the ratio of LTN prediction to observed tuning width was not significantly different from 1 (p > 0.05 in each case). Unlike the power-law model (Fig. 5), the LTN model consistently generated accurate predictions of layer 4 FS cell spike output. These results suggest that tuned synaptic noise, in combination with Vm depolarization, is a key element in determining the stimulus selectivity of spike responses, especially in layer 4 FS cells.
Role of membrane properties in stimulus selectivity
We next explored whether the Vm fluctuation differences between layer 4 RS and FS cells are attributable to differences in visually evoked synaptic inputs or to differences in the biophysical properties of the postsynaptic cell. For instance, a shorter time constant would lead to shorter temporal summation, higher rates of membrane potential change (dV/dt), and higher levels of Vm fluctuations in response to similar amounts of synaptic input. We therefore measured the mean dV/dt during stimulation with optimal drifting gratings (see Materials and Methods). Examples from an RS and an FS cell in layer 4 are shown in Figure 8A. In response to a drifting grating stimulus of optimal parameters, the RS cell exhibited large Vm fluctuations that were slow compared with the rapid, sharp fluctuations of the FS cell. Within layer 4, FS cells demonstrated a significantly higher mean dV/dt (0.35 ± 0.02 mV/ms; n = 25) than did RS cells (0.20 ± 0.01 mV/ms; n = 36; p = 0.01). This difference was not replicated in supragranular (RS, 0.29 ± 0.01 mV/ms; FS, 0.28 ± 0.02 mV/ms) or infragranular (RS, 0.26 ± 0.01 mV/ms; FS, 0.27 ± 0.02 mV/ms) layers. We also measured the dV/dt of the membrane potential leading up to spike threshold during visual stimulation. Within layer 4, FS cells demonstrated significantly faster prespike dV/dt (4.0 ± 0.1 mV/ms; n = 25) than RS cells (3.0 ± 0.1 mV/ms; n = 36; p = 0.003) (Fig. 8B). This difference was not observed in supragranular (RS, 4.0 ± 0.1 mV/ms; FS, 3.9 ± 0.2 mV/ms) or infragranular (RS, 3.4 ± 0.1 mV/ms; FS, 3.7 ± 0.2 mV/ms) layers.
The faster Vm fluctuations observed in FS cells in response to synaptic input may be caused by biophysical membrane properties, such as the membrane time constant and input resistance, which play significant roles in determining the dynamics of synaptic integration (Eccles, 1964). We therefore measured these two variables for populations of RS and FS cells in all cortical layers (see Materials and Methods). FS cells in layer 4 demonstrated a significantly lower mean time constant (4.2 ± 0.6 ms; n = 36) than did layer 4 RS cells (6.1 ± 0.5 ms; n = 69; p < 0.001). This difference in membrane time constant was associated with a significantly lower input resistance in layer 4 FS (27.3 ± 9.8MΩ) than RS (47.1 ± 11.4MΩ; p < 0.001) cells. In addition, within layer 4 there was a linear, negative relationship between the membrane time constant and membrane dV/dt during visual stimulation, suggesting that smaller time constants contribute to faster synaptically driven Vm fluctuations (RS, n = 20; FS, n = 14; r2 = 0.45; p < 0.0001) (data not shown). No differences in membrane time constant were observed between RS and FS cells in supragranular (RS, n = 102; FS, n = 24) or infragranular (RS, n = 159; FS, n = 42) layers (p > 0.05 in all cases) (supplemental Table 1, available at www.jneurosci.org as supplemental material).
If biophysical membrane properties contribute to the emergence of stimulus selectivity in layer 4 cells, a relationship between the membrane time constant and the tuning of the cell's spike output is expected. Indeed, within layer 4, membrane time constant values were linearly related to the width of the spike response orientation tuning curve (RS, n = 24; FS, n = 20; r2 = 0.45; p < 0.0001). Cells with smaller time constants demonstrated broader tuning for stimulus orientation.
Together, these results show that layer 4 FS cells exhibit faster membrane time constants than do layer 4 RS cells, and this property is associated with faster visually evoked Vm fluctuations. In turn, the Vm fluctuations determine spike output and result in significantly broader tuning of the spike output of layer 4 FS cells, despite the similar tuning of FS and RS cell mean Vm responses. In summary, our results demonstrate a difference in stimulus selectivity in layer 4 that derives from neuronal biophysical properties, rather than from the selectivity of synaptic inputs.
Discussion
Previous work has suggested that excitatory and inhibitory neurons may play distinct roles in shaping stimulus selectivity in visual cortex. However, their functional properties have not been systematically studied. Here, we compared the stimulus selectivity and membrane properties of excitatory and inhibitory neurons throughout primary visual cortex.
Within layer 4, RS and FS cells exhibited equal orientation and spatial frequency tuning of their Vm responses, but FS cells showed significantly broader spike output tuning. A power law model of the relationship between Vm activity and spike output accurately predicted layer 4 RS cell spike tuning, but consistently underestimated the tuning of FS cells. Because the tuning of synaptic noise (Vm SD) agreed well with that of the spike output, especially in FS cells, we hypothesized that synaptic noise plays a significant role in determining spike response tuning. Indeed, a linear threshold model that incorporated tuned noise accurately predicted spike response tuning for the population of layer 4 FS cells.
We found that layer 4 FS cells have lower membrane time constants and input resistance and faster visually evoked Vm fluctuations than layer 4 RS cells. Thus, the difference in the stimulus selectivity of layer 4 RS and FS spike responses is not driven by differences in the tuning of their overall synaptic input, but rather by cell type-specific differences in biophysical membrane properties that mediate the dynamics of synaptically driven Vm fluctuations.
Tuning of excitation and inhibition in local cortical networks
Theoretical studies of orientation tuning in layer 4 have incorporated untuned or broadly tuned inhibition as an important component underlying either the emergence of selectivity or its contrast invariance (Somers et al., 1995; Troyer et al., 1998; Lauritzen and Miller, 2003; Ringach et al., 2003). In previous work, Hirsch et al. (2003) described four smooth stellate cells in layer 4 that were complex and demonstrated no selectivity for the orientation of moving bars. Similar results were found in layer 2/3 of mouse visual cortex (Sohya et al., 2007). Such cells are obvious candidates to provide untuned inhibition to local cortical circuits. However, the orientation tuning of postsynaptic inhibitory potentials and conductances in layer 4 cells is narrow and indistinguishable from the tuning of postsynaptic excitatory potentials and conductances (Anderson et al., 2000a; Monier et al., 2003). In fact, this relatively narrow tuning of inhibitory synaptic inputs is suggested by the widely accepted push–pull receptive field structure of layer 4 simple cells (Hirsch, 2003; Hirsch and Martinez, 2006). Furthermore, neither elimination of all intracortical activity (Ferster et al., 1996; Chung and Ferster, 1998) nor specific postsynaptic blockage of GABAergic conductances (Nelson et al., 1994) affects orientation tuning. These results suggest that local inhibition, whether tuned or untuned, is not necessary for the expression of orientation selectivity in layer 4.
The data presented here represent the largest population of inhibitory neurons recorded to date in layer 4. We observed no (0 of 21 simple; 0 of 7 complex) untuned layer 4 FS cells. Only one, a simple cell, exhibited spike response tuning broader than 100 degrees. Although we did observe that the orientation tuning of layer 4 FS cell spike output was systematically broader than that of layer 4 RS cells, this difference was relatively small (10°). It therefore seems unlikely that the broader tuning of FS cells plays more than a minor role in directly shaping the tuning curves of RS cells. At most, it may contribute a small sharpening that would likely fall below the resolution of the measurements made in previous studies of cortical inactivation (Nelson et al., 1994; Ferster et al., 1996; Chung and Ferster, 1998). It is difficult to reconcile our results with those of Hirsch et al. (2003). One possibility is that the disparity in observation of untuned FS cells may be attributable to the sampling biases inherent in different intracellular recording techniques. In addition, the small number of previously recorded untuned inhibitory cells (four cells) (Hirsch et al., 2003), in combination with observations that the inhibitory input to simple cells drops to low values at nonpreferred orientations (Ferster, 1986; Anderson et al., 2000a; Martinez et al., 2002), suggests that such cells may be rare. Another factor affecting comparisons between this and other work is the often sparse sampling of tuning curves (22.5° steps) by other studies, which may contribute to misestimation of tuning width.
We found no differences between RS and FS cells in either orientation or spatial frequency tuning outside of layer 4. Furthermore, the tuning of layer 4 RS spike output was significantly narrower than both the Vm and spike responses of target supragranular RS and FS cells, suggesting downstream pooling of the narrowly tuned output of layer 4 cells or recruitment of the extensive recurrent local connections within supragranular layers. It is unlikely that differences in morphological characteristics alone can explain the difference between FS cells in layers 4 and 2/3. The main inhibitory cell type in cat layer 4 is a small basket cell that differs from supragranular basket cells only in size. The morphological differences among inhibitory cells in layer 2/3 are far greater than those between those in layer 2/3 and layer 4, and yet their biophysical properties are homogeneous. Network input, which underlies the functional differentiation of simple and complex neurons, may likewise shape the distinct tuning properties of morphologically similar interneurons in different cortical layers.
Vm fluctuations and visually evoked spike responses.
One mechanism underlying the different spike response tuning of RS and FS cells in layer 4 may be differences in visually driven Vm fluctuations. Previous studies have suggested that Vm fluctuations lead to an increase in sensory-evoked spike responses by eliciting more frequent spike threshold crossings (Anderson et al., 2000b; Azouz and Gray, 2003) and lowering the spike threshold (Azouz and Gray, 2000; Wilent and Contreras, 2005a).
Several previous studies have suggested a power-law relationship between the average Vm and firing rate, in which the transition from subthreshold to suprathreshold activity is smoothed by Vm noise (Hansel and van Vreeswijk, 2002; Miller and Troyer, 2002; Murphy and Miller, 2003; Prescott and De Koninck, 2003; Carandini, 2004; Priebe et al., 2004). The power law fits for FS cells showed higher exponent values than those for RS cells, indicating cell type-specific differences in the slope of the relationship between Vm and firing rate similar to those observed from responses to injected current pulses both in vitro (Connors et al., 1982; Kawaguchi, 1993) and in vivo (Contreras and Palmer, 2003; Nowak et al., 2003). However, higher power law exponents should give rise to narrower, not broader, spike-orientation tuning. Indeed, tuning predictions generated from Vm activity and power-law fits agreed well with the actual spike output of RS cells, but consistently underestimated the width of FS cell spike response tuning. The power law model assumes a static relationship between Vm noise and spike output. However, membrane potential noise, measured as Vm SD, is tuned for visual stimulus parameters. The modified linear threshold model, which accounted for this nonlinear contribution by tuned noise, generated accurate predictions of spike response tuning in both RS and FS cells. Together, these results suggest that the broader tuning of FS cells does not derive from a simple relationship between the overall level of Vm depolarization and firing rate, but is instead greatly influenced by large, fast Vm fluctuations.
Intrinsic biophysical properties contribute to stimulus selectivity.
Passive electrical membrane properties, such as the membrane time constant, play a pivotal role in regulating synaptic integration (Rall, 1957; Coombs et al., 1959; Curtis and Eccles, 1960; Eccles, 1961). In particular, the rate of membrane potential change (dV/dt), which is partly determined by the membrane time constant, influences the complex relationship between Vm depolarization and spike threshold (Azouz and Gray, 2000; Wilent and Contreras, 2005a). Layer 4 FS cells showed lower membrane time constants and input resistance than RS cells. Thus, the observed differences in RS and FS cell Vm fluctuations may be caused by cell type-specific differences in biophysical membrane properties (Spencer and Kandel, 1961; Brown et al., 1981; Connors et al., 1982). Indeed, recent in vitro findings in barrel cortex suggest that lower input resistance and fast time constants may be general properties of layer 4 interneurons (Cruikshank et al., 2007). Layer 4 FS cells receive stronger and more numerous thalamocortical synaptic inputs than layer 4 RS cells (Gibson et al., 1999; Porter et al., 2001; Bruno and Simons, 2002; Cruikshank et al., 2007). These differences offset the lower input resistance of FS cells, leading to synaptic responses of larger amplitude. The different transformation from Vm to spikes in RS and FS cells in layer 4 may be the combined effect of differences in intrinsic biophysical properties and synaptic dynamics. Stimulus selectivity in layer 4 cortical neurons, previously suggested to be predominantly determined by feedforward inputs (Miller, 2003), is thus partly the result of biophysical membrane properties.
In the somatosensory barrel system, suspected interneurons in layer 4 exhibit very broad directional tuning (Swadlow, 1995; Alonso and Swadlow, 2005). However, our data show that this is not the general case for stimulus selectivity in cat primary visual cortex interneurons. A high degree of selectivity, observed in cat and monkey visual cortex, but not in barrel cortex, may require sharply tuned inhibition for fine regulation of spike timing or spike output frequency, rather than overall damping of feedforward excitatory inputs. The results described here may serve to constrain conceptual models of visual cortex that require broadly tuned inhibition.
Footnotes
-
This work was supported by the National Institutes of Health National Eye Institute. We thank Dr. Michael J. Higley for helpful comments on a previous version of this manuscript and Dr. Esther Garcia de Yebenes for histology.
- Correspondence should be addressed to Diego Contreras, Department of Neuroscience, University of Pennsylvania School of Medicine, 215 Stemmler Hall, Philadelphia, PA 19106-6074. diegoc{at}mail.med.upenn.edu