Abstract
The role of gamma oscillations in producing synchronized firing of groups of principal cells is well known. Here, we argue that gamma oscillations have a second function: they select which principal cells fire. This selection process occurs through the interaction of excitation with gamma frequency feedback inhibition. We sought to understand the rules that govern this process. One possibility is that a constant fraction of cells fire. Our analysis shows, however, that the fraction is not robust because it depends on the distribution of excitation to different cells. A robust description is termed E%-max: cells fire if they have suprathreshold excitation (E) within E% of the cell that has maximum excitation. The value of E%-max is approximated by the ratio of the delay of feedback inhibition to the membrane time constant. From measured values, we estimate that E%-max is 5–15%. Thus, an E%-max winner-take-all process can discriminate between groups of cells that have only small differences in excitation. To test the utility of this framework, we analyzed the role of oscillations in V1, one of the few systems in which both spiking and intracellular excitation have been directly measured. We show that an E%-max winner-take-all process provides a simple explanation for why the orientation tuning of firing is narrower than that of the excitatory input and why this difference is not affected by increasing excitation. Because gamma oscillations occur in many brain regions, the framework we have developed for understanding the second function of gamma is likely to have wide applicability.
Introduction
Gamma frequency oscillations were originally discovered in the field potential of visual cortex (Eckhorn et al., 1988; Gray and Singer, 1989) and have subsequently been observed in most brain regions (for review, see Jensen et al., 2007). Such oscillations are thus likely to be a fundamental aspect of neural processing. Analysis of the function of gamma oscillations has focused on the role of oscillations in synchronizing cell firing (Singer and Gray, 1995): rather than firing with a uniform probability over time, networks that display gamma oscillations show clustered firing of principal cells that tends to occur at a particular phase of each gamma cycle (Bragin et al., 1995; Penttonen et al., 1998; Csicsvari et al., 2003). Such synchronization is likely to be functionally important because it allows the detection of this group by coincidence detection in target cells (König et al., 1996). Gamma oscillations are thus thought to be an important aspect of neural processing that provides a way for a group of cells that represents a particular percept or memory to be distinguished from other groups.
Although neurons are synchronized by gamma oscillations, they do not generally fire on every gamma cycle. For instance, in the hippocampus, principal neurons fire during only 2–5% of the gamma cycles [Senior et al. (2008), their Fig. 6]. It is thus important to understand how excitation and inhibition interact to produce this selectivity. Importantly, inhibition itself is modulated at gamma frequency (Soltesz and Deschênes, 1993); indeed, gamma oscillations appears to arise through a feedback process in which principal cells excite interneurons, which then inhibit the principal cells (Miles, 1990; Fisahn et al., 1998; Bartos et al., 2007; Fries et al., 2007; Mann and Paulsen, 2007). This dynamic inhibition not only synchronizes cells but, through interaction with excitation, selects which cells fire.
We have sought to determine whether there are any simple rules that describe this process. It has generally been thought that inhibition selects the most excited cells by performing a type of winner-take-all process. There is clearly more than one winner, and thus a commonly used assumption is that that there are k winners. We have examined this possibility and found that it is not robust. An alternative description (E%-max winner-take-all) is more robust: cells fire in a given gamma cycle if they have excitation (E) within E% of the cell that has maximal excitation. We show that the value E% can be estimated from easily measurable properties. Given how widespread gamma oscillations are in the nervous system, the role of these oscillations in determining which cells fire is of fundamental importance. This E%-max process is not a single-cell process, but rather a network process. In light of the present results, some standard ideas about what causes cells to fire may need to be revised.
Materials and Methods
The network we simulate is shown in Figure 1 and involves a group of identical principal cells that converge onto an interneuron. The interneuron provides feedback inhibition to all principal cells. This inhibition occurs with a delay (d), relative to the spike in the principal cell. In most of the simulations, we adopted a delay period of 3 ms. This feedback inhibition is strong enough to prevent firing; firing can again occur after partial decline of the inhibition. This simple network creates gamma frequency inhibition.
Different excitatory cells receive different excitation from an external source. Principal cells are modeled as simple integrate-and-fire neurons, which have excitatory input current (exc), inhibitory input current (GABA), and an afterhyperpolarization (AHP) current. The voltage Vj of neuron j is defined by the following equation: Here, we use as parameters the average input resistance of CA3 cells (∼Rm = 33 MΩ) (Turner and Schwartzkroin, 1983), the membrane time constant (τm = 30 ms), and the threshold for firing (T = −50 mV). After each spike, voltage is reset instantaneously to the resting potential (Vrest = −65 mV). We use the following parameters: the steady excitatory current Iexc is constant (Aexc = 2 nA); the afterhyperpolarization current (IAHP) has AAHP = −2 nA and τAHP = 17 ms (duration); the inhibitory current IGABA has AGABA= −20 nA and τGABA = 3 ms (duration).
For the simulation, we considered the excitatory input (Iexc) constant over time (see Results for rationale), whereas the other currents are modeled as an instantaneous rise followed by a linear decrease (for consideration of the case in which a component of excitation is rapid, see supplemental material, available at www.jneurosci.org). H(x) is the Heaviside function, where H(x) = 1 if x > 0 and 0 otherwise, and […]+ = xH(x) is the clipped linear function.
In the simulation of orientation selectivity, we consider that the excitatory current to a V1 neuron is given by the following: where Ibasal is an excitatory current strong enough to produce a suprathreshold potential in all neurons; Imax is related to the image contrast, such that the larger the contrast, the larger Imax; G(θ0,θ,σ) is the orientation selectivity function given by the following: where θ0 is the angle with the maximum response and σ is the width of the selectivity function. Finally, ζ is a Gaussian random variable with SD = 0.3 and clipped in the interval −1 and 1. This represents the noise in the system.
For these simulations, the width of the tuning curves is σ ≅ 32°, the values of Imax are 5 and 10 nA (as displayed in Fig. 6A), and Ibasal is 0.5 nA. All simulations and analysis here were made using Matlab (Mathworks).
Results
Our overall goal is to understand how networks with gamma frequency inhibition select which cells fire based on their varying excitatory drive. The simplified circuit that we consider here is shown in Figure 1A. Principal cells receive external input that is purely excitatory. When these cells fire, they excite an interneuron, which inhibits all the principal cells (feedback inhibition). When this inhibition declines sufficiently, firing again occurs. This process repeats indefinitely, thereby generating a gamma frequency oscillation. Experimental results (Miles, 1990) show that feedback inhibition is very rapid, as shown in Figure 1B (we use the value of 3 ms). The use of a single interneuron in our simulations is a reasonable approximation because of several properties of interneuron networks: there is enormous convergence of principal cells onto these interneurons, enormous divergence of the feedback connections from interneurons to principal cells and electrical coupling among the interneurons (Buhl et al., 1994; Cobb et al., 1995; Galarreta and Hestrin, 1999; Tamás et al., 2000; Meyer et al., 2002). Furthermore, interneurons are sensitive enough to fire in response to input from only a single principal cell (Miles, 1990; Gulyás et al., 1993; Marshall et al., 2002; Silberberg and Markram, 2007). The circuit of Figure 1A was simulated as an integrate-and-fire network (see Materials and Methods). The relevant currents are the excitatory input, the feedback inhibition and a brief AHP after each action potential.
A common framework for describing networks with feedback inhibition is as a winner-take-all process. Because it is clear that there is more than one winner in biological networks, the term k-winner-take-all is often used to denote that there are k winners. Under a given set of conditions, this is certainly true, but to be a robust description of the network computation, k should be invariant not only for multiple values of excitation and inhibition but also for different distributions of input excitation (excitation is considered here to be constant over time) (for a similar analysis with time-varying excitation, see supplemental material, available at www.jneurosci.org). To examine whether this is the case, we changed the ratio of excitation and inhibition in our integrate-and-fire model; we also varied the distribution of inputs to different principal cells (Fig. 2A). We found that the number of winners (k) is invariant over a large range of excitation but varies strongly with the distribution of excitation (Fig. 2B). Thus, the concept that a network can robustly select a fixed number of winners is not correct.
To identify a more robust description of the selection process, we considered two cells, N1 and N2, that have only slightly different (10%) excitatory input. The traces in Figure 3B start with the inhibition initiated during the previous gamma cycle. As the IPSP decays with the membrane time constant, N1 reaches threshold first and fires (resulting immediately in an AHP in N1). However, because the IPSP continues to decline, other cells may fire during the brief “vulnerable period” before feedback inhibition arrives. In the example shown in Figure 3, N2, which has only 10% less excitation than N1, continues to depolarize because of decay of the IPSP and almost reaches threshold. However, before it does so, feedback inhibition arrives and prevents N2 from reaching threshold. If feedback inhibition had not arrived, N2 would have fired after a short additional delay (Fig. 3C). However, if excitation of N2 was only 5% less than N1, the depolarization during the vulnerable period reaches threshold, and thus both cells fire (Fig. 3D). This simple example shows that the network can select which cells fire based on small (10%) differences in excitation and that understanding the events during the vulnerable period is crucial.
To quantify the processes during the vulnerable period, we define the effective excitation (E) of a given cell as the excess of voltage above threshold (E = VE − T), where VE is the sum of the excitatory input and intrinsic afterpotentials that result from previous firing. If E < 0, a cell will never fire; if E > 0, cells may fire if the inhibition allows. The cell that fires first during a gamma cycles has excitation Emax; as inhibition declines during the gamma cycle, the last neuron to fire during the vulnerable period has lower excitation, Emin. E%-max is the percentage difference between this lower excitation and that of the maximal excitation. To examine the robustness of E%-max in defining which cells fire, we determined E%-max under various conditions in our integrate-and-fire network. Figure 2C shows that neither scaling the excitation (>10-fold) nor changing the distribution of excitation strongly affected E%-max. Thus, the E%-max description robustly captures a fundamental aspect of the computation.
Analytical estimation of E%-max and its determinants
We next sought to determine what properties of the network determine E%-max. As shown in Figure 1B, firing creates a feedback IPSP in all principal cells of the network. The fall of the IPSP occurs with the membrane time constant, creating a “ramp” in the membrane potential of the principal cell that interacts with synaptic excitation. As the ramp declines, the cell with maximal excitation fires and triggers feedback inhibition. At this moment (t*; defined relative to the onset of inhibition), the following condition is met relating the voltage threshold (T; defined relative to resting potential), the EPSP (VEmax) of the cell, and the IPSP (VGABA): Therefore, the condition to fire can be written as follows: where we define suprathreshold excitation “E” as the difference between the EPSP and threshold as follows: Our goal is to determine the minimal excitation (Emin) necessary for a second cell to fire in the same gamma cycle. Consider that the difference between excitations is Since the feedback inhibition takes d seconds to occur, the second cell will fire if at most Considering that the firing period t* is much larger than the delay (d), we can make a linear approximation as follows: The inhibitory component of the potential is a consequence of the integration of the fast IGABA current across the membrane. We consider that IGABA(t) = 0 by the time the neurons are approaching to their thresholds; therefore, VGABA is decaying exponentially with the membrane time constant τm as follows: Combining Equation 6 with Equations 7, 9, and 10 results in the following: According to Equation 11, E%-max increases with d and decreases with the membrane time constant. Figure 2C, dotted line, shows that Equation 11 correctly predicts the magnitude of E%-max, as determined in our integrate-and-fire network. In Figure 4, the same network is used to verify that E% depends linearly on the delay of feedback inhibition and inversely on the membrane time constant, in accord with Equation 11.
E%-max rule: application to excitation and firing tuning in V1
The process by which gamma oscillations perform an E%-max computation means that the selection of which cells fire is inherently a network process and implies that there is not a direct relationship between the excitatory input and cell firing. Rather, whether a cell fires will depend on the excitation to other cells in the network. In most brain regions, input excitation has not been measured and so the above ideas cannot be related to experimental data. However, in the case of orientation cells of V1, both the orientation tuning of excitation (measured intracellularly) and the orientation tuning of spiking have been measured (Anderson et al., 2000; Carandini and Ferster, 2000; Monier et al., 2003). The results show that the tuning of firing is considerably narrower than the tuning of excitation and that this difference is contrast invariant (unaffected by the increased excitation produced by enhancing the contrast of the stimulus). There has been considerable interest in understanding the mechanism of this invariance, and many models have been proposed (for review, see Ferster and Miller, 2000; Teich and Qian, 2006). However, although both intracellular and field recordings indicate the presence of gamma oscillations (Gray and Singer, 1989; König et al., 1996; Singer and Gray, 1995; Volgushev et al., 2003; Fries et al., 2007) in V1, the specific role of the dynamic inhibition provided by gamma has not previously been considered. It was thus of interest to ask whether an E%-max computation can account for the observed differences in the tuning of excitation and firing.
The tuning of excitation in V1 cells was studied by Carandini and Ferster (2000) and is illustrated in Figures 5 and 6A. Each cell responds maximally to some degree of orientation (around 135° for the graphs shown in Figs. 5, 6A), but the same cell also shows some level of excitation for a range of other orientations (between 45 and 225° for the examples here). Anderson et al. (2000) showed that the tuning of spiking is sharper than the tuning of excitation; specifically, the half-width at half-height of the tuning of spiking was around 23° compared with 38° for the EPSP. Importantly, this narrow tuning of spiking was not changed when the contrast of the visual stimuli was increased. As discussed by Carandini and Ferster (2000), feedforward models with fixed threshold are unable to reproduce this independence of contrast; in such models (Fig. 5), tuning can be sharpened because of a threshold for firing, a phenomenon termed the “iceberg” effect. However, an important property of this iceberg effect is that the sharpening is reduced by increasing the overall level of excitation (by increasing contrast).
To examine how gamma frequency inhibition affects orientation selectivity, we modified our integrate-and-fire network to have orientation-selective input to each principal cell. In these simulations, the network was composed of 100 neurons, each with slightly different optimal orientation (evenly spaced between 0 and 270°). E%-max was set at 10%. We ran the network for many gamma cycles using two levels of contrast (Fig. 6A). Figure 6B shows that the probability of spiking per gamma depended on stimulus orientation (300 trials). Similar to the experimental results (Anderson et al., 2000), the neurons in the simulated network displayed a sharper orientation tuning for spikes than for input excitation: the half-width at half-height of the excitation tuning is 37° (Fig. 6A), whereas the same measure for spike tuning is 16.5° (this value would be slightly higher if more noise was assumed). Importantly, the tuning of spikes was practically unchanged when the excitatory input was doubled (Fig. 6B). A selection process based on gamma frequency inhibition can thus account for the contrast invariance of orientation tuning.
We emphasize that we have kept this model as simple as possible to isolate the computational capabilities of feedback inhibition. Other forms of synaptic input (feedforward inhibition from both “on” and “off” cells; recurrent excitation) are necessary to account for the full complexities of the response of V1 cells, including the response to moving stimuli (Ferster and Miller, 2000).
Discussion
Almost all work to date on the functional role of gamma oscillations has focused on the production of synchronized firing (Bragin et al., 1995; Singer and Gray, 1995; Penttonen et al., 1998; Csicsvari et al., 2003). We argue that a second function of gamma, the selection of which cells fire, is equally important. It has been experimentally shown that only a fraction of cells fire on each gamma cycle (Senior et al., 2008), but the mechanism that determines which cells fire has been unclear. Our work indicates that this selection is a type of winner-take-all process that follows directly from the properties of the feedback inhibition that underlies gamma frequency oscillations.
We have sought to find a simple quantitative description of this winner-take-all process and have found that several descriptions are not correct. There is no single winner, and so the winner-take-all concept cannot be taken literally. Nor will a network determine a fixed number of winners, independent of the input distribution. We find, however, that a simple rule approximates the selection process: cells will fire if their suprathreshold excitation (E) is within E% of the cell that receives maximal excitation. We term this an E%-max winner-take-all-process. As shown in Figure 2C, E%-max holds over a considerable range as the excitatory inputs to the network are scaled relative to inhibition. Furthermore, E%-max is not altered by changing the distribution of excitation in the different cells (relative to the cell with maximal excitation). Thus, the E%-max computation is robust. Because E%-max rule does not depend on the exact ratio of excitation to inhibition, it can be applied to cases in which this ratio is not known. The companion study (de Almeida et al., 2009) applies the rule to calculate properties of hippocampal place fields. In contrast to previous work (Rolls et al., 2006), in which the percentage of cells with place fields was used as a way to arbitrarily set inhibition, the E%-max rule allows the calculation of this percentage from theoretical considerations (without knowing the exact value of inhibition), which can then be compared with the observed value.
Determinants of E%-max
We have shown by simulation and theory that E%-max is determined by the ratio of the delay of feedback inhibition (d) to the membrane time constant (τm). This functional dependence can be understood intuitively as follows (see also Fig. 3). When gamma-mediated inhibition is maximal, cells will be below threshold. The gradual decay of inhibition creates a ramp, which can be view as “searching” for the neuron with maximal excitation; this will be the first cell to fire and trigger feedback inhibition (Miles, 1990; Gulyás et al., 1993; Marshall et al., 2002). This inhibition occurs within 2–3 ms, and it is this delay that creates a vulnerable period during which cells with less than maximal excitation can fire. The more inhibition declines during the vulnerable period, the more likely it is that cells with less inhibition will fire: thus selectivity decreases as the delay increases. Selectivity is also decreased if the decay of inhibition (membrane time constant) becomes faster. Based on experimental values for d and τm in the hippocampus, we estimate that E%-max is in the range of 5–15%. This is a small fraction of excitation and indicates that the selection process can make fine discriminations.
We emphasize that the rules we have developed are meant only as a first-order approximation and that the operation of feedback networks will depend on additional factors that we have not taken into consideration. These include the variability of delays in feedback inhibition, the opening kinetics of inhibitory and excitatory channels, and the limited spatial spread of feedback inhibition in the network. Furthermore, the excitatory input to inhibitory cells may often be enhanced by convergent inputs from multiple principal cells, a summation process that we have not modeled. In most of our calculations, we have assumed that excitation varies slowly with respect to gamma. This assumption may be valid when the stimulus is slowly changing, but may be invalid when a network receives a brief pulse of synchronized input. In the supplemental material (available at www.jneurosci.org), we examine the case in which excitation has both steady and fast components and show that the E%-max rule and Equation 11 still apply. Another assumption in our calculations is the choice of a fast AHP. Different cell types have different duration afterpotentials, often depending on neuromodulatory state (Storm, 1987, 1989). Moreover, in some cells, the afterpotential can be depolarizaing rather than hyperpolarizing (Storm, 1989; Andrade, 1991; Araneda and Andrade, 1991; Caeser et al., 1993). These afterpotentials will contribute to the suprathreshold excitation of the cell. Under these conditions, E%-max can still be usefully applied to determine which cells fire, so long as it is understood that both internal and external processes contribute to the effective excitation. Indeed, afterpotentials may account for important properties of firing. For instance, a long AHP would prevent a cell from firing on sequential gamma cycles, even if the external excitatory drive stays constant. Alternatively, if there is an afterdepolarization, a cell that fired once would be particularly likely to fire again, a process that may underlie working memory (Lisman and Idiart, 1995; Klink and Alonso, 1997).
Implications for neural computation
Because analysis of spiking in functional circuits is generally done with extracellular recording, the tuning of the EPSP is usually not known. However, in the case of orientation-selective cells of V1, intracellular recordings have been achieved. Orientation selectivity appears to depend on two mechanisms: a process of connectivity, which makes the input EPSP somewhat orientation selective (Reid and Alonso, 1995), and a second process dependent on inhibition (Sillito, 1975; Troyer et al., 1998; Carandini and Ferster, 2000). This second mechanism makes the orientation tuning of spiking narrower than that of the EPSP. Moreover, this narrowing is not affected by scaling up the excitation, a finding inconsistent with models based on fixed inhibition. Consequently, the narrowing of tuning cannot be explained by the iceberg effect (Fig. 5). Intracellular recordings provide direct evidence for gamma frequency inhibition in orientation-sensitive V1 cells (Volgushev et al., 2003). We show here (Fig. 6) that an E%-max computation produced by such oscillations can explain why the orientation tuning of spiking is narrower than that of the EPSP and why this difference is contrast invariant. Thus, there will be orientations in which a cell receives substantial excitation (sufficient to make the cell fire in the absence of inhibition) but in which firing is suppressed by feedback inhibition triggered by cells that that are slightly more excited by the stimulus.
A second system in which the E%-max winner-take-all computation is likely to be important is the formation of place cells in the hippocampus (de Almeida et al., 2009). The input to place cells is from grid cells of the entorhinal cortex, which are active (with spatial periodicity), over broad regions of the environment. Nevertheless, hippocampal cells are active only in very restricted regions of the environment. We show in a companion study (de Almeida et al., 2009) that, despite the broad excitation, the E%-max mechanism can select winners that are only slightly more excited than other cells in the network and that cells are winners in a relatively small region of the environment, thereby accounting for their place cell properties.
More generally, the winner-take-all function (and the specific E%-max form considered here) requires a change in the conceptual understanding of how firing is controlled. According to textbook accounts, firing can be understood as a single-cell property; firing rate is determined by how far the net excitation is above threshold. Based on this, if excitation x causes firing, excitation 2x in another context should also cause firing. However, this is not necessarily correct in networks with feedback inhibition. If, for example there are other cells in the second context that have 3x excitation, the cell with 2x excitation may not be among the winners. This simple example demonstrates that firing in networks with winner-take-all gamma-frequency inhibition cannot be derived from the excitation of a given cell, but is rather a result of a competitive network computation in which all cells must be considered.
Footnotes
-
This work was supported by National Institute of Mental Health Grant MH060450, National Institute of Neurological Disorders and Stroke Grant NS27337, and European Commission Project 217148. M.I. and L.d.A. acknowledge partial financial support from Brazilian agencies Conselho Nacional de Desenvolvimento Científico e Tecnológico and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior. We thank Ole Jensen and Sridhar Raghavachari for comments on this manuscript.
- Correspondence should be addressed to John E. Lisman, Department of Biology and Volen Center for Complex Systems, Brandeis University, 145 South Street, Waltham, MA 02454. lisman{at}brandeis.edu