Abstract
The primate retina communicates visual information to the brain via a set of parallel pathways that originate from at least 22 anatomically distinct types of retinal ganglion cells. Knowledge of the physiological properties of these ganglion cell types is of critical importance for understanding the functioning of the primate visual system. Nonetheless, the physiological properties of only a handful of retinal ganglion cell types have been studied in detail. Here we show, using a newly developed multielectrode array system for the large-scale recording of neural activity, the existence of a physiologically distinct population of ganglion cells in the primate retina with distinctive visual response properties. These cells, which we will refer to as upsilon cells, are characterized by large receptive fields, rapid and transient responses to light, and significant nonlinearities in their spatial summation. Based on the measured properties of these cells, we speculate that they correspond to the smooth/large radiate cells recently identified morphologically in the primate retina and may therefore provide visual input to both the lateral geniculate nucleus and the superior colliculus. We further speculate that the upsilon cells may be the primate retina's counterparts of the Y-cells observed in the cat and other mammalian species.
Introduction
Although decades of work have been devoted to the study of primate retinal architecture, fewer than one-half of the 22 or more anatomically identified types of retinal ganglion cells (RGCs) (Rodieck and Watanabe, 1993; Dacey et al., 2003, 2005; Dacey, 2004; Yamada et al., 2005) have been characterized physiologically, namely the ON and OFF midget and parasol cells (Dacey, 1999; Chichilnisky and Kalmar, 2002), the small bistratified cells (Dacey and Lee, 1994; Chichilnisky and Baylor, 1999), the recently discovered giant sparse (melanopsin-expressing) cells (Dacey et al., 2003, 2005; Dacey, 2004), and, to some extent, the large bistratified cells and the cells of type “sparse” (Dacey et al., 2003; Dacey, 2004). One of the possible reasons for this discrepancy is that the morphological RGC types awaiting physiological characterization constitute only a small fraction of all the primate ganglion cells (1–4%) [see Dacey (2004), their table 20.1]. This makes them difficult to encounter and identify with standard single-electrode recording or small-scale multielectrode array methods.
To search for new physiological cell types in the primate retina, we used a newly developed large-area, high-density multielectrode array system with 512 electrodes, capable of recording the extracellular spiking activity of several hundred RGCs in a single in vitro preparation (Litke et al., 2004). We recorded the RGC activity in isolated pieces of macaque monkey retina, as it responded to a dynamic visual stimulus focused on the photoreceptors.
The main result of these studies is the identification and characterization of a new functional type of primate RGC. These cells have large receptive fields, rapid and transient light responses, and highly nonlinear spatial summation. Cells with similar visual response properties, the Y-cells, were first observed more than forty years ago in the cat retina (Enroth-Cugell and Robson, 1966). The original observation of the cat Y-cells was followed by a decades-long search for the counterpart of these cells both in the primate retina and in the primate lateral geniculate nucleus (LGN) (de Monasterio, 1978; Kaplan and Shapley, 1982; Derrington and Lennie, 1984; Blakemore and Vital-Durand, 1986; Benardete et al., 1992; Levitt et al., 2001; White et al., 2002). Until this report, no clear evidence had been found for a distinct type of Y-like retinal ganglion cell in the primate.
Materials and Methods
Preparations.
Eyes were obtained from three deeply and terminally anesthetized macaque monkeys (Macaca mulatta) used by other experimenters, in accordance with institutional guidelines for the care and use of animals. The data reported here came from three retinal preparations (1–3), each from a different macaque monkey. The macaques for preparations 1 and 3 were experimentally infected with the simian immunodeficiency virus (Roberts et al., 2006); the macaque for preparation 2 was uninfected. No differences were observed in the retinal response properties for the preparations from the infected macaques compared with the uninfected macaque.
Immediately after enucleation, the anterior portion of the eye and vitreous were removed in room light, and the eye cup was placed in bicarbonate buffered Ames' solution (Sigma-Aldrich, St. Louis, MO) and stored in darkness at 32–33°C, pH 7.4, for >20 min before dissection. Under infrared illumination, pieces of peripheral retina at eccentricities of ∼12 mm, 3–5 mm in diameter, isolated from the retinal pigment epithelium, were placed flat, ganglion cell layer down, against a planar array of extracellular microelectrodes. These electrodes were used to record action potentials from spiking retinal cells. Each preparation was perfused with Ames' solution bubbled with 95% O2–5% CO2, with pH 7.4. The three preparations (1–3) were maintained, respectively, at 34–35°C, 32–33°C, and 36–37°C.
Multielectrode array system.
The multielectrode array system has been described by Litke et al. (2004). The electrode array had 512 extracellular microelectrodes arranged in a 32 × 16 rectangular pattern, with overall dimensions of 1890 × 900 μm2, covering an active area of 1.7 mm2. The electrode-to-electrode spacing was 60 μm, and each electrode was 5 μm in diameter. The analog signal from each electrode was sampled at 20 kHz. This system has been used previously to study the encoding of visual motion and multineuron firing patterns in mosaics of parasol cells in the primate retina (Frechette et al., 2005; Shlens et al., 2006).
Neuron identification.
Individual neurons were identified as has been described previously (Litke et al., 2004). Briefly, a spike was identified on an electrode (the “seed” electrode) as a signal amplitude that exceeded a threshold of three times the rms noise. For each spike, a 182-dimensional vector was constructed from the analog waveform measurements on the seed electrode and its six nearest neighbors. Each waveform consisted of the 26 analog samples for the period 0.5 ms before the spike to 0.8 ms after the spike. Principal components analysis (PCA) was used to extract the five most significant variables for spike sorting. Multidimensional clustering was performed on these variables, using the expectation maximization algorithm, to find neuron candidates. Duplicate neuron candidates were eliminated by spike train cross-correlation. The absence of a refractory period in the autocorrelation function was used to exclude neuron candidates contaminated by other neurons.
Stimuli.
The retina was stimulated with the optically reduced (2.9 mm diameter) image of a cathode ray tube display refreshed at 120 Hz, focused on the photoreceptor layer by a microscope objective, and centered on the electrode array. Stimuli were attenuated to low photopic light levels using neutral density filters and presented as modulations around a mean gray background level. The light flux at the gray background level was equivalent to 4400, 4300, and 2200 quanta · μm−2 · s−1 produced by monochromatic sources of wavelengths 561, 530, and 430 nm, respectively.
Several visual stimuli were used to classify and characterize each identified neuron. These stimuli included the following: (1) spatiotemporal white noise to measure spatial receptive fields, temporal filtering, and receptive field mosaics; (2) diffuse light steps to measure response kinetics; (3) S-cone-isolating spatiotemporal white noise to evaluate S-cone input; and (4) contrast-reversing gratings (CRGs) to measure the nonlinearity of spatial summation.
The spatiotemporal red, green, and blue (RGB) white noise stimulus consisted of 32 × 16 square pixels with each pixel 116 μm on a side, refreshed every 8.33 ms. The relative intensity level for each pixel, in each frame, for each of the three color guns (RGB) of the display monitor, was set randomly above or below the 0.5 mean background level at 0.5 ± 0.48. The corresponding contrast, (Imax − Imin)/(Imax + Imin), was therefore 96%, where Imax and Imin are the maximum and minimum intensities, respectively.
The diffuse light stimulus was a repeating pattern of four full-field frames of white, gray, black, and gray, with relative intensities, respectively, of 0.98, 0.5, 0.02, and 0.5. Each frame had a duration of 2 s, and the contrast was 96%.
The S-cone-isolating S/(L + M) white noise stimulus independently modulated the photon absorption in the S-cones and the summed photon absorptions in the L and M cones. This stimulus was obtained by the method of silent substitution (for review, see Estevez and Spekreijse, 1982) and was based on the cone absorption spectra for the macaque monkey (Baylor et al., 1987) and the measured intensity spectrum generated by each of the three color guns. The stimulus contrast was matched at ∼40% for the S-cone-isolating and (L + M)-cone-isolating stimuli. The cross talk between these two stimuli was estimated to be 0 ± 4%, based on measurements of the light intensity transmitted through filters with filtering characteristics similar to the S, M, and L cone absorption spectra. The S/(L + M) white noise stimulus was applied only to the second preparation.
The CRG stimuli were spatial sinusoidal gratings with amplitudes that were sinusoidally modulated in time. The CRG stimuli had a temporal frequency of 4 Hz, 10 spatial frequencies ranging from 0.2 to 28.7 cycles/mm, and, for each spatial frequency, eight spatial phases ranging from 0 to 315° in 45° steps. The contrast for all gratings was 96%.
Spatial and temporal filtering characterization of neurons.
The spatial and temporal visual image filtering characteristics of each identified neuron were measured from its spike-triggered average (STA), when the retina was exposed to the RGB white noise stimulus. The STA for each neuron was calculated by averaging, over all spikes generated by the neuron, the sequence of stimulus frames preceding each spike (Sakai et al., 1988; Chichilnisky, 2001).
The receptive field (RF) location, size, and orientation (e.g., see Fig. 3a) were based on the STA frame most significantly different from the mean (gray) background level. This frame was fit with a two-dimensional generalized Gaussian: where A is the amplitude, x0 and y0 are the locations of the Gaussian center, σx and σy are the semi-axes, and φ is the orientation angle of the one σ contour ellipse. The RF diameter D is defined as the diameter of the circle with the same area as the one σ contour ellipse of the Gaussian: D = 2(σxσy)1/2. The surround was not included in the RF fit because, in most cases, the stimulus used did not significantly activate the surround mechanism in the recorded cells. Furthermore, this representation assumes that the STA is separable in space and time.
The red, green, and blue time filters of each neuron were estimated by the STA time courses corresponding to the three color guns of the stimulus display monitor (e.g., see Fig. 3f). They were calculated by averaging together the time courses of all STA pixels significantly different from the mean intensity level (a significant pixel was defined as one that differs in absolute value from the mean level by more than three times its rms noise).
Time filter classification parameters.
The two time filter classification parameters of each neuron in a collection of neurons (referred to as PC1 and PC2 in Fig. 1b) were defined as the two most significant principal components obtained by performing PCA on the time filter vectors of all neurons in the collection. The time filter vector for a given neuron was obtained by concatenating its red, green, and blue time filters.
Electrophysiological imaging.
The electrophysiological image (EI) is a dynamic image of the average electrical activity of a neuron at the time it generates a spike, projected onto the two-dimensional plane of the electrode array (Litke et al., 2004). The EI reveals the signals generated by the cell body and the dendritic arbor and shows the signal propagation down the axon(s). It is calculated by averaging the analog waveforms recorded on each of the 512 electrodes every time the neuron generates a spike. The EI was used to categorize the identified neurons as either ganglion cells, if they possessed a single axon, or polyaxonal amacrine cells, if they possessed an axonal arbor.
For some cells, no axon or axonal arbor was detected. In all cases, these cells were located close to the border of the electrode array, in a position such that a ganglion cell's axon would be outside of the array's active area. Moreover, these cells had the same physiological properties as the cells in the same classification cluster that possessed clearly visible single axons (see the functional classification of neurons in Results). Therefore, we concluded that these cells were ganglion cells.
Response latency.
As a measure of the RF center response latency for an RGC, we used the zero crossing time (tzero) of the STA time filters (Chichilnisky and Kalmar, 2002). In a linear–nonlinear response model (Chichilnisky, 2001), when the cell is stimulated by a small-amplitude light step presented over the RF center, tzero corresponds to the time from the onset of the light step to the peak of the response.
The calculation of tzero was based on a fit of the time filter T of each neuron as a sum of two filters: where t is the time before a spike, τ1 and τ2 are the time constants of the two filters, a1 and a2 are their amplitudes, and n1 and n2 are the number of stages of the filters. This function fits all time filters well.
The green time filter was used for the tzero calculation, because it provides the best measurement as a result of the high signal amplitude. The red or blue time filters can be used instead without any significant difference in the results.
Results
Functional classification of neurons
The data shown in this study were recorded from three retinal preparations (1–3) obtained from three different macaque monkeys (see Materials and Methods). Individual neurons were identified in each preparation and were classified into separate functional cell types, based on their responses to a randomly flickering white noise checkerboard stimulus.
Figure 1a–c illustrates the functional classification technique applied to the 257 neurons identified in preparation 1. (Neurons in the other two preparations were classified using similar techniques.) The neurons were first classified as ON-center or OFF-center, based on the sign of the amplitude of the primary peak of the sum of the red, green, and blue STA time filters (the peak closest to zero on the time axis); the amplitude histogram is shown in Figure 1a. The cells with positive amplitudes are classified as ON cells, whereas the cells with negative amplitudes are classified as OFF cells. Only the OFF cells will be subject to additional analysis in the remainder of this paper.
The OFF cells, in turn, were classified into four distinct clusters, shown in Figure 1b, according to the detailed shape of their time filters. The shape of each time filter was parameterized by the two most significant principal components, obtained by performing principal components analysis on the combined time filters of all OFF neurons (see Materials and Methods). Two of the clusters were identified as parasol and midget cells by their RF diameters and their response kinetics (Chichilnisky and Kalmar, 2002). The two remaining clusters, C1 and C2, were of unknown cell type.
The cluster C2 contains a small number of cells. Moreover, this cluster does not represent a unique cell type; the cells it contains differ from each other if additional classification criteria (autocorrelation function, response to contrast-reversing gratings, and response to diffuse light steps) are used. Therefore, we will not discuss these cells further.
The neurons in cluster C1 can be divided into two separate clusters, C1A and C1B, by their RF diameters (Fig. 1c). The cells in these clusters could be ganglion cells or, possibly, spiking amacrine cells (Stafford and Dacey, 1997; Volgyi et al., 2001). A ganglion cell can be identified by the presence of a single axon. A polyaxonal spiking amacrine cell, on the other hand, can be identified by the presence of an extensive axonal arbor. To image the axonal structure of each neuron, and thereby make possible the separation of ganglion cells from polyaxonal spiking amacrine cells, we used EI analysis (Litke et al., 2004) that provides a detailed spatiotemporal image of the electrical activity of the neuron (see Materials and Methods).
The EIs show that the neurons in the cluster C1A possess an axonal structure consistent with that expected for ganglion cells. In contrast, the EIs reveal that the neurons associated with cluster C1B do not have single axons but instead have an axonal arbor structure compatible with that expected for polyaxonal spiking amacrine cells. Example EIs of a parasol cell, a member of the C1A cluster (later termed an upsilon cell), and a member of the C1B cluster, are shown in Figure 2. The dynamic electrical activity of these cells can also be viewed with animated versions of their EIs (supplemental movies, available at www.jneurosci.org as supplemental material).
Visual response properties of the upsilon cells
In what follows, only the members of the C1A ganglion cell cluster will be considered. We examine the characteristics of their spatial RFs, response kinetics (including response to diffuse light steps and temporal filtering properties), and the linearity of spatial summation. Corresponding analyses are performed on the other two preparations. (An additional study of the strength of the S-cone inputs to the upsilon cells was performed only with the second preparation.)
Based on the uniformity of their properties, as indicated below, both within each preparation as well as from preparation to preparation, we infer that the cells in the C1A cluster (and the corresponding cells in preparations 2 and 3) constitute a single physiological type. Additional evidence for a single cell type, based on the RF mosaics, will also be provided. We will refer to this RGC type as “upsilon.” (This name choice will be explained in the discussion section.)
Receptive field diameters and mosaics
Figure 3b–d shows, for each preparation, the RF mosaics and average RF diameters of the RGCs identified as OFF parasol cells or OFF upsilon cells. The average ratio between the RF diameters of the OFF upsilon and OFF parasol cells is 2.9 ± 0.4 (mean ± SD). The RFs of the parasol cells as well as the upsilon cells form almost complete mosaics; the few holes are possibly caused by small inefficiencies in our recording methods. The presence of these upsilon mosaics with nonoverlapping RFs lends additional support to the idea that the upsilon cells constitute a single physiological (and presumably morphological) ganglion cell type (Peichl, 1991).
Response kinetics
As a first step in characterizing the response kinetics, we recorded the response of each cell to a repeating pattern of diffuse light steps, with intensity increments or decrements in 2 s intervals (Fig. 3e) (see Materials and Methods). The parasol cells responded with a strong, transient increase in spike rate to light intensity decrements (as expected for an OFF-type cell) and had a weak response to light intensity increments. In addition, the spike rate did not return to its previous value after the transient responses, indicating that the parasol cells possess both transient and sustained responses. (A sustained cell responds to a light step with a change in spike rate for the 2 s duration of the light step.)
The upsilon cells responded strongly to decrements in light intensity with a transient increase in spike rate but did not respond at all to light intensity increments. In contrast to parasol cells, the spike rate returned to its previous resting discharge rate (mean ± SD = 0.1 ± 0.2 spikes/s for the data shown in Fig. 3e, right) after the transient responses, indicating that the upsilon cells are highly transient.
The temporal filtering properties of the parasol and upsilon cells (in preparation 1) are indicated by the STA time filters shown in Figure 3f. As a measure of each neuron's response latency, we have used tzero of the STA time filter (Chichilnisky and Kalmar, 2002) (see Materials and Methods). The average tzero values (mean ± SD) for the parasol cells in preparations 1–3 are, respectively, 53.8 ± 2.0, 75.4 ± 3.8, and 50.2 ± 3.8 ms, with n = 52, 86, and 114. The corresponding values for the upsilon cells are 67.3 ± 3.6, 70.1 ± 4.9, and 56.1 ± 2.6 ms, with n = 5, 10, and 8. For completeness, we include also the tzero values for the OFF-midget cells in the three preparations; these are 80.5 ± 7.5, 106.9 ± 6.7, and 76.2 ± 8.8 ms, with n = 67, 15, and 28. The ratio of the average tzero for the upsilon cells to the average tzero for the parasol cells, averaged over the three preparations, is 1.10 ± 0.16 (mean ± SD). The corresponding upsilon-to-midget ratio is 0.74 ± 0.09. Therefore, the upsilon cells have response latencies similar to those of the simultaneously recorded parasol cells but shorter than those of the simultaneously recorded midget cells.
Color response properties: S-cone inputs
Because of recent observations indicating that some large-field ganglion cells possess selective connectivity to S-cones (Dacey and Packer, 2003; Dacey et al., 2003, 2005; Dacey, 2004), we investigated the S-cone input to the upsilon cells by using the cone-isolating S/(L + M) stimulus (see Materials and Methods). The response of the upsilon cells to this stimulus was dominated by the (L + M)-cone OFF component, as shown in Figure 4. These data indicate that the S-cone input to the upsilon cells is a very small fraction of their total input. This fraction is measured to be 4.4 ± 4.1%, based on the peak of the S-cone time filter response relative to the peak of the S-cone plus (L + M)-cone response, and including an estimated cross talk of 0 ± 4% between the S and (L + M) cone-isolating stimuli. For comparison, the corresponding S-cone fraction measured for S-ON/(L + M)-OFF cells [most likely small bistratified blue/yellow color opponent cells (Dacey and Lee, 1994)] in the same preparation is 48.4 ± 16.6%, indicating that the S-cones were functioning normally (data not shown). (This insignificant S-cone input to upsilon cells, measured in the presence of simultaneously recorded S-ON/(L + M)-OFF cells, was also observed in several preparations that contained only a few upsilon cells and therefore were not included in this paper.) The combination of a strong (L + M)-cone OFF response and weak-to-negligible S-cone OFF response differentiates the upsilon cells from the large bistratified, sparse, and giant sparse (melanopsin-expressing) large-field cells, all with significant S-cone/(L + M)-cone opponent responses (Dacey and Packer, 2003; Dacey et al., 2003, 2005; Dacey, 2004).
Linearity of spatial summation
To gain insight into how the visual input signals get summed spatially by the upsilon cells, we have investigated their spatial summation properties and compared these properties with those of the midget and parasol cells. An RGC that sums the visual input linearly over its RF is known as an X-like cell, whereas an RGC that exhibits nonlinear spatial summation is known as a Y-like cell (Enroth-Cugell and Robson, 1966). It is thought that the receptive fields of Y-like cells are constructed by pooling excitation from an array of small nonlinear spatial subunits (Hochstein and Shapley, 1976b; Shapley and Victor, 1979; Victor and Shapley, 1979; Demb et al., 1999, 2001a), and it has been proposed that these subunits are bipolar cells (Victor and Shapley, 1979; Demb et al., 1999, 2001a).
We have studied this nonlinearity in the classical way by recording the responses of neurons to CRGs (Hochstein and Shapley, 1976a,b; Derrington and Lennie, 1984). The CRG is a spatial sinusoidal grating with a contrast that is modulated sinusoidally in time (see Materials and Methods). Midget, parasol, and upsilon cell average responses to one period of low and high spatial frequency CRGs at four spatial phases, in preparation 1, are shown in Figure 5a–c, respectively.
The responses to the CRGs of low spatial frequency (0.2 cycles/mm) are modulated in time mainly at the temporal frequency of the stimulus (implying a significant first Fourier harmonic, F1), because at low spatial frequencies, all the subunits are stimulated in phase. Additionally, the F1 response amplitude varies sinusoidally as a function of the spatial phase of the CRG, with null responses at two spatial phases separated by 180° (shown as 90 and 270° in Fig. 5a–c).
At high spatial frequency (4.8 cycles/mm), the midget cells still respond mainly at the temporal frequency of the stimulus, indicating that spatial summation in their RFs remains dominantly linear (that is, they exhibit X-like behavior). The upsilon cells, however, display a response modulated mainly at twice the temporal frequency of the stimulus (two peaks per stimulus period, corresponding to a significant second Fourier harmonic, F2; this is the classical signature for Y-like cells). This frequency doubling response may arise because approximately one-half of the nonlinear subunits are stimulated in counterphase relative to the other half of the subunits, assuming the nonlinearity is half-wave rectification [as supported for Y-cells by experimental data (Demb et al., 2001a)]. This nonlinear spatial summation leads to an F2 response only weakly dependent on spatial phase, as can be seen in Figure 5c (right column).
The parasol cells also show an F2 response at the high spatial frequency (4.8 cycles/mm), as is apparent in Figure 5b (right column). However, the F2 response coexists with an F1 response of similar amplitude, which explains the stronger dependence on spatial phase.
Figures 5a–c illustrate qualitatively the different nonlinear response properties of single midget, parasol, and upsilon cells at two spatial frequencies. To more completely characterize and quantify the nonlinear spatial summation response properties of these cell types, we measured the F1 and F2 Fourier harmonics, as a function of spatial frequency, for all the midget, parasol and upsilon cells in the three preparations. The harmonics of the response of each cell, under the different CRG stimuli (see Materials and Methods), were calculated by temporal Fourier transforms (Hochstein and Shapley, 1976a). The spatial frequency dependence of the F1 and F2 responses were obtained after taking account of the phase dependence. This was done by taking the maximum value of F1 and the mean value of F2 over all spatial phases (Hochstein and Shapley, 1976a). Figure 5d–f shows the spatial frequency tuning curves of the average F1 and F2 harmonics for the three cell types in each of the three preparations.
The midget cell responses are dominated by the F1 harmonic at all measured spatial frequencies (up to 28.7 cycles/mm) (Fig. 5d), indicating that they are X-like cells in terms of spatial summation. In contrast, both parasol and upsilon cells have large F2 harmonics at high spatial frequencies (i.e., they show frequency doubling responses) (Fig. 5e,f) and therefore possess significant nonlinearities in their spatial summation.
We have quantified the level of nonlinearity of spatial summation with the maximum value of the ratio between the second and the first Fourier harmonic (F2/F1) over all spatial frequencies recorded. This value is called the nonlinearity index (Hochstein and Shapley, 1976a).
Histograms of the nonlinearity indices for midget, parasol, and upsilon cells are shown in Figure 6. These histograms show significant overlap between the nonlinearity indices of midget and parasol cells and also between those of parasol and upsilon cells. However, on average, the parasol cells have ∼3.8 times larger nonlinearity indices than the midget cells, and the upsilon cells have ∼2.6 times larger nonlinearity indices than the parasol cells. In summary, we find that the upsilon cells have Y-like properties, with the nonlinear spatial summation response (F2) dominating the linear response (F1) over a broad range of spatial frequencies.
Comparison of the nonlinearity index results with previous studies
In this subsection, we will compare the nonlinearity index distributions, as shown in Figure 6, with the previously measured results in macaque retina (White et al., 2002) and in macaque LGN (Derrington and Lennie, 1984; Levitt et al., 2001). The parasol and upsilon nonlinearity distributions shown in Figure 6 have long tails that extend to 12–13. These are different from the distributions shown by Derrington and Lennie (1984), Levitt et al. (2001), and White et al. (2002), which extend only up to values of 2–4. We think this discrepancy is attributable to differences in the methods used to calculate the nonlinearity index.
We followed the method suggested by Hochstein and Shapley (1976a) and defined the nonlinearity index as the maximum value of the F2/F1 ratio over a wide range of spatial frequencies. In the experiments of Derrington and Lennie (1984) and White et al. (2002), the nonlinearity index was calculated at 1.5–2 times the spatial frequency of maximum response. However, the experimental data for parasols shown in Figure 5e indicate that the peak response is ∼2 cycles/mm and that measuring the nonlinearity index at twice this frequency (4 cycles/mm) falls short of the most nonlinear frequency domain (∼7 cycles/mm). Likewise, for the upsilon data shown in Figure 5f, the peak response is in the range 0.3–1 cycles/mm, and measuring the nonlinearity index at twice these values (0.6–2 cycles/mm) again falls short of the most nonlinear region (3–7 cycles/mm). By restricting the nonlinearity index measurement to spatial frequencies not sufficiently high, the nonlinearity indices were possibly underestimated.
In the experiment of Levitt et al. (2001), spatial frequencies that had small F1 responses were ignored, even when there was a significant F2 response. Again, this has the potential to underestimate the nonlinearity index. We suggest that these factors may explain the differences between the distributions in this study and the results of the previous studies.
Discussion
The main result presented in this paper is the identification and characterization of a new physiological type of ganglion cell in the primate retina. We refer to this new type as OFF upsilon. These cells are characterized by large receptive fields (with approximately three times the RF diameter of parasol cells at the same eccentricity), highly transient and rapid OFF responses to diffuse light steps (with insignificant ON–OFF responses), the absence of significant S-cone selective input, and highly nonlinear spatial summation. These characteristics can be used to identify these cells and separate them from all other known macaque functional RGCs. With as yet no confirmed morphological counterpart, we have chosen the name upsilon for this cell type to emphasize its Y-like character (the capital Greek letter upsilon is written as a roman “Y”).
The role of large-scale multielectrode array recording
The large-scale multielectrode array recording system (Litke et al., 2004) had a combination of features of particular importance for these investigations. These features included a high density of electrodes (300 electrodes/mm2), fine time resolution (50 μs sampling interval) and large active area (1.7 mm2). The combination of these features contributed to the reported results in the following critical ways: (1) the efficient identification of neurons (spike sorting) based on the correlated analog waveforms recorded on multiple nearby electrodes; (2) the ability to separate ganglion cells from spiking amacrine cells by a dynamic high-density EI of the electrical activity of each identified neuron. The EI also provided a valuable check on the quality of neuron identification; (3) the detection of a sufficiently large sample of neurons of each type to allow statistically significant clustering for neuron classification; and (4) the ability to detect enough cells of a given type, over a sufficiently large area, to observe and characterize a mosaic, even for cells with large receptive fields.
Possible morphological correlate of the upsilon cells
Based on the measured upsilon cell properties, we can speculate as to which morphological RGC type (Rodieck and Watanabe, 1993; Dacey et al., 2003, 2005; Dacey, 2004; Yamada et al., 2005) they may correspond to. Because the upsilon cells are OFF-type cells with no evidence for ON-OFF responses (i.e., no significant responses to both light intensity increments and decrements) (Fig. 3e), we expect that they would correspond to a cell whose dendrites narrowly stratify in the outer part of the inner plexiform layer (Nelson et al., 1978; Peichl and Wassle, 1981). Also, because the upsilon cells do not show any significant S-cone/(L + M)-cone opponent response properties, we do not expect them to correspond to the sparse or giant sparse (melanopsin-expressing) cells (Dacey and Packer, 2003; Dacey, 2004; Dacey et al., 2005). Furthermore, because the ratio of the RF diameter of the OFF upsilon cells to the RF diameter of OFF parasol cells is ∼2.9, the ratio of the dendritic arbor diameter of the upsilon cells to the dendritic arbor diameter of the parasol cells is expected to also be ∼2.9 [assuming the same ratio of the RF to the dendritic arbor diameters for both cell types (Peichl and Wassle, 1983)]. Given these constraints, the most plausible morphological candidate(s) corresponding to the upsilon cells are the smooth (Dacey, 2004) and large radiate (Yamada et al., 2005) ganglion cells recently identified in the primate retina.
Both the smooth and large radiate cells exist as inner and outer stratifying subtypes with consistent stratification levels and with very similar dendritic tree structures. Furthermore, both cell types are morphologically similar (Dacey, 2004; Yamada et al., 2005) to the intensively studied cat α-cells (Boycott and Wassle, 1974; Wassle et al., 1981). Therefore, in what follows, we will consider them as a single cell type that has been given different names by different investigators. We will refer to them as smooth/large radiate cells.
The upsilon cells may be the primate counterpart of the Y-cells of the cat retina
Given the morphological similarity of the primate smooth/large radiate cells to the cat α-cells (Dacey, 2004; Yamada et al., 2005), and given that the physiological counterpart of the cat α-cell is the Y-cell (Cleland et al., 1975; Peichl and Wassle, 1981), it may be expected that the smooth/large radiate cells will possess physiological properties similar to the Y-cells of the cat, namely rapid and transient light responses, large receptive fields, and nonlinear spatial summation. The fact that these visual response properties also characterize the upsilon cells reported here brings full circle the following physiological/morphological correspondences: primate upsilon cells→primate smooth/large radiate cells→cat α-cells→cat Y-cells→primate upsilon cells. This suggests that the upsilon (smooth/large radiate) cells may be the primate retina's physiological (morphological) counterparts of the Y (α) ganglion cells of the cat retina and the retinas of other mammalian species (Peichl, 1991).
If the upsilon (smooth/large radiate) correspondence is correct, then the upsilon cells would project to both the LGN and the superior colliculus (SC), as observed for the smooth cells by retrograde labeling experiments (Dacey, 2004). This property of projection to both the LGN and the SC is also a property of the Y (α)-cells of the cat (Wassle and Illing, 1980), further strengthening the upsilon (smooth/large radiate)-to-Y (α) correspondence.
X/Y-cell classification
As mentioned in the introduction, there has been a decades-long search for primate Y-cells in both the retina and the LGN. The data of Kaplan and Shapley (1982) in macaque LGN indicated the existence of two types of cells: magnocellular X (MX)-cells, showing linearity in spatial summation, and nonlinear magnocellular Y (MY)-cells. These results supported the hypothesis that the MY-cells form a functional group with properties distinct from the MX-cells (hypothesis 1). However, the measurements of Kaplan and Shapley (1982) did not use a quantitative description of nonlinearity, as introduced, for example, by Hochstein and Shapley (1976a). This makes it difficult to determine whether the MX- and MY-cells are indeed distinct cell types or, instead, represent two ends of a nonlinearity distribution for a single cell type (hypothesis 2).
Later studies in macaque LGN (Derrington and Lennie, 1984; Levitt et al., 2001) and in macaque retina (White et al., 2002) used the quantitative nonlinearity index of Hochstein and Shapley (1976a) to search for distinct X- and Y-like cell types. However, the measured nonlinearity distributions were all unimodal, in support of hypothesis 2 but inconsistent with hypothesis 1.
The data in this study seem to reconcile the two hypotheses. The measurement of the nonlinearity distribution of macaque RGCs as shown in Figure 6d, without separation into the parasol and upsilon cell types, shows a unimodal distribution consistent with hypothesis 2. However, if the cells are classified using additional information, such as RF size and time filter characteristics, then one finds two distinct distributions of nonlinearity indices, as shown in Figure 6, b and c. The distribution of Figure 6c, in particular, indicates a distinct type of highly nonlinear Y-like RGCs (the upsilons), consistent with hypothesis 1. [Interestingly, the ratio of RF sizes of the MY- to MX-cells identified by Kaplan and Shapley (1982), based on the response to a drifting sine wave grating as a function of spatial frequency, is 2.6, consistent with the upsilon-to-parasol RF diameter ratio of 2.9 ± 0.4 reported in this study. Of course, the caveat is that we have no direct evidence that the upsilon cells project to the LGN.]
Future investigations
These results and speculations raise two critical issues for future physiological and anatomical investigations. First, the correspondence of the upsilon cells with the smooth/large radiate cells needs to be confirmed through combined physiological and morphological experiments. Second, if the OFF upsilon cells are indeed the physiological counterparts of the outer stratifying smooth/large radiate cells, one should also find the physiological correlates of the inner stratifying smooth/large radiate cells (i.e., the ON upsilon cells). Our preliminary data on the ON upsilon cells suggest that these cells do exist and that they have properties similar to the OFF upsilon cells (i.e., large RFs, rapid and transient light responses, and nonlinear spatial summation). However, because of very limited statistics, more experiments are required to confirm their existence and to study their properties.
Implications
These findings have the following implications: (1) the rapid, transient, and highly nonlinear spatial summation (Y-like) response properties of the upsilon cells are well suited for the detection of moving objects or moving textured patterns. This includes fine-grained patterns with constant luminance but drifting contrast modulation [second-order motion (Demb et al., 2001b)]. Assuming the upsilon cells project to both the LGN and the SC, they may simultaneously contribute to the primary visual pathway for motion perception and provide the SC with the information needed for triggering attention to movement; (2) the upsilon cells may be the primate counterparts of the Y (α)-cells of the cat (and many other mammalian species) and may originate similar visual processing pathways; and (3) these cells may contribute to nonlinear spatial summation properties observed for some cells recorded in the magnocellular layers of the primate LGN (Kaplan and Shapley, 1982).
Footnotes
-
This work was supported by National Science Foundation (NSF) Grant PHY-0417175 (A.M.L.), McKnight Foundation (A.M.L., E.J.C.), Alfred P. Sloan Foundation (E.J.C.), National Eye Institute Grant 13150 (E.J.C.), Burroughs Wellcome Fund Career Award at the Scientific Interface (A.S.), Helen Hay Whitney Foundation (G.D.F.), National Institutes of Health National Research Service Award and Chapman Foundation (J.L.G.), German Academic Exchange Service (M.G.), and NSF Integrative Graduate Education and Research Traineeship and La Jolla Interfaces in the Sciences (J.S.). We thank W. Dabrowski, A. A. Grillo, P. Grybos, P. Hottowy, S. Kachiguine, and B. Kutka for their valuable assistance and contributions; S. Barry for machining; H. Fox, K. Osborn, and E. Callaway for access to retinas; and the San Diego Supercomputer Center and the National Science Foundation (cooperative agreements 05253071 and 0438741) for large-scale data storage.
- Correspondence should be addressed to Alan M. Litke, Santa Cruz Institute for Particle Physics, Natural Sciences 2, University of California, 1156 High Street, Santa Cruz, CA 95064. alan.litke{at}cern.ch