Abstract
Adaptation to static scenes is a familiar and fundamental aspect of visual perception that causes negative afterimages, fading, and many other visual illusions. To establish a foundation for understanding the neuronal bases of such phenomena and to constrain the contributions of retinal versus cortical processing, we studied the responses of neurons in the dorsal lateral geniculate nucleus during and after the presentation of prolonged static visual stimuli. We found that parvocellular (P) cells (the more numerous and color-sensitive pathway) showed response adaptation with a time constant on the order of tens of seconds and that their response after the removal of a visual stimulus lasting 1 min was similar in amplitude and time course to the response evoked by the photographic negative stimulus. Magnocellular (M) cells (the faster-conducting and achromatic pathway) had after responses that were substantially weaker than responses evoked by patterned visual stimuli. This difference points to the existence of an adaptive mechanism in the P-pathway that is absent or impaired in the M-pathway and is inconsistent with full adaptation of photoreceptors, which feed both pathways. Cells in both pathways often maintained a substantial tonic response throughout 1 min stimuli, suggesting that these major feedforward inputs to cortex adapt too slowly to account for visual fading. Our findings suggest that faster-adapting mechanisms in cortex are likely to be required to account for the dynamics of perception during and after the viewing of prolonged static images.
Introduction
A fascination with visual afterimages since the time of Purkinje (1825; Anstis et al., 1978) has produced a wealth of psychophysical studies that probe the nature of visual perception, from retinal mircocircuitry (Wilson, 1997) to neuronal correlates of consciousness (Tsuchiya and Koch, 2006). A perennial debate has been whether the seat of afterimages is retinal or cortical (Parinaud, 1882; Delabarre, 1889; Creed and Harding, 1930; Misiak and Lozito, 1951; Loomis, 1972; Virsu and Laurinen, 1977; Shimojo et al., 2001; Gilroy and Blake, 2005; Tsuchiya and Koch, 2006). A more recent debate has arisen regarding the roles of the functionally segregated parvocellular (P) and magnocellular (M) retinogeniculate pathways (Ingling and Grigsby, 1990; Kelly and Martinez-Uriegas, 1993; Schiller and Dolan, 1994). Despite many attempts to link the perception of afterimages to hierarchical stages and parallel streams of the visual system, we are not aware of any systematic electrophysiological characterizations of the signals discharged by single neurons in retina, lateral geniculate nucleus (LGN), or V1 in response to prolonged presentations of static patterned visual stimuli. The absence of, and need for, studies characterizing slow components of visual adaptation was noted by Shapley and Enroth-Cugell (1984). Such characterizations can offer a new set of constraints for working out functional neuronal circuitry in the visual system. Moreover, understanding how neurons in the primate visual system respond to such stimuli can provide a foundation for interpreting the extensive psychophysical literature on afterimages and fading.
We studied responses in the macaque LGN during and after prolonged visual stimuli presented at moderate luminance on a standard video display [cathode ray tube (CRT)]. Such stimuli are typical of those used in psychophysical and neurophysiological studies and are not associated with significant bleaching of photoreceptors (Shapley and Enroth-Cugell, 1984). We focused on the time scale of seconds to minutes because this covers the ranges associated with visual fading (Tulunay-Keesey, 1982), afterimage formation (Kelly and Martinez-Uriegas, 1993), and luminance adaptation (Baker, 1949; Adelson, 1982; Hayhoe et al., 1992; Yeh et al., 1996; Hood, 1998). Recording in the LGN allowed us to separately study the behavior of the parallel M- and P-pathways. We hoped to reconcile psychophysical claims that the M-pathway does not mediate afterimages (Ingling and Grigsby, 1990; Kelly and Martinez-Uriegas, 1993) with the demonstration that macaques still report afterimages after P-pathway lesions (Schiller and Dolan, 1994). Furthermore, we measured time constants of adaptation in the LGN to determine whether they match those of visual fading and afterimages in human psychophysics, and we tested the implications of an explicit model of afterimage formation via early retinal circuitry (Wilson, 1997).
Materials and Methods
Electrophysiology
Single-unit responses were recorded extracellularly from the dorsal LGN of two anesthetized, paralyzed macaques (Macaca mulatta). Detailed methods are available in the study by Bair et al. (2002). Animals were anesthetized and paralyzed with sufentanil citrate (6–30 μg/kg/h) and vecuronium bromide (Norcuron; 0.1 mg/kg/h), in lactated Ringer's solution (2 ml/kg/h) containing dextrose (2.5%). Anesthesia was supplemented with isoflurane (0.25%). Artificial respiration with a moist mixture of 50% O2 and 50% room air was maintained with rate adjustments to keep expired CO2 between 32 and 38 mmHg. Body temperature was maintained near 37°C with a heating pad. EEG and electrocardiogram were monitored to maintain a proper depth of anesthesia. The corneas were protected with gas-permeable hard contact lenses, with additional lenses to optimize neuronal responses to high spatial frequency (SF) stimuli. All procedures conformed to UK Home Office regulations on animal experimentation.
A mechanical microdrive was used to advance quartz–platinum tungsten microelectrodes into the brain (Thomas Recordings). The craniotomy was centered behind the central sulcus and 11 mm lateral to the midline. Neurons were recorded in the P- and M-layers and had receptive field eccentricities between 10 and 40°. Signals were digitized at 12.5 kHz using a National Instruments analog-to-digital board, and spikes were discriminated using time-amplitude windows (custom software, C-code) and stored at 1 ms resolution. Electrolytic lesions were made for histological verification and estimation of LGN layer. After 5 d, animals were given an overdose of sodium pentobarbital (65 mg/kg), exsanguinated, and perfused with 4% paraformaldehyde in saline.
Visual stimuli
Basic characterization.
We mapped each cell by hand using bars and gratings while adjusting the electrode depth in micrometer increments to obtain well isolated action potential waveforms. We next characterized each cell physiologically by presenting on a CRT (96 Hz; mean luminance, 27 cd/m2) a series of drifting sinusoidal gratings under computer control to generate tuning curves for direction, SF, temporal frequency, and size. We then presented the optimal grating as a static stimulus for 2 s at eight spatial phases. We defined the preferred phase to be that generating the highest firing rate and the antiphase to be 180° opposite (see Fig. 1).
Static, long duration.
To examine adaptation and recovery on a long time scale, we presented 64 s static stimuli followed by 128 s of mean gray. Typically, we collected six trials with the antipreferred stimulus and six trials with the preferred stimulus. At the end of most runs, and particularly if an audible change in response occurred, we retested the spatial phase tuning to ensure that eye position had not changed. If a substantial phase shift was detected, the data were discarded.
Static, variable duration.
We presented the antipreferred stimulus for varying durations followed by recovery epochs. Six conditions were interleaved, consisting of stimulus durations 1, 2, 4, 8, 16, and 32 s, followed by recovery epochs of 16, 2, 4, 8, 8, and 16 s, respectively. The recovery epochs for 16 and 32 s stimuli were limited to shorten the overall experiment, and the recovery epoch for 1 s was matched for comparison with the 32 s stimulus. These details were designed for testing V1 cells, which typically recovered rapidly and within the epochs provided (McLelland et al., 2007), and are not critical here. Within each block of six conditions, the stimulus was present for 63 s and absent for 63 s, including 1.5 s of gray between trials.
Cell classification.
We distinguished M- and P-cells based on the presence of contrast gain control (Shapley and Victor, 1978) in M-cells and its absence in P-cells (Benardete and Kaplan, 1999). Classification was based on the temporal kernels (Benardete and Kaplan, 1999; Bair et al., 2002) and was in agreement with our assessment from contrast sensitivity, latency, and laminar location.
Data analysis
Responses are shown as peristimulus time histograms (PSTHs), convolved with a Gaussian to suppress noise (for SD, see figures) and scaled to show the mean instantaneous firing rate. Where normalized population PSTHs are presented, we divided the PSTH for each cell by the firing rate in the time window starting at 500 ms (to avoid the on transient) and ending at 1000 ms. We calculated the amplitude of on- and off-transient responses as the peak firing rate within the first 120 ms in PSTHs that were smoothed with a Gaussian of SD 4 ms.
We measured the variability of interspike intervals (ISIs) using the CV2 metric (Holt et al., 1996) defined for two consecutive ISIs, a and b, as CV2 = 2|a − b|/(a + b). This is simply the difference over the mean of the intervals and is insensitive to slow changes in firing rate because it is local in time.
Exponential functions of the following form were fit to PSTHs, minimizing mean squared error (MSE): The parameters A and B are the amplitude and offset firing rates, and τ is the time constant
A model of adaptation was fit to the data for the variable duration stimuli. The model holds that the response, r(t), to the stimulus is proportional to the difference between the stimulus amplitude and the reference adaptation level: where the adaptation level is given by the following: Here, the asterisk indicates convolution, y(t) is an exponential (Eq. 1), and s(t) is the time-varying stimulus contrast, which is zero before the stimulus begins and during recovery epochs and −1 during the antipreferred stimulus. The MSE of the fit was minimized only during the recovery periods, because the firing rates of most cells were suppressed to zero during the antipreferred stimulus, preventing the observation of the temporal evolution of the antipreferred response.
Results
For each LGN neuron, we identified a preferred static stimulus (see Materials and Methods). The antipreferred stimulus was constructed by shifting the phase of the preferred grating by 180°; thus, it was the photographic negative of the preferred (Fig. 1). The responses to these stimuli can be broken into four major components: the on- and off-transients and the maintained and after responses (Fig. 1B). The first three components are familiar, as described by Kuffler (1953) and depicted by Enroth-Cugell and Robson (1966) for cat retinal ganglion cells, but the fourth component, the after response, has been essentially overlooked.
To characterize the after response and compare it to the maintained response on a time scale that encompasses the psychophysical fading of stabilized images and negative afterimages (Tulunay-Keesey, 1982; Hayhoe et al., 1992), we presented static stimuli for 64 s, followed by 128 s of mean gray (Fig. 2A).
The responses of three P-cells to prolonged stimuli are shown in Figure 2B. The maintained response of the first cell, which had appeared relatively flat in Figure 1B, can be seen to decay substantially at the longer time scale presented here. The response to the antipreferred stimulus was somewhat suppressed during the maintained period but increased rapidly after the stimulus disappeared such that the after response to the antipreferred stimulus (AR) approximated the maintained response to the preferred stimulus (MR). This differs from the case for the 2 s stimuli (Fig. 1A,B), in which the AR was substantially less than the MR, indicating that the amplitude of the AR depends on the adaptation time. A second example (Fig. 2B, middle) shows the same pattern, in which the approximate amplitude and time course of the MR and AR were similar. The third example P-cell had a more complicated time course (Fig. 2B, bottom), in which the MR increased over the first 10 s and then decayed back toward the spontaneous level. This humped time course also appeared in the AR, consistent with the similarity of MR and AR in the first two examples.
All P-cells showed behavior consistent with these examples. We normalized the PSTHs for each cell so that 1 represents the firing rate at the start of the MR and 0 represents the baseline rate (see Materials and Methods) and averaged them together (Fig. 2C) (n = 9). The resulting population PSTHs reveal several important characteristics of long time-scale adaptation. First, the average MR does not return to baseline, even after 64 s, suggesting that adaptation in the retina and LGN is insufficient for the fading of these stimuli within 64 s. Second, despite the incomplete decay toward baseline during the maintained period, the amplitude of the AR is similar to that of the MR, suggesting that the underlying adaptive mechanism may be near saturation. Third, the dynamics of the after responses approximated those of the maintained responses to the photographic negative stimuli. Thus, the P-pathway delivers a set of signals consistent with a negative afterimage that is similar in time course and amplitude to the original stimulus in terms of the mean instantaneous firing rate.
We tested M-cells with the same stimulus paradigm, with markedly different results. Figure 2D shows the responses of three typical M-cells. Their responses were similar to those of P-cells while the stimuli were present (i.e., MRs were elevated for preferred stimuli, suppressed for antipreferred stimuli, and decayed partially toward a middle value during the 64 s stimulus presentation). However, when the antipreferred stimulus disappeared at t = 64 s, the ARs were substantially less than the MRs had been to the preferred stimuli and thus less than would have been expected for P-cells. This difference in behavior is summarized by the population average PSTHs (Fig. 2E), which show that M-cells provide the cortex with signals consistent with a negative afterimage but substantially weaker than that associated with the actual negative stimulus and similarly weaker than that transmitted by P-cells (Fig. 2, compare C, E).
We quantified the time courses of MRs and ARs by fitting exponential functions (Eq. 1; see Materials and Methods) to the PSTHs. When fitting, we omitted the first 10 s period, which often contained marked deviations from exponential decay (e.g., cell 3 in Fig. 2B). The distributions of time constants τMR and τAR across cells are shown in Figure 2, C and E (insets). For P-cells, there was no significant difference between τMR and τAR (t test, p = 0.23; n1 = 9; n2 = 10). For M-cells, τAR was significantly larger than τMR (t test, p = 0.02; n1 = 8; n2 = 6); however, we believe this is related to low AR amplitudes in some M-cells (e.g., cell 6 in Fig. 2D). In fact, many ARs were basically flat and could not be reliably fit, as indicated by the histogram bars marked “No fit” in Figure 2E (inset). τMR was not significantly different between P- and M-cells (p = 0.17; n1 = 9; n2 = 8), but τAR was larger for M-cells (p = 0.01; n1 = 10; n2 = 6). Overall, τ values were, on average, ∼20 s (Fig. 2) but somewhat longer for M-cell ARs (∼50 s). Thus, a major component of adaptation in the MRs and ARs operates on the order of tens of seconds.
Given the similarity of the P-cell AR and MR in terms of mean firing rate, we wondered whether they could be distinguished based on response variability. We found no difference in spike count variance or in the ISI variability metric, CV2 (Holt et al., 1996) (see Materials and Methods). In particular, the spike count variance was not significantly different when comparing the first 20 s of the MR to the first 20 s of the AR (p = 0.71, paired t test). The CV2 metric varied considerably across cells (0.56–1.1), was highly correlated between MR and AR (r = 0.91; p < 0.001; n = 9), and also did not differ significantly when comparing MR with AR (p = 0.74, paired t test). We performed similar tests for 10 and 60 s epochs and found similar results. For M-cells, we found no difference in spike count variance and a slight increase in ISI variability for the AR, but the latter was expected, being correlated with the decrease in firing rate during the AR compared with the MR.
Having observed that the M-cell AR was substantially weaker than the MR, we tested whether the off-transient (Fig. 1) was likewise weaker than the on-transient. We found no significant difference between the amplitude (see Materials and Methods) of on- and off-transient responses for M-cells (p = 0.54; n = 11). This also held for P-cells (paired t test, p = 0.35; n = 9). Thus, the difference between the MR and AR for M-cells (Fig. 2E) was observed only after the transient responses. A high-resolution comparison of the population PSTHs for the relevant part of the response is shown in Figure 3, where off/after responses are overlaid with on/maintained responses. As expected, M-cell on- and off-transients occurred ∼20 ms earlier than P-cell responses (Fig. 3, inset) (Maunsell et al., 1999), and M-cell responses were more transient in the first ∼100 ms, agreeing with the classical distinction made by Schiller and Malpeli (1978). Shortly after 100 ms, there is a clear difference between the M-cell AR and MR that is not present for P-cells. The M-cell AR is significantly less than the MR in the epoch from 200 to 600 ms (paired t test, p = 0.01; n = 9). In summary, the M-cell difference arises early, although it is not immediately evident in the on-transient, perhaps as a result of saturation of the high-amplitude transient signals.
The results above suggest that the dynamics of P-cell MRs and ARs are similar and could be unified with a simple adaptive model (see Materials and Methods) in which the response depends not directly on stimulus contrast but on the difference (Eq. 2) between the true contrast and a reference signal that is constantly adapting to match the stimulus (Eq. 3). We tested whether this form of adaptation could approximate the ARs of P-cells on a single-trial basis in response to a richer stimulus in which antipreferred gratings were presented at variable times and with variable durations. A 400 s response epoch from example cell 1 (Fig. 2B) is shown in Figure 4B along with the AR predicted by the model (black line). The fit captures several prominent features of the data. The ARs after the first few stimuli have lower amplitudes (e.g., arrow 1) than later ones (arrow 3), showing that adaptation builds up across stimuli. The AR decays progressively across stimuli when the antipreferred stimuli are infrequent and brief (arrow 4). Finally, the off-transient (arrow 2), which is not modeled here, is strong from the beginning and does not change in proportion to the AR.
The time constants for these fits (Fig. 4C) were longer, on average, than those of Figure 2. This reflects the fact that fitted epochs are shorter and the fits here begin at the start of each AR, whereas those in Figure 2 started 10 s later to avoid the early part of the response, which in some cells was not well modeled by decaying exponentials (being flat or leading to a hump). Nevertheless, results from both experiments demonstrate that time constants of adaptation to static stimuli are on the order of tens of seconds. We did not fit M-cells because they had weaker and flatter ARs, on average.
Discussion
We have identified a component of adaptation to static visual patterns that operates on a time scale of 10–100 s and is present in the outputs of both P- and M-cells in the LGN. Both pathways showed similar adaptation during prolonged static stimuli. However, whereas ARs in the M-pathway were consistently weaker than the driven response to the photographic negative, ARs in the P-pathway approximated the driven response. Under the assumption of rate coding, as in the fifth dogma of Barlow (1972), signals in the P-pathway that adapt completely could support a perceptual equivalence between real images and afterimages. This, however, is undermined by the M-pathway, which maintains a distinction by virtue of its incomplete adaptation. Nevertheless, both pathways carry ARs that are significant and decay slowly over many tens of seconds.
These results help to resolve some conflict between past studies. The idea that the M-pathway is completely transient, having only brief on- and off-responses, is inconsistent with our findings, yet this was put forward by Ingling and Grigsby (1990) in a psychophysical study concluding that afterimages are solely P driven. Their conclusion was echoed by Kelly and Martinez-Uriegas (1993), who found the intriguing result that afterimages from both chromatic and achromatic gratings elevated the psychophysical threshold for detecting chromatic gratings, but neither raised the threshold for achromatic gratings. Schiller and Dolan (1994) tested this P-only afterimage hypothesis by lesioning the LGN P-layers in macaque and found that subjects continued to reliably report visual afterimages. They concluded that the M-pathway must be capable of supporting afterimages. Our results support this conclusion, as both M- and P-pathways showed clear ARs, but also offer a basis for explaining why achromatic detection thresholds are insensitive to afterimages, as many M-cells showed weak ARs. Thus, it is possible that such M-cells retain enough of their dynamic range to support low detection thresholds even as others carry signals strong enough to spark awareness of an afterimage, but a systematic study of sensitivity during the after response would be required to test this idea.
Our results also shed light on plausible retinal circuits for local luminance adaptation. Wilson (1997) argued that a slow subtractive process of light adaptation (Hayhoe et al., 1992) that underlies afterimage formation is mediated by the gain of the negative feedback from horizontal cells onto cones. However, this model predicts similar ARs for parasol and midget ganglion cells because the adaptive mechanism operates early in elements that are common to both pathways (cones and H1 horizontal cells). Our data show substantial differences in M and P ARs, suggesting that the relevant mechanism lies in cells that are distinct in these pathways: bipolar, amacrine or ganglion cells (Dacey, 2000). We hypothesize the existence of slow, subtractive adaptation (Adelson, 1982) like that in our model (see Materials and Methods) in the midget pathway that is absent or weaker in the parasol pathway. An alternative hypothesis, that the adaptation occurs at the photoreceptors and is subsequently suppressed in the parasol pathway, is less attractive because it would require the M-pathway to somehow selectively suppress ARs but not MRs. It seems simpler to assume that strong ARs never develop in the first place in the M-pathway.
We think it is unlikely that our results are closely linked to the growing body of literature on contrast gain control (Shapley and Enroth-Cugell, 1984; Solomon et al., 2006), which operates nearly three orders of magnitude faster and involves adaptation to temporal variation around a mean rather than to a static luminance pattern. Perhaps a more relevant study is that of Yeh et al. (1996), who probed the sensitivity of macaque retinal responses in the 30 s after step changes in full-field (unpatterned) luminance. In contrast with our results, for achromatic stimuli they reported a rapid (∼2–3 s) return to baseline response levels and low overall responses. This is consistent with their full-field stimuli engaging center-surround antagonism, thereby minimizing the driven response. Our patterned stimuli engaged the center and surround synergistically, causing strong elevations (or depressions) in firing rate, not present for their stimuli. Notably, for chromatic stimuli, which should not engage center-surround antagonism, they found strong responses and longer time courses of recovery that are consistent with our results.
The time scale of adaptation characterized here bears comparison to a large body of psychophysical literature on light adaptation, visual fading, and afterimages. Baker's (1949) study of light adaptation yields τ values of, on average, 25 s and complete adaptation within 3 min. Anstis et al. (1978) reported τ = 25–54 s for the decay of contrast afterimages and found that 18 s stimuli were too short to build full-strength afterimages. These reports, for example, approximately match our data. However, one of the most thorough investigations of the time course of afterimages (Kelly and Martinez-Uriegas, 1993) found that chromatic and achromatic afterimages developed and decayed with similar exponential time constants, 4–8 s, consistent with the 5–9 s reported previously for achromatic afterimages (Virsu and Laurinen, 1977). These numbers are substantially shorter than our values for the mean time constants of LGN ARs. Hayhoe et al. (1992) identified a subtractive process of light adaptation that asymptotes within 10–15 s and is believed to be consistent with the time course of visual fading, which occurs on the order of a few seconds (King-Smith et al., 1977; Tulunay-Keesey, 1982). These results, not to mention common experience with a variety of popular visual illusions, suggest that visual fading and afterimages typically operate on a substantially faster time scale than that required for LGN responses to return to their baseline levels.
Thus, although classical studies have provided compelling explanations of visual psychophysics on the basis of retinal adaptation (Barlow and Levick, 1969; Shapley and Enroth-Cugell, 1984), the slowly adapting signals and long-lasting maintained responses observed in the LGN suggest that adaptation in the cortex, not the retina, will be required to account for fading and afterimages. Preliminary data from V1 suggests that most cortical cells adapt back to their spontaneous firing rates within a few seconds (McLelland et al., 2007).
Footnotes
-
This work was supported by the Wellcome Trust. B.A. was supported by Biotechnology and Biological Sciences Research Council Grant BBC5049431. We thank Pamela Baker, Mark Baxter, Kristine Krug, Kathy Murphy, and Andrew Parker for assistance with experiments.
- Correspondence should be addressed to Douglas McLelland at the above address. douglas.mclelland{at}dpag.ox.ac.uk