Attention is a process for selecting relevant information from complex sensory inputs for further processing. Attention can be classified into spatial attention and feature attention, the former denoting selective processing of stimuli in a specific location and the latter denoting selective processing of stimuli having a particular sensory feature. Many studies have demonstrated that both spatial and feature attention modulate neuronal responses and improve the signal-to-noise ratio of the encoding of the stimulus being attended. It has also been reported that both types of attention increase the magnitude of neuronal responses in multiple visual areas (Carrasco, 2011; Maunsell, 2015). Furthermore, both spatial and feature attention enhance the representation of the attended stimulus by reducing neuronal response variability (often measured with the Fano factor) and pairwise noise correlation (Cohen and Maunsell, 2009, 2011; Mitchell et al., 2009). Commonalities between spatial and feature attention suggest that they may share a common top-down source.
The effects of attention on neuronal activity are well captured by attentional normalization models (Lee and Maunsell, 2009; Reynolds and Heeger, 2009). Normalization has been suggested as a canonical neural computation and has successfully explained many nonlinear properties of neurons (Carandini and Heeger, 2012). In the normalization model, a neuron not only is excited when a stimulus appears in its receptive field, but also is inhibited by neurons in a local network. Attention modulates the neuron's response by adjusting the amount of the feedforward excitatory drive associated with the sensory stimulus and suppressive drive from the local network.
Although the commonalities between feature and spatial attention have been widely observed, many differences have also been reported. One significant difference is that spatial attention is limited to a specific location of the retinotopic map (Womelsdorf et al., 2006), whereas feature attention affects the processing across the entire visual field (Saenz et al., 2002). Also, the temporal dynamics of the modulation of sensory neurons by the two types of attention are different (Hayden and Gallant, 2005). These observations raise several questions. How are spatial and feature attention similar in some aspects, and different in others? Can normalization explain both commonalities and discrepancies? If it cannot, what neural mechanism could explain commonalities and discrepancies between spatial and feature attention?
To address these questions, Ni and Maunsell (2019) trained monkeys with a direction-change detection task while recording from the medial temporal cortex (MT). In the task, at most three drifting Gabor patches were presented on a computer screen; two of them were in the receptive field of the recorded neuron, and the other was on the opposite side of the visual field. Each Gabor patch flashed every 200 ms; then after a blank interstimulus period, its direction or contrast might change. A cue presented before Gabor-patch onset indicated which of the three locations the monkeys should attend. The monkeys were required to detect when the direction of the Gabor patch changed <90° and were rewarded for saccading to the target Gabor within 100–600 ms after the change happened. All other changes, such as direction changes >90° and changes of contrast, should be ignored.
By manipulating the direction of the Gabors and the attended location, Ni and Maunsell (2019) created three different task variants to measure the neuronal modulation elicited by normalization, spatial attention, and feature attention. To quantify the normalization effect, there were either one or two stimuli in the receptive field, moving in either the preferred or the opposite direction, and attention was guided to the Gabor outside of the receptive field, which moved orthogonally to the preferred direction. In the spatial attention task variant, the monkeys were cued to attend to a Gabor in the receptive field, whose direction kept changing so that no specific feature was attended and the effect of feature attention was minimized. In the feature attention variant, attention was cued and attracted to the Gabor outside of the receptive field to remove the effect of spatial attention on the recorded neuron. The attended Gabor moved in the neuron's preferred direction and the moving direction was the attended feature.
The authors measured the effect of normalization, spatial attention, and feature attention on MT neurons' responses with normalization modulation index (NMI), spatial modulation index (SMI) and feature modulation index (FMI), respectively. Results showed that SMIs and FMIs were correlated across neurons, and the correlation existed even when controlling for NMIs. The authors attributed the correlation between FMIs and SMIs to the commonality between the mechanism supporting spatial and feature attention. However, as previous researches have shown, attentional modulation strength was also correlated with behavioral performance features like accuracy and reaction time (Cook and Maunsell, 2002; Cohen and Maunsell, 2011). Because the data in this experiment were collected from different neurons on different days, it was possible that the SMI/FMI correlation across neurons simply reflected the behavioral fluctuation across days. To exclude this possibility, one could calculate the SMI/FMI correlation while controlling for behavior performance. Simultaneous recordings in the same session can also address this potential confounder.
Prior studies had shown that the variance of spatial-attention modulation strength across neurons could be explained largely by the variance in normalization strength (Ni et al., 2012; Verhoef and Maunsell, 2016). Ni and Maunsell (2019) tested whether feature-attention modulation strength was correlated with normalization strength and found no correlation, indicating that normalization might explain the difference between spatial and feature attention.
To better understand commonalities and discrepancies between spatial and feature attention, the authors fitted the electrophysical results with a spatially-tuned equal maximum suppression normalization model they proposed previously (Ni and Maunsell, 2017). In the model, a neuron's response was described as a sum of excitatory drives associated with stimuli appearing in the RF and each of them was divided by a suppressive drive separately before summation. Further, the suppressive drive changed with the associated stimulus' location, despite the stimulus always being in the RF. The authors assumed that spatial and feature attention enhanced the excitatory drive associated with the attended stimulus. Because the experimental results indicated that only spatial attention interacted with local normalization, the authors hypothesized that spatial attention modified the suppressive drive of the attended stimulus and feature attention did not. This model successfully explained variance in the neuronal responses, which suggests spatial and feature attention have the same top-down mechanism but differ in their interactions with localized normalization.
It is not clear what the biological mechanism underlying normalization is, even though many models and hypotheses have been proposed (Carandini and Heeger, 2012). A single neural mechanism cannot explain normalization across all brain areas and species. Normalization can be implemented not only through lateral inhibitory inputs from nearby neurons, but also through feedforward inputs or feedback inputs from higher brain areas. Most of these hypotheses assume that there is one unified suppressive drive and excitatory drives are summed before normalization. It is unknown whether these models could extend and reproduce the neuronal response patterns predicted by the equal maximum suppression normalization model proposed by Ni and Maunsell (2019), in which non-uniform suppressive drives are necessary and excitatory drives are summed after normalization. One possibility is that excitatory drives are normalized by suppressive drives before MT and then summed together. In other words, a pair of neurons with overlapping receptive fields from primary sensory areas project to the same neuron in MT and normalization happens both in primary sensory areas and MT.
Attention modulates not only the magnitude of neuronal responses, but also the noise correlation. Many factors may contribute to the noise correlation, such as local connectivity and global fluctuation caused by top-down inputs (Kohn et al., 2016). A previous study suggested that the effect of attention on the noise correlation could also be captured by a normalization model (Verhoef and Maunsell, 2017). However, in Verhoef and Maunsell's (2017) study, limited by the behavioral paradigm, different types of attention modulation could not be disentangled. With the paradigm designed by Ni and Maunsell (2019), researchers could investigate how spatial and feature attention exert leverage on the noise correlation individually. It would provide additional evidence toward the common source hypothesis if discrepancies between spatial and feature attention modulation on the noise correlation could be mostly explained by the normalization mechanism.
Contrary to the work by Ni and Maunsell (2019), which supports the hypothesis that spatial and feature attention have the same top-down mechanism, a study by Bichot et al. (2015) showed that a lesion in the ventral pre-arcuate region of prefrontal cortex impaired feature attention signals in the frontal eye fields but left spatial attention signals intact. One possibility that might reconcile these seemingly contrary results is that the SMI/FMI correlation in MT neurons reflects intrinsic properties of the connections between MT neurons and neurons in the brain areas that are involved in the transmission of both spatial and feature attention signals. MT neurons that are strongly connected with the brain areas, such as the frontal eye fields, conveying both types of attention signals may show relatively large SMIs and FMIs. In comparison, MT neurons that are weakly connected with the brain areas conveying both types of attention signals may show small SMIs and FMIs. Consequently, we could observe the SMI/FMI correlation even if the ventral pre-arcuate region of prefrontal cortex is only causally related to feature attention. Further investigation is needed to examine this possibility.
With a well designed behavioral task, Ni and Maunsell (2019) studied normalization, spatial attention, and feature attention in a unified framework that helps us to understand how top-down attention interacts with local normalization. Unlike in previous studies, when measuring the attentional effects, Ni and Maunsell (2019) regarded the neuronal responses in the normalization task variant as a common baseline, which allowed a reliable comparison between spatial and feature attention. Finally, Ni and Maunsell (2019) is one of the few studies comparing spatial attention and feature attention in a quantitative manner (Cohen and Maunsell, 2011), which provides further clues on the relationship between mechanisms underlying spatial and feature attention.
Footnotes
Editor's Note: These short reviews of recent JNeurosci articles, written exclusively by students or postdoctoral fellows, summarize the important findings of the paper and provide additional insight and commentary. If the authors of the highlighted article have written a response to the Journal Club, the response can be found by viewing the Journal Club at www.jneurosci.org. For more information on the format, review process, and purpose of Journal Club articles, please see https://www.jneurosci.org/content/jneurosci-journal-club.
We thank Tianming Yang and Zhongqiao Lin for helpful discussions and comments on the paper.
The authors declare no competing financial interests.
- Correspondence should be addressed to Zhewei Zhang at zwzhang{at}ion.ac.cn