Abstract
Visual attentional selection is influenced by the value of objects. Previous studies have demonstrated that reward-associated items lead to rapid distraction and associated behavioral costs, which are difficult to override with top-down control. However, it has not been determined whether a perceptually competitive environment could render the reward-driven distraction more susceptible to top-down suppression. Here, we trained both genders of human subjects to associate two orientations with high and low magnitudes of reward. After training, we collected fMRI data while the subjects performed a categorical visual search task. The item in the reward-associated orientation served as the distractor, and the relative physical salience between the target and distractor was carefully controlled to modulate the degree of perceptual competition. The behavioral results showed faster searches in the presence of high, relative to low, reward-associated distractors. However, this effect was evident only if the physical salience of the distractor was higher than that of the target, indicating a context-dependent suppression effect of reward salience that relied on high perceptual competition. By analyzing the fMRI data in primary visual cortex, we found that the behavioral pattern of results could be predicted by the suppressed channel responses tuned to the reward-associated orientation in the distractor location, accompanied by increased responses in the midbrain dopaminergic region. Our results suggest that the learned salience of a reward plays a flexible role in solving perceptual competition, enabling the neural system to adaptively modulate the perceptual representation for behavioral optimization.
SIGNIFICANCE STATEMENT The predictiveness principle in learning theory suggests that the stimulus with high predictability of reward receives priority in attentional selection. This selection bias leads to difficulties in changing approach behaviors, and thus becomes an important factor related to psychiatric disorders with attentional deficits. Here, we demonstrated that such principle is adaptively implemented in attentional suppression in visual search. We showed that the learned salience induced the suppression of the reward-associated distractor if its competition with the target was strong and could not be readily solved. This behavioral pattern was accompanied by increased midbrain fMRI activity and weakened sensory representation of the reward-associated distractor in V1. Our findings provided direct evidence that our brain flexibly uses learned regularities in attentional control.
Introduction
Competing for attentional priority has a vital role in sensory processing when the input information exceeds the limited capacity of the sensory system (Motter, 1993; Desimone and Duncan, 1995; Ito and Gilbert, 1999; Kastner and Ungerleider, 2001; Fecteau and Munoz, 2006; Bisley and Goldberg, 2010; Ptak, 2012). This priority map can be biased by the selection of task-relevant objects that is governed by top-down control, and the detection of unexpected or conspicuous objects in the form of bottom-up capture (Theeuwes, 1992; Egeth and Yantis, 1997).
Recent studies in both humans and primates have shown the advantage of reward-associated stimulus in attentional competition when it is neither physically conspicuous nor task relevant (Della Libera and Chelazzi, 2009; Peck et al., 2009; Hickey et al., 2010; Anderson et al., 2011a). This learned predictiveness of reward was suggested to increase salience and upgrade its selection priority even when it was competing with a task-related and physically salient object (Hickey et al., 2010; Anderson et al., 2011a; Le Pelley et al., 2015). The evidence for the enhanced representation of reward-associated stimuli was found in both visual cortices (Serences, 2008) and dopaminergic midbrain areas (Anderson et al., 2014). Conversely, literature on cognitive control suggests that the dopaminergic modulation of sensory processing varied according to the task demands (van Schouwenburg et al., 2010). This high-level control theory predicts that the strength of dopaminergic modulation could also determine the degree of suppressed encoding of the reward-associated distractors (Hickey and Peelen, 2015). In agreement with this prediction, recent behavioral and electrophysiological investigations have demonstrated effective suppression over reward salience in certain task contexts (Lee and Shomstein, 2014; Sawaki et al., 2015; Gong et al., 2016). However, the mechanism of the suppression effect requires further investigation given the two streams of evidence in opposite directions.
Here, we suggest that an important step to reconcile the inconsistencies is to identify the critical factors that determine the suppression of reward salience in attentional competition. We believe that a plausible explanation concerning the controversy could be ascribed to the necessity of suppression in attentional control. If attending to the target is insufficient in solving perceptual competition, effective suppression of distractors with reward salience would become an alternative way to facilitate attentional selection. This idea is supported by the model of selective attention based on inhibition (Houghton and Tipper, 1994) and corresponding evidence, in which the strength of suppression was shown to adapt to the necessity of ignoring the distraction (Gaspar and McDonald, 2014; Weaver et al., 2017). To test this hypothesis, the experiment needs to meet two criteria. First, the strength of competition between the target and distractor needs to be carefully controlled: the physical salience of the target is either higher or lower than that of the distractor, thus creating conditions of low and high perceptual competition, respectively. Second, the degree of reward salience needs to vary between conditions only for the distractor. In this regard, the effect of reward salience on distractor suppression can be examined under different target-distractor relationships.
In the present study, we tested our hypothesis with a visual search task that met the above criteria. The behavioral results showed that the relative physical salience between the target and distractor played a critical role in the presence of suppression of reward salience. Particularly, the faster searches in presence of high, relative to low, reward associated distractors was evident only if the distractor's physical salience was higher than that of the target. Analyses of fMRI data revealed that this behavioral suppression effect under high competition was associated with increased activity in the midbrain area that is responsible for reward processing, as well as reduced orientation channel response in the area that responded to the reward-associated distractor in the primary visual cortex (V1).
Materials and Methods
Experimental design and statistical analysis
Participants.
Thirty observers were recruited (14 males; 22 ± 1.93 years) for this experiment. They were students at Peking University with normal or corrected-to-normal vision, were right-handed, and gave written informed consent. The observers were paid ¥320 (¥1 ≈ $0.15) for completion of the tasks, with an additional payment up to ¥30 based on their task performance in the reward learning session. The local ethics committee approved the study.
Procedure.
Each observer performed three sessions of the main experiment over 3 d. A practice session (∼20 min) was completed within 1 week before the main experiment. In the practice session, observers were shown the stimuli and trial sequences for each task, separately. After observers had familiarized themselves with the whole procedure and the tasks, they participated in three experimental sessions. (1) In the first scanning session (∼1 h), observers performed the orientation weight estimation task (6 runs; 72 trials/run; 380 s/run). (2) In the training session (∼50 min), observers completed the reward-learning task (5 runs; 160 trials/run; 560 s/run). (3) In the second scanning session (∼1 h), observers performed the categorical visual search task (7 runs; 125 trials/run; 270 s/run) and a separate run of retinotopic mapping (380 s). The practice and training session were completed in a behavioral laboratory, whereas the other two sessions were conducted at the MRI Research Center at Peking University.
Behavior design.
The main experiment consisted of training (Session 2) and test (Session 3) sessions. We used the within-subject design for tasks in both sessions. Specifically, we first trained observers to associate two distinct orientations with high and low magnitudes of reward (Session 2), in which one independent factor of reward was manipulated. Then, we tested the suppression of reward salience with a visual search task (Session 3) in which the target was characterized by its category. In this categorical visual search task, one of these two reward-associated orientations served as a salient distractor, together with a salient target that had not been paired with reward before. Two independent factors (target's physical salience × distractor's reward salience) were manipulated for the categorical visual search task. Each of the factors has two levels (high and low), producing four experimental conditions in total.
fMRI design.
The fMRI data from the orientation weight estimation task (Session 1) were used to estimate the weights of the hypothesized orientation-tuned channels in each voxel for the forward encoding model (Brouwer and Heeger, 2009, 2011). Then, the forward encoding model was applied to collected fMRI data from V1 during the categorical visual search task (Session 3) to extract the voxels' responses at different orientation channels. This analytical approach has a major advantage in linking the changes in neural population activity directly to our manipulation of reward-associated orientations. We used the event-related design for tasks in both scanning sessions. In the orientation weight estimation task, each run contained balanced conditions of stimuli (eight orientations + fixation) and spatial locations (Fig. 1A, odd- or even-numbered locations). In the categorical visual search task, each run contained four experimental conditions and a fixation condition, with their presentation order arranged by M-sequences (Buracas and Boynton, 2002) for each observer.
Statistical analysis.
All statistical analyses on behavioral (i.e., search reaction time [RT] and accuracy) and neural indices (i.e., channel response and ROI-based activity) were performed using repeated-measures ANOVAs in SPSS 20.0 (IBM). The orientation selectivity for each voxel in V1 ROIs, as measured by ANOVA, were separately performed in MATLAB (The MathWorks; the resulting F statistics were consistent with SPSS 20.0). Multiple comparisons were corrected for simple effects by using Bonferroni method to adjust the threshold of significance (α = 0.025). The behavior-neural correlation analysis was performed using Pearson correlation, with two-tailed tests of significance.
Stimuli and apparatus
The stimuli were generated in MATLAB using Psychtoolbox. In the behavioral laboratory, the stimuli were displayed on a gray background of a gamma-corrected cathode ray tube (mean luminance: 22.5 cd/m2) monitor (refresh rate: 60 Hz; resolution: 1024 × 768). The viewing distance was 75 cm. Observers pressed the buttons on the keyboard (left and right arrow keys) to make responses. Inside the MRI scanner, the stimuli were back-projected onto a translucent screen located inside the scanner bore (refresh rate: 60 Hz; resolution: 1024 × 768). The observers viewed the stimuli at a distance of 75 cm through a mirror placed above their eyes. They were instructed to press the buttons on an MRI-compatible response box (left and right buttons).
As shown in Figure 2, half of the observers viewed a set of eight orientations (stimulus set 1: ±20°, ±45°, ±60°, ±80° relative to vertical line), whereas the other half viewed a different set of orientations (stimulus set 2: ±10°, ±30°, ±45°, ±70° relative to vertical line). The motive for using two sets of stimuli was to balance the category of target (i.e., steep and shallow) in the training and test sessions. Notably, the search target in the training session reappeared as the distractor in the test session. The stimuli used in the training and test sessions were comprised of full-contrast Gabor patches (spatial frequency: 1.68 cycles/degree; SD: 0.54°; diameter: 2.98°). The full contrast-reversing sinusoid gratings (spatial frequency: 1.68 cycles/degree; reverse rate: 4 Hz; diameter: 2.98°) at different orientations (matched with the stimuli set for the training and the test session) were used in the orientation weight estimation task.
Tasks
Orientation weight estimation task (Session 1).
Each run began and ended with an 8 and 12 s blank display, respectively. A static dot was shown at the center of the screen throughout the run. In each trial (Fig. 1A), four sinusoid gratings (diameter: 2.98°) of the same orientation were presented in four evenly spaced locations (Fig. 1A, odd- or even-numbered locations) for 2 s, interleaved with an intertrial interval of 2 or 4 s. The fixation dot flickered at a rate of 1 Hz, with 90% probability of red color and 10% probability of green color. The observers were instructed to fixate on the central dot while passively viewing the gratings, and press a button with the index finger of their right hand whenever the dot was green.
Reward learning task (Session 2).
As shown in Figure 1B, each trial started with a central fixation cross shown for 200 or 400 ms (0.6° × 0.6°). Then, a search array composed of eight Gabor patches was presented around the fixation (eccentricity: 6°) with equal spacing for 200 ms. The brief display was set to prevent eye movement. The target was a counterclockwise (CCW) or clockwise (CW) tilted Gabor singleton among the other seven horizontally or vertically oriented Gabors. Half of the observers were assigned to stimulus set 1, in which the target orientation was ±20° relative to a vertical line (i.e., steep orientation). The other half of the observers were assigned to stimulus set 2, in which the target orientation was ±70° relative to a vertical line (i.e., shallow orientation) (Fig. 2). A red or green dot was randomly placed at the center of each Gabor (four red dots and four green dots in each display). The observers were asked to identify the dot color in the target Gabor by pressing a button with their right index and middle fingers. After the offset of the search array, a blank screen was shown for 1300 ms (or until response). Correct response was followed by a high (+5 points) or low reward (+1 point) feedback displayed for 1500 ms. The CCW- and CW-tilted Gabors were shown with equal probability and were, respectively, associated with a high probability (80%) of a high reward and low reward for half the observers, and vice versa for the other half. The total reward earnings were converted to extra payment at the end of the experiment. All observers were naive to the reward-orientation contingency, and most of them reported being unaware of such contingency after completing the whole experiment.
Categorical visual search task (Session 3).
Each run began and ended with an 8 and 12 s blank display, respectively. A static red dot was shown at the center of the screen in the blank display. As shown in Figure 1C, each trial started with a central fixation cross (0.6° × 0.6°) shown for 200 (or 400 ms). A search array composed of eight tilted Gabors was presented around the fixation (eccentricity: 6°) with equal spacing for 200 ms. The eight Gabor items were always of the same titled direction (i.e., either CCW or CW). Each array consisted of a Gabor in the target orientation and a distractor Gabor in a previously rewarded orientation; the other Gabors were of the same diagonal orientation (i.e., CCW or CW 45°) serving as the background stimuli. The observers were assigned to stimulus set 1 or 2 to search for a shallow orientation (±60° or ±80° relative to vertical line) or a steep orientation (±10° or ±30° relative to vertical line), respectively. The target salience was determined solely by its physical salience that depended on the orientation contrast between the target and the background (i.e., 15° or 35°). Therefore, it could either be higher or be lower than the physical salience of the distractor (i.e., 25°). On the other hand, the distractor salience was jointly determined by the physical salience and its associated magnitude of reward. Particularly, we defined the target and distractor using the same feature dimension (i.e., orientation) to compensate for the uncontrollable low-level perceptual differences when the physical salience of the target and distractor was manipulated in different feature dimensions (e.g., color and shape). The observers were instructed to discriminate the dot color in the target Gabor, which was the same as in the reward learning task. After the offset of the search arrays, a fixation display was shown for 1300 ms to wait for a button press followed by a blank display of 300 ms (or 100 ms). No feedback was given in this task.
MRI data acquisition
Imaging data were collected from a Siemens 3T Prisma scanner, equipped with a 64-channel head coil at Peking University. For each observer and each scanning session, high-resolution anatomical images were acquired using MPRAGE T1-weighted sequence (FOV: 256 × 224 mm; resolution: 0.5 × 0.5 × 1 mm; number of slices: 192). Functional images were acquired using EPI sequence (TR: 2000 ms; TE: 30 ms; FOV: 224 × 224 mm; flip angle: 90°; matrix size: 64 × 64; resolution: 3.5 × 3.5 × 3.5 mm, gap: 0.7 mm; number of slices = 33).
fMRI data preprocessing
Imaging data were processed using BrainVoyager QX (Brain Innovation) software. Preprocessing of functional data included slice-timing correction, motion correction, temporal filtering (3 cycles), and removal of linear trends. Spatial smoothing (4 mm Gaussian filter, FWHM) was only applied to the region of interest (ROI) analysis. For each scanning session, the functional data were first aligned to their first functional slice and then coregistered to the anatomical data obtained in the same session. The anatomical images in the second session were aligned to those in the first session. For the group-level analysis, the aligned functional data were resampled into 3 × 3 × 3 mm resolution and transformed into the standardized Talairach space. Trials with incorrect responses in the categorical visual search task were eliminated from further analysis.
Forward encoding model
The forward encoding model allows us to infer how feature-specific neuronal populations are modulated by experimental manipulation, rather than the pooled activity across orientations at the voxel level (Serences and Saproo, 2012). The model assumes that each voxel contains a large number of orientation selective neurons, and the orientation selectivity of each neuron can be characterized as a linearly weighted sum of eight hypothetical orientation channels (Brouwer and Heeger, 2009, 2011). Each channel is represented by a basis function that models its idealized orientation tuning curve. In practice, each basis function is a sinusoid raised to the sixth power to approximate the tuning function of V1 neurons (Schiller et al., 1976; Swindale, 1998), with the orientation preference selected from one of the stimulus sets. The reconstruction of channel responses followed three stages. First, we defined the ROIs in V1 that corresponded to the eight locations in the search array. Second, we estimated the channel weights for each voxel within the ROI. Third, we calculated the channel responses associated with each of the orientations in the stimulus set.
ROI definition.
For each observer, we defined the location-specific V1 ROIs by selecting the voxels in V1 that responded to the stimulation at each of the eight locations using data from retinotopic mapping and orientation weight estimation runs (Fig. 3A). To define the V1 boundary, we used the standard phase-encoded method (Sereno et al., 1995; Engel et al., 1997) in which observers viewed rotating wedge that created traveling waves of neural activity in the visual cortex. To identify eight corresponding locations within V1, we applied a GLM to the data collected during the orientation weight estimation runs (Session 1). There were two regressors in the GLM that corresponded to two conditions where the stimuli were displayed at odd- or even-numbered locations. We mapped separate voxel neighborhoods in V1 that were significantly modulated by one of the two conditions as the location-specific ROIs.
Channel weights.
For each observer, we used functional data collected during the orientation weight estimation runs (Session 1) to compute the channel weights for the forward encoding model. First, we estimated the hemodynamic impulse response functions (HIRFs) for each voxel in the ROIs with the deconvolution method (Dale, 1999) (Fig. 3B). The design matrix consisted of two conditions of the stimulus location (i.e., odd- or even-numbered locations) concatenated across runs, regardless of the orientations. HIRFs were calculated by multiplying the pseudoinverse of this design matrix with the preprocessed fMRI response time courses. Next, we estimated the response amplitudes (β) to each orientation separately for each voxel and each run. The regression matrices were constructed by convolving the location-specific HIRF with a binary time course corresponding to the onset of each of the eight orientations. The response amplitudes were calculated by multiplying the pseudoinverse of the regression matrices with the preprocessed fMRI response time courses, and then normalized (z-score) separately for each voxel and for each run. To enhance the signal-to-noise ratio, we selected a subset of informative voxels that responded most differentially across orientations (Brouwer and Heeger, 2011). For each voxel, we performed an ANOVA on its normalized responses across eight orientations, with the number of runs as the repeated measurements. Each voxel's orientation selectivity can be denoted by F statistics (i.e., a larger F-statistic indicates higher selectivity). Because of the small physical size of each location in visual space, we were able to identify only a small neighborhood of voxels in V1 for each location. We selected 30 voxels (i.e., the minimal amount across ROIs and observers) with the largest F statistics for further analysis (Fig. 3C), controlling for possible differences in variation in the number of voxels.
Let m be the number of selected voxels (i.e., 30), n1 be the number of repeated measurements (i.e., 8 orientations × 6 runs), and k be the number of hypothetical channels. As shown in Figure 3D, we constructed the matrix of selected voxel responses that corresponded to each of the orientations in the stimulus set (B1, m × n1) (response matrix). The orientation tuning model was comprised of the matrix associated with the hypothetical channel responses (C1, k × n1). The least-squares estimates were used to compute the channel weights matrix (W, m × k) as follows:
Channel responses.
For each observer, we used functional data from the categorical visual search task (Session 3) to compute the channel responses. For each trial, we obtained the response amplitudes from the selected 30 voxels in ROIs corresponding to the target- and distractor-related locations separately. We constructed a matrix of voxel response by extracting the peak amplitude of each voxel (6 s after the stimulus onset, see Fig. 3B), using their coordinates in normalized space. Let n2 be the number of trials, and the channel responses (C2, k × n2) associated with the distributed pattern of activity during the categorical visual search task were estimated based on the matrix of voxel responses (B2, m × n2) (Fig. 3D, voxel response) and the weights matrix acquired from the first stage of analysis. After this process, we separately grouped the responses of channels preferring the target and distractor orientations after sorting the trials into four experimental conditions as follows:
Midbrain ROI analysis
To test the possible role of dopaminergic modulation in the categorical visual search task (Session 3), we drew the ventral tegmental area (VTA) on the normalized high-resolution MR image for each observer by hand. Using Mango image processing software (http://ric.uthscsa.edu/mango/), we drew the VTA in the axial section (Fig. 4E, left) following the descriptions of boundaries by Ballard et al. (2011) and created volumes of interest. The mean volume of the VTA across observers was 156 ± 19.7 mm3. The mean coordinates for the VTA in MNI/Talairach space were as follows: (left) [−2.0, −18.2, −14.5]/[−2.1, −18.2, −10.4] and (right) [2.4, −18.8, −14.3]/[1.7, −18.2, −10.7]. The estimated volume and coordinates were similar to previously reported results (Eapen et al., 2011; Murty et al., 2014).
Results
Behavioral results
During the reward learning task, observers' performance improved over time for both the high and low reward conditions (Fig. 4A). We grouped the data from this task into 5 time bins, one for each run. The repeated-measures ANOVA (reward condition × time bin) revealed a main effect of time bin for search accuracy (F(4,116) = 23.23, p < 0.001, ηp2 = 0.45) and a tendency for faster RTs (F(4,116) = 2.19, p = 0.074, ηp2 = 0.07). These results indicated the observers' active involvement in the task. However, the influence of reward was not shown either as a main effect (p > 0.6 for search accuracy and RTs) or an interaction effect (p > 0.8 for search accuracy and RTs). Studies that used similar reward associative learning paradigms have shown mixed results regarding the reward effect during training. The influence of reward could be observed on the mean training performance (Sha and Jiang, 2016; Anderson et al., 2016), in the last block of training (Anderson et al., 2014), or on the second day of training (Laurent et al., 2015), whereas similar paradigms could also lead to nonreward effects during training (Anderson et al., 2011b; Chelazzi et al., 2014; Gong and Li, 2014). Despite the large variance in reward effects during training across studies or within the same study across experiments (Gong et al., 2016; Rajsic et al., 2017), all of these studies reported findings of significant reward effects on multiple test tasks.
Although the effect of reward was not shown during training, its impact on the test session confirmed the observers' acquisition of reward-orientation associations. The mean behavioral accuracy in the categorical visual search task was 72.8%, confirming that the task had an intermediate level of difficulty. Figure 4B (left) shows the search RTs of the correct trials during the categorical visual search task. We entered the RT data into a repeated-measures ANOVA (target's physical salience × distractor's reward salience). The main effects of reward salience (F(1,29) = 4.45, p = 0.044, ηp2 = 0.13) and physical salience (F(1,29) = 185.49, p < 0.001, ηp2 = 0.86) were significant. Additionally, the two-factor interaction was marginally significant (F(1,29) = 4.16, p = 0.051, ηp2 = 0.13). Simple effect analysis suggests that the suppression of the reward-associated distractor was critically dependent on the physical salience of the target (low salience: F(1,29) = 8.66, p = 0.006, ηp2 = 0.23; high salience: F(1,29) = 0.09, p = 0.769, ηp2 = 0; Bonferroni-corrected threshold: α = 0.025). That is, when the target was physically less salient than the distractor (15° vs 25°), there was stronger suppression over the distractor with high reward salience. On the contrary, when the target was physically more salient than the distractor (35° vs 25°), the search RTs of the trials with high- and low-reward associated distractors were indistinguishable. This finding is consistent with our hypothesis that suppression of a distractor with reward salience occurs if attending to the target is insufficient for solving perceptual competition. On the other hand, the effect induced by target-related physical salience was independent of the distractor's reward salience (low salience: F(1,29) = 141.79, p < 0.001, ηp2 = 0.83; high salience: F(1,29) = 143.74, p < 0.001, ηp2 = 0.83; Bonferroni-corrected threshold: α = 0.025).
A repeated-measures ANOVA was also used to examine search accuracy (Fig. 4B, right). The results showed a main effect of reward salience (F(1,29) = 4.67, p = 0.039, ηp2 = 0.14) and physical salience (F(1,29) = 49.40, p < 0.001, ηp2 = 0.63). No interaction was found between the two factors (F(1,29) = 0.004, p = 0.949, ηp2 = 0). These results demonstrated that the behavioral benefits related to our reward manipulation were not due to a speed-accuracy tradeoff. The interaction effect in RT also ruled out the possibility of an arousal effect induced by the appearance of high reward-associated distractors, allowing the behavioral data to be best accounted for by the suppression of the reward-associated distractor.
Forward encoding model: the suppressed channel response toward a reward-associated distractor
We were motivated to use the forward encoding model for two reasons. First, each voxel contained neurons that were tuned to a wide range of orientations, whereas our reward manipulation was only related to two particular orientations. Examining the channel-based signal change at the neuronal population level improves the signal-to-noise ratio compared with voxel-level analysis. Second, the suppression of the reward-associated orientation might, at the same time, decrease the response of the neurons that were tuned to that orientation and increase the response of the neurons that were tuned to other orientations (Martinez-Trujillo and Treue, 2004). These response changes may cancel each other out and lead to negligible overall changes at the voxel level.
For each observer and each trial, we separately calculated the channel responses in the ROIs corresponding to the locations of the target and distractor. The averaged channel responses for each experimental condition were computed and entered into repeated-measures ANOVAs (target's physical salience × distractor's reward salience). In the distractor-related ROI (Fig. 4C), we found that the channel preferring the distractor orientation showed a significant main effect of reward salience (F(1,29) = 4.77, p = 0.037, ηp2 = 0.14) and a significant interaction between the two factors (F(1,29) = 7.87, p = 0.009, ηp2 = 0.21), whereas the main effect of physical salience did not reach significance (F(1,29) = 1.13, p = 0.296, ηp2 = 0.04). Simple effect analysis revealed a reward-driven reduction of channel responses when the distractor was physically more salient than the target, but not vice versa (low target salience: F(1,29) = 11.67, p = 0.002, ηp2 = 0.29; high target salience: F(1,29) = 0.03, p = 0.873, ηp2 = 0.001; Bonferroni-corrected threshold: α = 0.025). The sign of the channel response values in V1 (Fig. 4C) could not provide direct indication for either enhanced or suppressed responses of specific channels due to the normalization of neural responses in the data preprocessing. The relative values between the two levels of reward condition were of the interest for these channel responses. These results suggest that the two factors (i.e., physical salience and reward salience) interacted to jointly determine the outcome of attentional competition. For the channel that preferred the target orientation, we found no significant main effect of reward salience or two-factor interaction, and only a marginally significant effect of physical salience (F(1,29) = 3.84, p = 0.060, ηp2 = 0.12).
These results are well aligned with the explanation for behavioral performance in the categorical visual search task, indicating a potential correlation between behavioral effects and the effects in channel responses. To test this possibility, we calculated the differences between two levels of reward condition (i.e., high vs low distractor's reward salience) separately for the search RT and distractor-related channel responses under two levels of physical salience condition (i.e., low and high target's physical salience). A significant correlation was found in the low target salience condition (r = 0.386, p = 0.035, two-tailed Pearson correlation; Fig. 4D), whereas such correlation was not shown in the high target salience condition (r = −0.26, p = 0.165, two-tailed Pearson correlation).
Similar repeated-measures ANOVAs were conducted on the channel responses in the target-related ROIs. There were no effects of channel responses that were tuned to the target and distractor orientations (for all comparisons, p > 0.16). To confirm the specificity of the response reduction for the distractor orientation, we analyzed the channel responses for neutral distractors in the search arrays (i.e., CCW and CW 45°) separately in the target- and distractor-related ROIs. None of these channels showed significant differences across experimental conditions (for all comparisons, p > 0.11).
Together, the reward-induced reduction of channel response was specific to the high reward-associated orientation channel in the distractor-responsive location in V1. The changes in channel responses can be seen as the neuronal solution to weaken the competition of the reward-associated distractor in a very early stage of visual processing. The distractor-related channel responses that covaried with behavioral performance reflected the modulation of reward depending on the target-distractor competition. Importantly, the outcome of the competition was demonstrated with the significant interaction effect of the channel responses. Specifically, in the high target salience condition, the target was likely to dominate the competition with the distractor because it was favored by both the task goal and physical salience. Therefore, regardless of the reward salience, the representation of the distractor was inevitably weakened under this condition, as can be observed in the behavioral results in Figure 4B (faster RTs) and channel responses in Figure 4C (smaller responses). On the other hand, when the target was only favored by the task goal in the low target salience condition, the representation of the distractor was supposed to be higher due to its high physical salience under this condition. This difference may contribute to the higher response for the low reward distractor in the low target salience condition. Meanwhile, there was better suppression of the high-reward associated distractor that reduced its channel response to a level that was similar to the high target salience condition. These results suggest that the representation of the distractor can be easily overridden when the target was physically more salient, whereas active suppression over distractors occurred when the distractor was more salient in terms of physical salience and reward association.
ROI analysis: activation in the midbrain dopaminergic area
To test whether dopaminergic modulation contributes to the observed reward effect, we compared the VTA activity across experimental conditions. Consistent with the pattern of results in behavior and forward encoding models (Fig. 4E), we found a significant interaction effect of VTA activity in its peak response (i.e., 4 s after stimulus onset): (F(1,29) = 5.73, p = 0.023, ηp2 = 0.17). Further simple effect analysis revealed a reward-enhanced response depending on the salience of the target (low target salience: F(1,29) = 6.75, p = 0.015, ηp2 = 0.19; high target salience: F(1,29) = 0.46, p = 0.503, ηp2 = 0.02; Bonferroni-corrected threshold: α = 0.025). This result implies that the midbrain activity related to reward salience contributed to the behavioral suppression effect in the low target salience condition.
Discussion
The present study provided direct evidence that can account for the contradictory results in the literature regarding the suppression of reward salience. Our results showed that the suppression of a reward-salient distractor was evident if its physical salience was higher than that of the target, indicating the important role of target-distractor competition in the suppression mechanism. Critically, the high reward-salient distractor was better suppressed in the low target salience condition, probably because the top-down target selection was not sufficient to win the perceptual competition easily. This behavioral suppression effect was accompanied by weakened competition from the neuronal population in V1 that responded to the distractor with the high reward-associated orientation. On the other hand, suppression of reward salience was not observed when it was unnecessary in the high target salience condition, in which the target dominated the competition with the distractor and was favored by both top-down and bottom-up attentional selection.
Our results demonstrate that, when the distinction between the target and distractor is unambiguous (e.g., categorically separable), top-down suppression of distractors can take place if necessary. These findings agree with current literature in two ways. First, the results are well aligned with the attention model based on inhibition that suggests stronger suppression for more potent distractors (Houghton and Tipper, 1994; Houghton et al., 1996). The recent electrophysiological results have demonstrated that this is indeed a possible mechanism used by the human brain to facilitate visual search performance when a distractor is highly salient (Gaspar and McDonald, 2014; Gaspar et al., 2016; Weaver et al., 2017). Second, the results dovetail with the recent proposal of the signal suppression hypothesis of controlled attentional capture (Sawaki and Luck, 2010, 2011; Sawaki et al., 2012). This theory emphasizes the situation where the physically salient item could be better suppressed due to the active suppression of salience-induced attentional priority (Geng and Diquattro, 2010; Gaspelin et al., 2015), the rationale of the reversed salience effect is the dominant control of top-down attention. In our study, the observers were aware of the attribute of the target (e.g., the steep orientation) and the to-be-ignored distractor (e.g., the shallow orientation), making the top-down control ready to initiate the suppression mechanism. However, an initial attraction to the distractor with high reward salience followed by strong suppression could be an alternative mechanism for our findings, as indicated by a study using time-sensitive measures of eye movement (Hickey and van Zoest, 2012). Notably, the suppression of reward salience reduced its impairment on performance did not eliminate the physical salience effect on the search performance. It was the task goal, physical salience, and the reward history that worked together to modify the priority map for attentional selection (Awh et al., 2012).
Apart from the theoretical implications for the suppression of reward salience, our results can also be reconciled with the empirical studies that did not observe this suppression effect (Hickey et al., 2011; Anderson et al., 2014; Failing et al., 2015; Le Pelley et al., 2015). Particularly, the search target in these test tasks was always defined by feature singleton (i.e., search for a unique shape), allowing visual search to be dominated by the singleton detection mode. In contrast to these studies where there was not a role for top-down control of attention, we used a feature-based search task with the target identity defined by its categorical feature. Critically, the search arrays consisted of two physically salient items that disabled the use of singleton detection mode. These settings encouraged cognitive control to play a role in the adjustment of selective attention (Bacon and Egeth, 1994). In support of this notion, a physically salient distractor that consistently captured attention in a singleton-based visual search task became actively suppressed when switching the task to a feature-based search (Gaspelin et al., 2015). Therefore, the distinction in the search mode could also be a main source of these seemingly contradictory findings.
Neurophysiology and functional brain imaging studies related to perceptual competition among multiple stimuli have suggested that there is mutual suppression of the neural responses at the level of the receptive field (Moran and Desimone, 1985; Kastner et al., 1998; Reynolds et al., 1999) and feature-based suppression across the whole visual field (Moher et al., 2014; Störmer and Alvarez, 2014). Meanwhile, theories of attention suggest that biased activation due to the allocation of attention starts as early as in V1 if more than one item competes for priority in the visual field (Motter, 1993; Ito and Gilbert, 1999). In the present study, we took advantage of the forward encoding model with its assumption about the underlying relation between the orientation-tuned neuronal populations and the measured fMRI signals. With this model, we were able to detect the population-level neuronal changes specific to the reward-associated orientations in V1. These results provided clear evidence for feature-based competition between retinal locations at V1. Furthermore, our results are also consistent with the previous studies that reported attentional suppressive effects in V1 and subcortical areas (Gouws et al., 2014). Critically, our findings suggest that the weakened channel responses to the distractor with reward salience were correlated with reduced behavioral costs in search performance, confirming the central roles of reward salience and target-distractor competition in attentional suppression.
Substantial evidence suggests the important role of the midbrain dopamine neurons in stimulus–reward associative learning (Schultz et al., 1997). Our finding of the patterns for midbrain activity in VTA and the channel response in V1 suggests that greater midbrain activation led to stronger suppression in sensory areas. The possible account for such a VTA-V1 relationship could be drawn from the literature that has suggested the central role of reward-related signals in mediating higher-order cognitive functions (Frank et al., 2001). In the high reward distractor and low target salience condition, our observations of stronger activity in VTA may enhance the dopaminergic projection to the prefrontal cortex, enabling strengthened top-down control of attention effect in visual areas (Noudoost and Moore, 2011). Thus, the weakened sensory representation of the reward salient distractor as reflected in V1 channel responses could be ascribed to the increased top-down suppression triggered by midbrain modulation. This interpretation also parallels with the findings of reward effects in V1 activity in rats (Shuler and Bear, 2006), monkeys (Stănişor et al., 2013), and humans (Serences, 2008). Nevertheless, the present data were not sufficient to draw a direct inference between the patterns of activity in VTA and channel responses in V1. Further studies with specialized designs are required to verify this account.
The recent findings of automatic attraction of attention by reward-associated items (Della Libera and Chelazzi, 2009; Peck et al., 2009; Raymond and O'Brien, 2009; Anderson et al., 2011a; Hickey and van Zoest, 2012) are in accordance with the classical idea in learning theory that the stimulus with high predictiveness of reward or important events receives attentional priority (Mackintosh, 1975; Le Pelley et al., 2016). This principle of associative learning has been suggested to play important roles in shaping multiple cognitive functions, such as working memory (Gong and Li, 2014; Wallis et al., 2015; Thomas et al., 2016) and long-term memory (Murayama and Kitagami, 2014). Moreover, it has been demonstrated that, by holding reward-associated cues in working memory, cognitive control could be selectively modulated by reward to enhance distractor suppression accompanied with the increased amplitude of the Pd component (Sawaki et al., 2015) or frontal theta oscillation (Gong et al., 2016). In a closely related fMRI study, reward was associated with a specific category in naturalistic scenes (Hickey and Peelen, 2015). Using the multivoxel pattern analysis approach, they found decreased information for scenes with reward-related distractors compared with reward-neutral distractors in the object-selective visual cortex. Their study and ours shared key findings in demonstrating neural suppression in the visual cortex of reward salient distractors and the reward-driven changes in midbrain activity. Nevertheless, our results differed from those of Hickey and Peelen (2015) in two main aspects. First, whereas they did not directly control the relative physical salience between the target and distractor, our study included this factor and found a novel result that suggests a perceptual competition-dependent effect of suppression of reward-associated distractors. Second, Hickey and Peelen (2015) demonstrated the reduced behavioral cost accompanied with the suppressed information representation of distractors in the object-selective cortex. Our study, using the lower level visual feature of orientation, showed that the behavioral benefit of high reward versus low reward-associated distractors was realized at the initial stage of visual processing in V1. These two studies complement each other and suggest the implementation of suppression of reward salience in early visual pathways. To our knowledge, the results from the present study demonstrate, for the first time, the weakened representation of reward-associated distractors in V1 through enhanced suppression.
Footnotes
This work was supported by the National Natural Science Foundation of China 31230029, 31470974, and 31271081.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Sheng Li, School of Psychological and Cognitive Sciences, Peking University, 5 Yiheyuan Road, Haidian, Beijing, 100871, China. sli{at}pku.edu.cn