Abstract
When perceptual decisions are coupled to a specific effector, preparatory motor cortical activity may provide a window into the dynamics of the perceptual choice. Specifically, previous studies have observed a buildup of choice-selective activity in motor regions over time reflecting the integrated sensory evidence provided by visual cortex. Here we ask how this choice-selective motor activity is modified by prior expectation during a visual motion discrimination task. Computational models of decision making formalize decisions as the accumulation of evidence from a starting point to a decision bound. Within this framework, expectation could change the starting point, rate of accumulation, or the decision bound. Using magneto-encephalography in human observers, we specifically tested for changes in the starting point in choice-selective oscillatory activity over motor cortex. Inducing prior expectation about motion direction biased subjects' perceptual judgments as well as the choice-selective motor activity in the 8–30 Hz frequency range before stimulus onset; the individual strength of these behavioral and neural biases were correlated across subjects. In the absence of explicit expectation cues, spontaneous biases in choice-selective activity were evident over motor cortex. These also predicted eventual perceptual choice and were, at least in part, induced by the choice on the previous trial. We conclude that both endogenous and explicitly induced perceptual expectations bias the starting point of decision-related activity, before the accumulation of sensory evidence.
Introduction
Prior beliefs about the visual world can strongly bias perception and visually guided behavior. For example, statistical regularities of observed motion direction alter the perception of new motion directions, inducing an attractive bias in the perceived direction (Chalk et al., 2010). It is, however, largely unclear how and at what processing stages these biases affect the perceptual decision.
Perceptual decisions can be formalized as a process of the accumulation of sensory evidence toward a decision bound (Usher and McClelland, 2001; Smith and Ratcliff, 2004; Bogacz et al., 2006; Ratcliff and McKoon, 2008). Within this scheme, prior beliefs could impact the decision at several stages.
First, expectation could alter the state of the decision variable already before the onset of accumulation of sensory evidence. Second, expectation could change the gain (or rate) of accumulation toward the expected outcome. Third, expectation could selectively lower the amount of evidence required for the expected outcome (i.e., the decision bound). In this study, we set out to test whether expectation changes the starting point of choice-selective neural activity.
When decisions are coupled to an effector, previous studies have shown that the ongoing decision process may be “read out” by monitoring activity in neural structures that are involved in preparing the associated motor response (Gold and Shadlen, 2007; Heekeren et al., 2008; Donner et al., 2009; Gould et al., 2012). In monkeys, such accumulation processes have been extensively documented in oculomotor regions of parietal and prefrontal cortex, when perceptual decisions were coupled to saccadic eye movements (Shadlen and Newsome, 1996; Schall, 2001; Yang and Shadlen, 2007). Similarly, recent studies in humans have observed evidence of accumulation-related activity in response-related regions when the perceptual decision was coupled to specific effectors (Tosoni et al., 2008; Donner et al., 2009; Gould et al., 2012), in line with the proposition that there is substantial “leakage” of perceptual and cognitive states into the motor system (Song and Nakayama, 2009). Therefore, motor activity could potentially provide a window into the dynamics of the perceptual decision process.
Using these neuronal markers, neurophysiological and neuroimaging studies have garnered some empirical support for both changes in starting point (Shadlen and Newsome, 2001; Churchland et al., 2008; Preuschhof et al., 2010; Rao et al., 2012) and sensory gain (Rahnev et al., 2011) as a result of biases and learning, but it remains an open question how perceptual decision making is altered by explicitly inducing prior expectations.
Here we examine this issue by investigating how activity in choice-selective motor activity before stimulus onset is affected by perceptual expectations. Subjects made decisions about the direction of motion in moving random-dot displays. We induced perceptual expectations by providing probabilistic cues about motion direction and tracked the effect on the decision variable by coupling different perceptual decisions to different motor responses and measuring motor cortical activity before and during the perceptual decision. We also investigated whether spontaneous fluctuations in prestimulus motor cortical activity could affect perceptual choice, in the absence of explicit expectation cues, and we sought to link these spontaneous biases to sequential effects of choice behavior over trials (Gao et al., 2009).
Materials and Methods
Participants.
Nineteen healthy participants (12 male, 7 female; age, 25 ± 4.7 years, mean ± SD) participated in the experiment. None of the participants had a history of neurological or psychiatric disorders. All participants had normal or corrected-to-normal vision. The study was approved by the local ethics committee at Radboud University Nijmegen, and a written informed consent was obtained from the subjects according to the Declaration of Helsinki.
Stimuli.
Target stimuli were composed of white dots (density = 2.4 dots/°2; speed = 6°/s) presented on a black annulus (outer circle radius = 10°; inner circle radius = 1°). Stimuli were projected from a calibrated LCD projector (situated outside the magnetically shielded room) onto a back-projection screen through a tube by a mirror system. The refresh rate of the projector was 60 Hz. The direction of net motion was either leftward or rightward. For each frame, a random subset of the dots was chosen to carry the coherent motion. Incoherent dots moved randomly with the same speed as the coherent dots. When dots moved off the annulus, they were replotted at a random location within the annulus. We used three levels of motion coherence (low, intermediate, high). The intermediate coherence level was individually chosen to produce ∼75% correct responses in the absence of a cue (mean proportion of coherently moving dots = 11.1%, SD = 3.3%, range = 4.8–18.4%). The low motion coherence was fixed at 50% of the intermediate coherence (mean proportion of coherently moving dots = 5.5%), while the high motion coherence was 200% of the intermediate coherence (mean proportion of coherently moving dots = 22.2%).
A small fixation square was presented for the duration of the trial, and subjects were required to maintain fixation on it. The stimuli were presented on a gray background and were generated using Psychophysics Toolbox (Brainard, 1997) in MATLAB (MathWorks).
Experimental design.
Each trial began with the presentation of a cue. The cue was either predictive (“left” or “right,” each 33% of all trials) or nonpredictive (“neutral,” 33% of all trials) for the upcoming net direction of stimulus motion. The predictive cues correctly indicated the upcoming motion direction on 75% of the trials. We refer to these trials as “valid.” In the remaining 25% of the trials, the predictive cues indicated the wrong direction of motion. We refer to these trials as “invalid,” meaning invalidly cued. The nonpredictive cues were followed by leftward and rightward motion equally often. The nonpredictive cues are referred to as neutral in the rest of the paper. Thus, the optimal strategy in this task (in terms of maximizing percentage correct) was to apply a directional bias corresponding to the cue (i.e., zero bias for neutral cues). Subjects were fully informed about the above contingencies and were encouraged to take the cue into account when making their perceptual decisions.
The cue consisted of one of the words left, right, or neutral that was presented for 200 ms followed by a fixation dot that lasted between 800 and 1300 ms. Then, the motion stimulus appeared until subjects provided their response or 3 s had elapsed. Subjects responded with the index finger of their left hand when they perceived leftward motion and with the index finger of their right hand when they perceived rightward motion. After the response, there was an intertrial interval of between 2 and 2.5 s.
Before the magneto-encephalography (MEG) experiment, each subject took part in a 30 min training session, in which the subject performed 138 trials, distributed over six training blocks, to establish motion coherence thresholds (see Stimuli). The first three training blocks (in a total of 30 trials) had a fixed and high coherence to get subjects acquainted with the task demands. Then, based on the performance on the third block (of 10 trials), the coherence level was chosen for the fourth block (24 trials), and so on for the fifth (24 trials) and the final sixth (60 trials) block. This method of thresholding ensured a reasonably stable performance for the medium coherence (70.0 ± 12.8%, mean ± SD), within the time constraints posed by the experiment. During the experiment, the presentation of motion coherence was pseudo-randomized for each subject such that each coherence level appeared equally often with each combination of cue identity and motion direction. The experiment consisted of six blocks of 144 trials, with predictive and nonpredictive trials randomly intermixed.
Behavioral analysis.
We computed reaction times and error rates, as well as the signal detection theoretic measures d′ and c (Green and Swets, 1966; Macmillan and Creelman, 2005). d′ quantifies a subject's stimulus sensitivity (in signal-to-noise ratio units), whereas c quantifies a subject's decision criterion (a nonzero c value implies a bias). These measures were calculated on the basis of the proportion of leftward choices on leftward and rightward motion trials (analogous to hits and false alarms in a detection task). To quantify the effect of the cue on sensitivity and criterion, we computed d′ and c separately for trials preceded by left, right, or neutral cues. We refer to these measures for the left cue trials as d′left and cleft. Similarly, we refer to these measures for the right cue trials as d′right and cright. Since we were not interested in general performance differences between leftward and rightward motion, we then averaged across these motion types to obtain d′predictive, which reflects the discriminability of the motion stimuli in the context of a predictive cue. On the other hand, cleft and cright are the biases for answering “leftward” when a leftward or rightward cue was presented, respectively. Thus, we expected cleft to be negative (corresponding to answering leftward >50% of the time), and cright to be positive (corresponding to answering leftward <50% of the time). Criterion shift was defined as cright − cleft, i.e., the difference of how much the cues were able to move subjects' criterion for picking one response option over the other. The bigger this shift was, the more the subject adjusted their behavior based on the predictive cue. Reaction times and error rates were subjected to a 3 × 3 repeated-measures ANOVA with the factors cue validity (valid, invalid, neutral) and motion coherence (low, medium, high). d′ and c were subjected to a 2 × 3 repeated-measures ANOVA with the factors cue type (predictive, neutral) and motion coherence (low, medium, high). To further qualify the direction of the criterion shift for left- and right-cued trials, we also quantified the presence and direction of the criterion shift for each cue type separately, using post hoc t tests. For each comparison, we used a significance level of p < 0.05. Although it could be argued that the fact that we perform several statistical tests may warrant a correction for multiple comparisons, we elected to report individual significance levels, as we are not particularly interested in the null hypothesis that none of the effects would be significant, but rather treat each test as an independent question.
MEG measurements.
Ongoing brain activity was recorded using a whole-head MEG with 275 axial gradiometers (CTF Systems). Data were collected at 12 kHz, and down-sampled to 1200 Hz. To prevent aliasing, an eighth-order elliptic infinite impulse response filter was used with a cutoff at one-fourth of the sampling frequency, a 0.1 dB pass band ripple, and 120 dB attenuation at the Nyquist frequency as an anti-aliasing filter. Head localization was monitored continuously during the experiment using coils that were placed at the cardinal points of the head (nasion, and left and right ear canal). The magnetic fields produced by these coils were used to measure the position of the subject's head with respect to the MEG sensor array. In addition to the MEG, the electrooculogram (EOG) was recorded from the supraorbital and infraorbital ridges of the left eye for the subsequent artifact rejection. Also, the electromyogram (EMG) and electrocardiogram (ECG) were recorded using 10-mm-diameter Ag–AgCl surface electrodes. EMG electrodes were placed on the left and right forearm, in a belly–tendon arrangement, following standard skin preparation.
MEG data analysis.
All MEG data analysis was performed using the FieldTrip toolbox developed at the Donders Institute for Brain, Cognition, and Behavior (Oostenveld et al., 2011) using Matlab 7 (MathWorks). Data were checked for artifacts using a semiautomatic routine that helped detecting and rejecting eye blinks, muscle artifacts, and jumps in the MEG signal caused by the SQUID electronics, using a two-step procedure. First, an infomax ICA algorithm (Delorme and Makeig, 2004) was used to decompose the signal into maximally independent sources, and components that were related to heartbeat and eyeblink artifacts were removed from the data. These components were identified pseudo-automatically, on the basis of their temporal correlation with the ECG and EOG signals, as well as their stereotypical topography. Subsequently, all data epochs of interest (between 2 s before stimulus onset until 500 ms after response onset) were visually inspected, and those contaminated by artifacts were removed. This inspection procedure was blind to condition and was based on the variance of the signal for each trial. On average, 2.4 ± 1.8% (mean ± SD) of all trials were removed by this procedure.
An estimate of the planar gradient was calculated (Bastiaansen and Knösche, 2000). The horizontal and vertical components of the planar gradients were calculated for each sensor using the signals from the neighboring sensors, thus approximating the signal measured by MEG systems with planar gradiometers. The planar field gradient simplifies the interpretation of the sensor-level data because the maximal signal typically is located above the source (Hamalainen et al., 1993). For each sensor and each single trial, we calculated time–frequency representations (TFRs) of the MEG power using a Fourier transform approach applied to short sliding time windows (Mitra and Pesaran, 1999). One (low frequencies) or three (high frequencies) “tapers” were multiplied with the data for each time window, and the result was submitted to a Fourier transform. Power estimates were averaged across tapers. The power values were calculated for the horizontal and vertical components of the planar gradient and then summed. We used the median of the planar gradient power estimates for all trials within a condition as our measure of MEG power for a given location in sensor space. We used the median because as a summary statistic it is less sensitive to extreme values (outliers) than the mean, i.e., it is a more robust statistic. For the lower (5–35 Hz) frequency range, we used a single Hanning taper and applied an adaptive time window (T) of four cycles for each frequency (ΔT = 4/f) which resulted in an adaptive smoothing of Δf = 1/ΔT. For the higher (35–140 Hz) frequency range, we used a fixed taper length of 200 ms with a Δf = 20 Hz frequency smoothing (Percival and Walden, 1993). The percentage of change in power was calculated with respect to the frequency-specific power during a baseline window, which was centered ∼500–300 ms before the presentation of the cue. The window length of the baseline interval was equal to the window length of each estimated time point of interest (i.e., four cycles for the low-frequency range, and fixed at 200 ms for the higher frequency range).
TFRs of motor lateralization of power (sensors overlying left vs right motor cortex, see below) or occipital power (sensors overlying visual cortex, see below) were statistically compared between different conditions using nonparametric cluster-based permutation t tests (Maris and Oostenveld, 2007). This type of test controls the type I error rate in the context of multiple comparisons by identifying clusters of significant differences over space, time, and/or frequency instead of performing a separate test on each sensor, sample, and frequency pair. For the quantification of motor-related lateralization (Fig. 1A), we identified the 30 channels (15 left and 15 right channels) that showed strongest lateralization in low-frequency (8–30 Hz) power during the 500 ms preceding the choice, when contrasting left with right motor cortex, in line with earlier work (Donner et al., 2007, , 2009). Quantification on the basis of high-frequency (60–90 Hz) power resulted in a highly overlapping topography, with power values of opposite sign (Fig. 1A, compare top, bottom; spatial correlation r = −0.79, p < 0.001). For the quantification of occipital activity modulations (Fig. 1B), we identified the 30 channels that showed the overall strongest increase in high-frequency (60–140 Hz) oscillatory power during the stimulus period (200–500 ms after stimulus onset), in line with an earlier study (Siegel et al., 2007).
For both analyses, we averaged over the spatial (channel) dimension, based on independent localization of the relevant channels. As such, our statistical analysis considered two-dimensional (spectro-temporal) clusters. All cluster-level statistics, defined as the sum of t values within each cluster, were evaluated under the permutation distribution of the maximum (minimum) cluster-level statistic. This permutation distribution was approximated by drawing 1000 random permutations of the observed data. The obtained p values represent the probability under the null hypothesis (no difference between the conditions) of observing a maximum (minimum) cluster-level statistic that is larger (smaller) than the observed cluster-level statistics. We used this method to assess whether there were significant spectro-temporal (TFR) clusters of differential activity.
To assess whether there was a relationship between the behaviorally observed bias induced by the expectation cue and the neurally observed bias, we performed several correlation analyses. For the motor lateralization, we computed the motor lateralization difference between “expect left” and “expect right” trials for each individual subject. A larger value corresponds to a bigger difference in lateralization between expect left and expect right trials. Similarly, for the occipital alpha power difference, we calculated the magnitude of the power difference between “expectation” and “no-expectation” trials (the difference between cued and neutral conditions). These measures were correlated with the behaviorally observed criterion shift, calculated by subtracting the criterion used for expect left from the criterion used for expect right trials (larger values therefore correspond to subjects that were more strongly biased by the cues).
Finally, we assessed whether motor lateralization showed a modulation by motion coherence after stimulus onset (Donner et al., 2009). To compare the accumulation rates for different levels of coherence with maximal sensitivity, we averaged across trials with leftward and rightward motion, taking into account the sign of the motor lateralization. We then calculated and compared the slopes of motor lateralization after stimulus onset (in the 200–500 ms poststimulus), as well as before response onset (in the 500 ms window preceding the response), using a linear regression approach.
Results
Behavioral results
As expected, subjects were less error prone and faster during trials with larger motion coherence [error rate (ER): F(2,36) = 54.6, p < 0.001; reaction time (RT): F(2,36) = 18.2, p < 0.001; Figure 2A,B]. Subjects were also less error prone and faster when the direction of the motion in the stimulus matched the prior expectation (ER: F(2,36) = 47.3, p < 0.001; RT: F(2,36) = 10.3, p < 0.001). The effects of cue validity on RT were slightly larger when motion coherence was high (motion × cue: F(4,72) = 2.64, p = 0.041), while the effects of cue validity on ER were larger when motion coherence was low (motion × cue: F(4,72) = 22.5, p < 0.001).
We next investigated the effect of the predictive cues on discrimination sensitivity and bias, as measured with the signal detection measures d′ and c We computed d′predictive using all the valid and invalid trials and d′neutral using all the neutral trials. The key point to note about the procedure is that to compute d′predictive we separated the predictive (valid and invalid) trials into two categories: one in which the cue indicated that the upcoming stimulus was likely to be leftward (d′left) and one in which the cue indicated a likely rightward stimulus (d′right). Then, d′predictive was simply the average of d′left and d′right (see Materials and Method for more details). Similarly, we computed the signal detection measure of bias c for predictive (cpredictive = cright − cleft) and neutral (cneutral) trials.
As expected, d′ increased with larger motion coherence (F(1,18) = 48.5, p < 0.001; Fig. 2C). There was a an influence of the cues on d′ that interacted with motion coherence, such that stimulus processing sensitivity (independent of the aid provided by the cue) became worse after the presentation of a predictive cue, particularly for the highest coherence level (cue × motion coherence interaction: F(2,36) = 3.31, p = 0.048). Post hoc tests indicated that d′ was indeed significantly lower for predictive compared with neutral cues for high motion coherence (p = 0.035) but not for low (p = 0.17) or medium (p = 0.74) coherence. Finally, predictive cues led to a large criterion shift (F(1,18) = 53.7, p < 0.001; Fig. 1D), which did not interact with the amount of motion coherence (F(2,36) = 2.02, p = 0.15). Post hoc tests showed that criterion was negative when trials were leftward cued (t(18)= −7.61, p < 0.001) and positive when trials were rightward cued (t(18) = 7.57, p < 0.001), while there was no significant deviation from a zero bias for neutrally cued trials (t(18) = 0.59, p = 0.56).
Expectations bias choice-selective low-frequency motor activity
Perceptual (leftward/rightward motion) decisions were, by design, coupled to button presses with the left/right hand. Therefore, we used the choice-selective lateralization of activity over motor cortex as a proxy for the dynamics of decision-related brain activity during the trial (Donner et al., 2009). Indeed, immediately preceding the decision (−250 to 0 ms before the button press), there was suppression of low-frequency power (8–30 Hz), and enhancement of high-frequency power (60–90 Hz), over the motor areas contralateral to the upcoming movement (Fig. 1A). We operationalized the “motor decision signal” by taking the difference in oscillatory power between the 15 left and 15 right sensors that were showing maximal oscillatory power differences. We then assessed whether there were any activity differences in the interval preceding the stimulus, as a function of prior expectation. When comparing expectation of leftward (Fig. 3A) with rightward (Fig. 3B) motion, there was a significant difference in the decision signal in low frequencies (spanning both the alpha- and beta-band), which was significant from 600 ms preceding stimulus onset [significant temporospectral cluster with time range (−0.6–0 s) and frequency range (7–34) Hz; pcluster = 0.0015; Fig. 3C]. Thus, perceptual expectations about the upcoming stimulus were manifest in a motor bias before the onset of the visual stimulus. To assess the spatial specificity of this phenomenon, we inspected the topographical distribution of the expectation-related effect (Fig. 1C). Indeed, the topography of the prestimulus motor bias is highly similar to the prechoice topography (Fig. 1A, bottom; spatial correlation: r = 0.84, p < 0.001). The modulation of motor cortical oscillations by expectation was confined to low-frequency oscillations: although the gamma-band activity (60–90 Hz) clearly dissociated leftward from rightward choice at the moment of the overt choice, we did not find any evidence for prestimulus or poststimulus modulation of gamma-band motor cortical activity as a function of perceptual expectation (p > 0.10; Fig. 3E). The expectation-induced lateralization of low-frequency activity was dissociable from final choice (Fig. 3D): prestimulus lateralization was similar for trials where subjects did or did not change their decision on the basis of the sensory evidence (p > 0.10). In the latter case, there was a gradual poststimulus reversal of the decision signal (Fig. 3D, dotted lines). This decoupling is caused by the fact that the final decision is not fully governed by the prestimulus bias but also by the sensory stimulus itself, i.e., on a proportion of trials the subject decides in a direction that deviates from the cue.
Spontaneous low-frequency motor activity and sequential effects
On a subset of trials (33%), subjects did not receive an informative cue about upcoming motion direction (neutral trials). Hence, on these trials they did not have any explicit expectation about upcoming motion direction. Therefore, these trials allowed us to probe whether spontaneous biases would also be visible in prestimulus activity modulations over the motor cortex. To investigate this issue, we conditioned all neutral trials on the final choice of the subject (leftward vs rightward), and compared prestimulus activity between leftward (Fig. 4A) and rightward choices (Fig. 4B). If the choice is merely driven by the sensory evidence, we would not expect any prestimulus activity differences. In contrast, we observed a lateralization of prestimulus low-frequency power over motor cortex (Fig. 4C), which was significant from 250 ms preceding the stimulus onset and most strongly present in the beta-band [significant temporospectral cluster with time range (−0.25 to −0.05 s) and frequency range (20–28 Hz), pcluster = 0.013; Fig. 4D]. This spontaneous bias was not statistically different for trials with high vs low motion coherence (t = −1.35, p = 0.20). We did not find evidence for a relation between spontaneous fluctuations in lateralization of motor gamma-band power and eventual choice (p > 0.10; Fig. 4E).
To investigate the potential origin of these spontaneously emerging neural biases as well as their effect on behavior, we examined whether the neural biases were induced by previous choices (Leopold et al., 2002). Behavioral and computational studies of performance on two-choice decision making suggest a behavioral advantage for a particular choice when it has been selected on the immediately preceding trial (Gao et al., 2009). To examine the presence of such (first-order) sequential effects and their possible interaction with motion coherence (strongest sequential effects are expected for low-coherence trials), we quantified the proportion of trials on which subjects chose for rightward motion, conditioned on the subject's choice on the preceding trial. We performed this analysis by focusing on the neutrally cued trials, and separately for the different levels of motion coherence on the current trial.
Indeed, there was a substantial effect of the previous trial on neural activity and behavior. Trials that were marked by a leftward choice on the preceding trial were biased toward a leftward choice on the current trial, and trials that were marked by a rightward choice on the preceding trial were biased toward a rightward choice on the current trial (T = 1.99, p = 0.031; Fig. 5A). This bias tended to linearly decrease with increasing motion coherence (T = 1.68, p = 0.055), with the largest bias for low coherence trials (6.95%), an intermediate bias for intermediate coherence (4.34%), and virtually no bias for high coherence trials (0.25%). The bias was not present when conditioning on the previous cue, instead of the previous choice (T = −0.65, p = 0.26), suggesting it reflects a leakage of the previous decision, rather than the previous cue. Next, we investigated prestimulus low-frequency lateralization in motor areas as a function of the preceding choice. Here, we also observed significant prestimulus lateralization on the current trial into the direction of the choice on the preceding trial (T = 2.56, p = 0.010). This difference was present for all three motion coherence levels (all p < 0.05) and was not modulated by coherence (T = 1.04, p = 0.16) (Fig. 5B). These results suggest that the decision on a previous trial indeed generates a decision bias on the current trial, which is visible in prestimulus motor lateralization, and the influence of which on behavior is dependent on the strength of the motion coherence of the impending stimulus.
Expectations bias prestimulus alpha power over visual cortex
We did not expect to detect putative direction-selective biases in neural activity in visual cortex (presumably organized on a mesoscopic scale) (Liu and Newsome, 2006) with our macroscopic MEG measurements. But, as a proxy for “global” state changes in visual cortex due to expectation, we compared overall prestimulus visual activity, as defined in Figure 1B, on trials in which there was an expectation with those in which there was no expectation. Here, we observed larger alpha-band power over visual cortex during trials in which subjects had a prior expectation (leftward or rightward, Fig. 6A) than trials in which subjects did not have a prior expectation (neutral; Fig. 6B). This power difference was significant throughout the whole prestimulus period [significant temporospectral cluster with time range (−1.05 to −0.1 s) and frequency range (7–15 Hz); pcluster = 0.010; Fig. 6C], longer than what could be explained by the length of the sliding window for this frequency range (∼400 ms). To assess the spatial specificity of this phenomenon, we inspected the topographical distribution of the visual expectation-related effect (Fig. 1D). The topography of the prestimulus increase in low-frequency power for trials in which subjects had a prior expectation provided a good match with the sensors that were marked by low-frequency suppression and high-frequency enhancement of oscillatory activity during the stimulus period (Fig. 1B; spatial correlation analysis: r = 0.58, p < 0.001). This appears to rule out the possibility that the observed prestimulus effects of expectation over occipital sensors are driven by volume conduction of motor-related effects.
There was a small physical difference between the cue types: neutral cues were composed of more characters than predictive cues (7 vs 4.5 characters, see Materials and Methods). Accordingly, there was larger gamma-band power at the time of the cue presentation for neutral cues [significant temporospectral cluster with time range (−1.2 to −0.95 s) and frequency range (75–90 Hz); pcluster = 0.047). This difference was however transient in nature, and markedly earlier than the low-frequency effects described above. Finally, there were no differences in low- or high-frequency power (Fig. 6D,E) after stimulus onset with either cue presence or validity (p > 0.10).
Brain–behavior correlations
Finally, we examined whether individual variability in perceptual/decision bias induced by the expectation cue correlated with prestimulus activity biasing of the choice-selective motor activity, as well as the occipital alpha increase. Indeed, there was a significant correlation between prestimulus motor lateralization and criterion shift (r = 0.54, p = 0.009; Fig. 7A), as well as a significant correlation between prestimulus visual alpha increase and criterion shift (r = 0.45, p = 0.026; Fig. 7B). Indeed, the prestimulus motor lateralization and visual alpha increase were themselves significantly correlated (r = 0.40, p = 0.047), possibly due to the fact that they were comodulated by the expectation cue. Thus, these two effects, which are manifest at the sensory and motor peripheries of the decision process, appear both related to the behaviorally induced biasing of choice.
Poststimulus motor lateralization
We assessed whether motor lateralization showed a modulation by the strength of the sensory evidence after stimulus onset, as has been observed during a motion detection task (Donner et al., 2009). For this, we quantified the slope of motor lateralization in both the poststimulus and preresponse time interval. We expected a steeper poststimulus and preresponse slope with increasing motion coherence (Fig. 8A). In the interval after stimulus onset, the slopes of choice-selective motor activity were modest and not significantly different from each other (F = 0.20, p > 0.10). In the preresponse period however, there was a significant difference in slope between the three motion coherence levels (F = 7.94, p = 0.0014). Post hoc tests revealed that there was a significantly larger slope of choice-selective motor activity for the high than low coherence motion stimuli (Fig. 8B; slopehigh = 0.034, slopelow = 0.021; T = 3.16, p = 0.0027), with the medium coherence motion falling in between low and high coherence motion (slopemedium = 0.031).
Discussion
In this study, we examined how prior expectation influences perceptual decision making. Using MEG, we tracked choice-selective activity in the motor system while the decision process was unfolding. We reasoned that motor cortical activity may provide a window into the decision process since the different perceptual choices were coupled to different effectors (Coles et al., 1985; Tosoni et al., 2008; Donner et al., 2009; Song and Nakayama, 2009; Gould et al., 2012). We observed that prior expectation resulted in a strong prestimulus lateralization of motor activity in low frequencies (alpha and beta bands). Spontaneous biases in this lateralization, occurring in the absence of any explicitly induced expectation, were also related to upcoming perceptual choice, and were, at least in part, driven by the choice from the previous trial. During the stimulus period, these oscillations also tracked the fidelity of the perceptual signal. Expectations also led to increased low-frequency oscillations in visual cortex before the onset of the stimulus. Finally, the extent to which expectation changed prestimulus motor lateralization and low-frequency oscillations in visual cortex within a given subject partly predicted the subjects' behavioral criterion shift induced by the expectation.
Biasing mechanisms of perceptual choice
Prior probability can change perceptual decision making in several ways, and at different stages of the decision process. Computational models of decision making have stipulated that prior probability, which has been manipulated by changing, for example, the relative proportions of two stimulus types, may lead to changes in the starting point, the gain of accumulation, or both. The presence of both types of effects have been inferred from reaction time and error rate distribution in human observers performing motion discrimination tasks (Ratcliff and McKoon, 2008), but the evidence conferred by these models is necessarily indirect. Neuroimaging studies using fMRI have asked a similar question: Which brain regions are affected by manipulating prior expectation during perceptual decision making: sensory areas collecting the evidence or decision-related areas accumulating the evidence? These studies have found evidence for biasing of activity of both the sensory areas encoding the evidence (Shulman et al., 1999; Preuschhof et al., 2010; Todorovic et al., 2011; Kok et al., 2012a,b) and associative areas related to action planning (Domenech and Dreher, 2010; Forstmann et al., 2010; Hanks et al., 2011), as well as the connectivity between sensory and associative cortical areas (Rahnev et al., 2011).
However, previous studies did not dynamically track the influence of prior probability on choice-predictive signals in the human brain. Further, no previous study directly compared such choice-selective cortical biases before stimulus onset, depending on whether they emerged spontaneously or were externally cued. Finally, no study has so far directly linked expectation effects in visual cortex to choice-selective biases in the motor cortex during perceptual choice tasks. The current study therefore provides a critical extension of previous studies by noninvasively quantifying sensory and choice-selective activity using MEG and measuring its perturbation by prior probability. Overall, our results provide evidence for the notion that prior probability modifies the starting point of the motor decision variable, as demonstrated by the prestimulus lateralization of low-frequency motor cortical activity. Previous studies have shown that low-frequency oscillations, particularly in the beta band, are attenuated before and during voluntary movements (Jenkinson and Brown, 2011; Joundi et al., 2012). Therefore, our results are in line with earlier studies that observed increased excitability (Bestmann et al., 2008) and activity (Scheibe et al., 2010) in the motor system with increasing decision certainty. They are also congruent with neurophysiological data in monkeys showing that fluctuations in prestimulus firing rates in the lateral intraparietal cortex predict upcoming perceptual choice, especially when the evidence conferred by the stimulus is weak (Shadlen and Newsome, 2001), and are selectively increased by prior expectations (Rao et al., 2012).
Our results also suggest changes in visual cortex contingent on the presence of a prior expectation (Fig. 6), as suggested previously (Schlack and Albright, 2007). However, our results lack the spatial resolution to make strong claims about the underlying mechanism of this biasing process. We speculate that the expectation-related effect over occipital cortex could reflect a state of overall reduced excitability (Jensen and Mazaheri, 2010), akin to an attentional modulation that is driven by the expectation cue (Summerfield and Egner, 2009). More specifically, since retinotopically specific alpha-band power reductions are indicative of increased visuospatial attention (Worden et al., 2000), we believe that the larger low-frequency power reduction for no-expectation trials could reflect the fact that a larger population of neurons (i.e., direction-selective cells for both motion directions) is released from inhibition (Jensen and Mazaheri, 2010) in these trials than for the expectation trials, where only the relevant (expected) subpopulation of neurons is released from inhibition. A limitation of the current study is that our expectation cues were perceptually not completely matched (i.e., they were slightly larger for neutral than non-neutral cues), which could potentially constitute an alternative explanation for the effect of expectation observed in the visual cortex. Although we cannot exclude this possibility, the fact that the magnitude of the neural effect of expectation correlated with the magnitude of the behaviorally observed criterion shift effect, and occurred in a period that was more than ∼500–600 ms after the offset of the cue presentation, suggest, however, that the neural effect in visual cortex is related to the cognitive/behavioral consequences of the cue, rather than its physical difference.
Relationship between motor bias and perceptual choice
In our experiment, we coupled perceptual choice to particular effector choices, allowing us to read out the choice-related activity from the motor cortex. This, however, begs the question whether the activity modulations that we observed may constitute a response bias that is independent from any perceptual biases that are potentially induced by the expectation cues. Although the motor lateralization is undoubtedly a reflection of motor preparation, there are several arguments for why this motor preparation signal may, at least under the current experimental conditions, provide a window into the decision process unfolding in the brain. First, previous studies have shown a tight relationship between the choice-selective motor preparatory activity and the temporal integral of sensory evidence, both in monkeys for review, see (Gold and Shadlen, 2007) and in humans (Donner et al., 2009; Siegel et al., 2011; Gould et al., 2012). As such, there is converging evidence that perceptual and cognitive processes “leak into” the motor system (Song and Nakayama, 2009). Indeed, also in our study we observed an increase in motor preparatory activity during the stimulus encoding period that is suggestive of an integration process of perceptual evidence (Fig. 8). Second, there were concomitant changes in visual cortex induced by the expectation cues, in the form of stronger alpha-band power when subjects had an expectation about upcoming motion direction compared with no expectation. Notably, the individual strength of the behavioral choice bias induced by the expectation correlated with the individual strength of both the alpha-power increase in occipital cortex and choice-selective activity in motor cortex. This suggests that these neuronal signatures of expectation evident at the sensory and motor peripheries of the decision process may in fact be produced by the same biasing mechanism.
Interestingly, we observed that spontaneous fluctuations in motor lateralization, in the absence of any explicit cueing about the probable stimulus/response, also partly predicted upcoming choice. Thus, even in the absence of explicit cues, the state of motor cortex has repercussions for perceptual choice. An analogous pattern of results has been previously found in single-unit activity in the monkey lateral and ventral intraparietal areas (Shadlen and Newsome, 2001; Williams et al., 2003). In view of the tight and bidirectional links between cortical sensory and motor systems, it is possible that these spontaneous motor fluctuations may be a downstream reflection of spontaneous fluctuations in sensory cortices (Leopold et al., 2002; Wilke et al., 2006; Hesselmann et al., 2008; Berkes et al., 2011), or vice versa. We also observed a possible source for the generation of these spontaneous fluctuations. Namely, decisions on the immediately previous trial resulted in a significant prestimulus lateralization on the current trial into the direction of the choice on the preceding trial, in line with computational models of two-choice behavior (Gao et al., 2009). Interestingly, this neural bias interacted with motion coherence, with diminishing influence of the prestimulus bias on behavior with increasing poststimulus strength (Fig. 5).
Our results do not imply that the motor cortex itself accumulates the sensory evidence during the decision process (Gold and Shadlen, 2003; Donner et al., 2009; Siegel et al., 2011). Indeed, it is likely that observers can accumulate sensory evidence in the absence of a corresponding motor plan. In line with this, we (Rahnev et al., 2011) and others (Heekeren et al., 2004; Ho et al., 2009) have previously isolated neural activity that is suggestive of an evidence accumulation signal in the dorsolateral prefrontal cortex—outside the motor system—when perceptual evidence was not directly mapped to particular responses. Integrating these former studies with the current study, we speculate that prior expectation may bias activity in associative cortical areas, such as the posterior parietal and dorsolateral prefrontal cortex, which broadcast the decision variable to motor cortex when the decisions can be mapped onto a particular motor effector.
Conclusions
We found that prior expectation affects perceptual choice behavior by altering the state of both visual and motor cortices. In particular, perceptual expectations bias the lateralization of prestimulus low-frequency activity in motor cortex and the amount of low-frequency activity in visual cortex. Moreover, spontaneous fluctuations in motor cortex lateralization could partly predict upcoming perceptual choice. Together, these results show how perceptual expectations leak into the motor system to facilitate flexible and adaptive decision making.
Footnotes
H.L. and F.P.d.L. received funding from the Human Frontiers Science Project (HFSP Short-Term Fellowship), the Netherlands Organisation for Scientific Research (NWO-VENI and NWO-Open Competition), and the Templeton Foundation (Grant 21569).
- Correspondence should be addressed to Floris P. de Lange, Donders Institute for Brain, Cognition and Behavior, Radboud University Nijmegen, Kapittelweg 29, 6500 HB Nijmegen, The Netherlands. floris.delange{at}donders.ru.nl