Abstract
The understanding of the electrophysiological properties of the subthalamic nucleus (STN) neurons is crucial since it represents the main target of deep brain stimulation for the treatment of Parkinson's disease and obsessive-compulsive disorders. The study of its nonmotor properties could shed light on the cognitive and motivational alterations possibly encountered after stimulation. In this study, we recorded the activity of STN neurons in two male behaving monkeys (Macaca mulatta) while they performed a visuomotor motivational task in which visual cues indicated which amount of force was required to obtain which amount of reward. Our results evidenced force- and reward-modulated neurons. After the occurrence of the visual stimuli, the force-modulated neurons mainly fired when a high effort was required. Differently, the activity of the population of reward-modulated neurons encoded the motivational value of the stimuli. This population consisted of neurons increasing or decreasing their activity according to the motivational ranking of the task conditions. Both populations could play complementary roles, one in the implementation of the difficulty of the action and the other in enhancing or slowing its execution based on the subjective value of each condition.
SIGNIFICANCE STATEMENT An increasing number of studies confers a role to the subthalamic nucleus (STN) in motivational and reward-related processes. However, the electrophysiological bases of such properties at the neuronal level remain unclear. The present study investigated the modulation of STN neuronal activity in monkeys performing a motivational task in which the force to produce and the reward obtained were manipulated. We found two main populations of neurons, one modulated by the effort required and the other integrating the motivational subjective value of the stimuli. This last population could help at improving decision-making to act or not, depending on the subjective value set by the motivational context. This highlights the pivotal role of STN in the valuation of cost/benefit for decision-making processes.
Introduction
Clinical and experimental data have shown that the basal ganglia (BGs) are involved in goal-directed behaviors and play a role in several processes including the selection and execution of actions, but also reward-related learning and integration of reward value. The subthalamic nucleus (STN) is considered as one of the two main input structures of the BG with the striatum, since it receives direct inputs from the cortex via the hyperdirect pathway (Nambu et al., 2002; Haynes and Haber, 2013). The involvement of the STN in motivational processes is supported first anatomically by the presence of direct projections from the ventromedial prefrontal cortex, the orbitofrontal cortex, and the anterior cingulate cortex (Takada et al., 2001; Nambu et al., 2002; Haynes and Haber, 2013; Nougaret et al., 2013), known for their pivotal role in the integration of reward information. Second, STN lesion and deep brain stimulation (DBS) studies in rodents have shown STN involvement in impulsivity and perseverative behaviors toward sweet food reward (Baunez and Robbins, 1997; Baunez et al., 2002) and the opposite motivation for natural reward and drug of abuse (Baunez et al., 2005; Rouaud et al., 2010,) and that it could affect the amount of salience allocated to stimuli conveying reward-related information (Baunez et al., 2002; Uslaner et al., 2008). Accordingly, numbers of clinical studies using DBS to treat motor symptoms of Parkinson's disease (PD) reported cognitive and motivational side effects such as impulsive choices and alteration of decision-making (Frank et al., 2007; Cavanagh et al., 2011; Coulthard et al., 2012). Third, electrophysiological recordings acquired in PD patients while performing cognitive tasks revealed strong relationships between the oscillatory activity of the local field potentials (LFPs) of the STN and the mechanisms of response inhibition and the regulation of decision processes (Cavanagh et al., 2011; Brittain et al., 2012; Zavala et al., 2014). Studying the STN LFP oscillations also revealed that the subjective cost of an action, the subjective value of a reward (Zénon et al., 2016), and the specific motor effort to assign to a motor response are represented at the STN level (Tan et al., 2015) and that this structure is involved in monetary reward processing (Fumagalli et al., 2015) and economic decisions (Rosa et al., 2013). Moreover, electrophysiological data from behaving rodents and nonhuman primates indicate that STN neurons are modulated by cues predicting reward and reward occurrence (Matsumura et al., 1992; Darbaky et al., 2005; Teagarden and Rebec, 2007; Lardeux et al., 2009, 2013; Espinosa-Parrilla et al., 2013, 2015; Breysse et al., 2015), and that they could link reward information to the motor output (Espinosa-Parrilla et al., 2013) and differentiate reward types and relative values of reward (Lardeux et al., 2009, 2013; Breysse et al., 2015; Espinosa-Parrilla et al., 2015). The STN activity correlates with the discharge balance and produces a matching change of the BG downstream structure (Deffains et al., 2016). By acting on the output structures of the BG, STN could suppress undesired movements by stimulating their inhibitory influence (Mink, 1996; Isoda and Hikosaka, 2008), but, conversely, it could thus also enhance some actions by alleviating this influence, impairing decision-making (Frank, 2006). Together, these studies suggest a critical role of the STN in decision-making and motivated behaviors and a strategic position in the cortico–BG–cortical loops involving the prefrontal cortices.
It remains unknown, however, how these functions are exerted at the single-cell level by STN neurons and particularly how two major components of a motivated behavior, the effort it requires and the benefit it brings, are integrated. To this aim, the activities of STN neurons were recorded while monkeys had to exert one of two possible levels of force on a lever to gain one reward of two possible magnitudes. Visual stimuli, displayed simultaneously, were used to indicate to the animals the level of force required and the reward magnitude on board. They set a motivational value for each condition and triggered the movement. Activities were analyzed after stimuli occurrence to examine whether these two variables were encoded by the same or different populations of neurons. Our data suggest that a population of STN neurons mainly encode the effort to be produced when a high effort is required, whereas another population of STN neurons not only encode the expected reward, but the subjective motivational value of the action requiring integration of reward and force values.
Materials and Methods
Animal and apparatus
We trained two male rhesus monkeys (Macaca mulatta), weighing 8 and 7 kg at the beginning of the experiments (Monkeys M and Y, respectively), to apply and maintain a pressing force on a lever in response to visual cues to receive a liquid reward. All experimental procedures were in compliance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals, the French laws on animal experimentation, and the European Directive on the protection of animals used for scientific purposes.
Behavioral procedures
The monkeys were seated in a Plexiglas primate chair and in front of a panel supporting a 17 inch screen on which visual cues could be presented. It was positioned 18 cm from the monkey. A lever outfitted with strain gauges in the lower part of the panel was positioned at waist level. At the front panel of the primate chair, a sliding door was opened to allow the animal to position its hand on the lever. The liquid reward (apple sauce diluted with water) was delivered through a distributor equipped with a peristaltic pump installed outside the recording room and released via a metal spout positioned directly in front of the mouth of the monkey. Figure 1A illustrates the trial schedule. At the beginning of a trial, the monkeys had to develop a basal pressing force on the lever during a 1 s preparatory period. This force was determined to be between 0% and 20% of the maximal force, experimentally defined as 900 × g based on the capabilities of the animals. After this preparatory period, two visual cues, a green one and a red one, each being either a filled circle or a filled square, were displayed vertically in the center of the screen. Their shapes indicated, for the green stimulus, the level of force the animals had to produce on the lever, and for the red stimulus the size of the upcoming reward. A green circle indicated that the animals had to produce a force between 20% and 55% of the maximal force [180–495 × g; low force (f)] and a green square, a force between 55% and 90% of the maximal force [495–810 × g; high force (F)]. Similarly, a red circle indicated to the animals that they could receive a small amount of reward [0.3 ml; small reward (r)], whereas a red square indicated that a large amount of reward could be delivered [1.2 ml; large reward (R)]. Consequently, there were four possible combinations of cues (fR, FR, fr, and Fr) that set the four different conditions of the task. In response to a pair of stimuli, monkeys had to increase their pressing force on the lever to reach the required force in a period <1 s and to hold this force for 1 s (holding time) to get the reward. According to the shape of the red stimulus, monkeys were rewarded with a small or large reward for each correct trial. The pair of stimuli was extinguished as soon as the reward was delivered. A vertical rectangle representing the range of the required force, located below the pair of stimuli, helped the monkeys to reach the required force. Indeed, a white cursor displayed in the rectangle indicated the force developed on the lever in real time when they were in the required force range. To keep cues constant across trial conditions, the animals saw the same rectangle for both the low-force and high-force ranges. Once the reward was delivered, the monkeys returned to a basal pressing force in preparation for the next trial. The next trial did not begin until the total duration of the current trial had elapsed (i.e., 4.5 s, regardless of the animal behavior). Monkeys could fail to perform a trial in three different cases. First, they did not reach the required force within the 1 s force development period. These trials were considered to be “omission errors.” Second, they did not hold the required force for at least 1 s (holding time). These trials were considered to be “holding errors.” Last, they developed a force that was greater than the upper limit of the required force (495 and 810 × g for the low and high forces, respectively). These trials were considered to be “threshold errors.” After an error, the same condition was presented again to the monkeys until they performed the trial correctly to prevent the monkey avoiding the trials of a particular condition. Moreover, trials in which the monkeys began to increase their pressing force within 100 ms after the occurrence of the cues were considered to be anticipations and were excluded from the database. Both monkeys were extensively trained (4–6 months) until they achieved a performance of 80% of correct trials. In each recording session, the four different conditions were displayed pseudorandomly from trial to trial. The same condition was not displayed more than three times sequentially if trials were performed correctly.
Task design and localization of the subthalamic nucleus recordings. A, Task design. A trial started when the monkey applied a basal force on the lever and maintained it during a 1 s preparatory period after which a pair of visual stimuli appeared on the screen (occurrence of the visual stimuli). In response to these stimuli, the monkey had to increase its pressing force until it reached the required force range materialized by a rectangle and a gauge on the screen (the time to reach the target force being the cue threshold period), and held its force for 1 s (i.e., holding period) to obtain the reward. B, Table illustrating the combinations of visual stimuli. Four possible pairs of visual stimuli indicated to the animal the force to be developed and the size of the upcoming reward. Green represented the force (F or f) and red the reward (R or r), a circle meant small (f or r), and a square meant large (F or R). The example condition shown in A was low force/large reward. C, Left, MR image from Monkey Y (left) and Monkey M (right), respectively, at +13 and +14 mm from the midpoint of the interaural line. Both images have been reoriented to fit the electrode track (Monkey Y: anteroposterior angle, −4.5°; lateral angle, 18°; Monkey M: anteroposterior angle, 6°; lateral angle, 17°).
Surgery
The surgery protocol was the same as previously described by Nougaret and Ravel (2015, 2018). Under anesthesia [first, intramuscular injection of ketamine (10 mg/kg) and xylazine (0.5 mg/kg), followed by deep anesthesia induced by isoflurane], two monkeys were implanted over the left hemisphere with a polyether-ether-ketone recording chamber (inner diameter, 19 mm). These recording chambers were positioned with a 20° angle laterally in the coronal plane. For Monkey M, the targeted stereotaxic coordinates, relative to the ear bars, were +18 mm on the anteroposterior axis and +16 mm in laterality. For Monkey Y, they were +14 mm in the anteroposterior axis and +16 mm in laterality. These landmarks were based on the atlas of Saleem and Logothetis (2007). Moreover, a device for head restraint for the future neuronal recordings composed of two titanium cylinders embedded in orthopedic cement (Palacos with gentamycin) was fixed to the skull with titanium orthopedic bone screws. Antiobiotics (marbocyl, 2 mg/kg) and analgesics (tolfedine, 4 mg/kg) were administered to the monkeys on the day of the surgery and for the following 4 d. The antibiotics (marbocyl, 2 mg/ml) were also used to fill the recording chamber before sealing it with a removable cap.
Electrophysiological recordings
The extracellular activity of single neurons was recorded with microelectrodes while the monkeys performed the task with head immobilization. These microelectrodes were custom made with glass-insulated tungsten following the technique of Merrill and Ainsworth (1972). To reach the BG structures, the microelectrode was inserted inside a stainless steel guide tube (diameter, 0.6 mm) lowered below the surface of the dura and was advanced using a manual hydraulic microdrive (model M096, Narishige). The microelectrode was connected to a preamplifier situated close to the microdrive. The electric signal was then amplified 5000 times and filtered at 0.3–1.5 kHz, and was converted to digital pulses through a window discriminator (Neurolog, Digitimer). A computer using a custom-designed software written in LabVIEW (LabVIEW, National Instruments) was used to present the visual stimuli on a screen in front of the monkey, to deliver the reward and to store in real time the force developed by the animal on the lever and the digital pulses from neuronal activity.
The microelectrode was lowered to isolate neurons while the monkey was performing the task. Single neurons were isolated from the background noise and from other neurons by continuously monitoring the waveform of the recorded neuronal impulses on an oscilloscope. Before recording in the STN, the anterior limit of the external pallidal segment (GPe) was identified for another study (Nougaret and Ravel, 2018), and neurons from both caudate nucleus and putamen were recorded (Nougaret and Ravel, 2015). Additionally, the preliminary mapping we performed, based on the atlas of Saleem and Logothetis (2007), allowed us to map electrophysiologically the surrounding structure of the STN and was very helpful in defining its boundaries. STN neurons were identified based on their firing characteristics, which have been described in previous studies (Wichmann et al., 1994; Darbaky et al., 2005; Isoda and Hikosaka, 2008; Espinosa-Parrilla et al., 2013, 2015), and on the characteristic firing patterns associated with neurons in regions dorsal and ventral to the structure that daily helped us to ensure the localization of our recordings. Indeed, along the electrode trajectory were encountered the thalamus, zona incerta, the STN, and finally the substantia nigra pars reticulata (SNr) or SN pars compacta. The differences in the baseline activity of these structures and their background noises made clear the transitions between them. The very specific and high-frequency activity of the SNr was particularly useful in confirming the localization of neurons previously recorded along the electrode track. The activity of the first well isolated and stable STN unit in a trajectory was recorded for at least 10 trials per condition. After recording from an STN neuron, the electrode was moved forward until another STN neuron was encountered. Data from all STN neurons recorded were included in analyses.
Localization of recordings
To assess the localization of our recordings, we used a high-resolution MRI scan for each monkey with electrodes positioned in trajectories from which we recorded neurons from the STN, the GPe, and the striatum. MR images were collected using a T1-weighed sequence (recovery time, 1700 ms; echo time, 4.414 ms; flip angle, 30°; in-plane resolution, 0.6 × 0.6 mm; thickness, 0.6 mm). On the basis of the localization of these electrode tips, we extrapolated the inferior/superior, anterior/posterior, and medial/lateral positions of each recorded neuron to generate a 3-D reconstruction of the whole neuronal population using Brainsight software (Rogue Research). The coordinates of each neurons were calculated based on their relative distance with the midpoint of the interaural line for each monkey. Because of the difficulties to clearly evidence the STN boundaries based on MR images, each neuron was then projected on a reconstruction of the STN based on the coordinates of its boundaries on the atlas of Saleem and Logothetis (2007).
Data analyses
All data analyses were performed using conventional statistical procedures with the R statistical computing environment (R Development Core Team, 2011), except for the population decoding analysis that was performed using the neural decoding toolbox (Meyers, 2013) on MATLAB (MathWorks). Data were analyzed from 8469 trials performed (correct and incorrect) by the animals, while a total of 78 STN neurons were recorded.
Behavioral analyses
Two different measures were analyzed to evaluate the behavior, reaction times (RTs), and acceptance levels of the animal. The RTs were defined as the duration between the onset of the cues and the time at which the monkey started to increase its pressing force on the lever and were only calculated for correct trials. After being changed into z scores for normalization purposes, a two-way ANOVA was performed with required force and expected reward as the two factors. Acceptance levels were computed by dividing the total number of trials accepted by the animal in a given condition (correct trials + holding and threshold errors) by the total number of trials proposed to the animal in this condition (trials accepted + omission errors) and compared with a Pearson's χ2 test. This acceptance level reflects whether the animal chose to perform the task or not, depending on the level of force and the reward size. The force developed by the animals in each trial at each time of the task was collected and averaged by condition to highlight possible differences within a same range of force between two different reward conditions.
Electrophysiological analysis
Response of STN neurons to the force and reward factors
We focused our analysis on the “cue-threshold period” (Fig. 1A), which started with the occurrence of the cues and ended when the force developed on the lever exceeded the lower threshold of the force range. It corresponded to the period in which the animal saw the cues, integrated their significance, and reacted to them accordingly to reach the required force range. The duration of this period varied across trials depending on the behavior of the animal. In our task, the force required to correctly perform the trial, based on the shape of the stimuli, and the force applied by the animals on the lever highly covary and could not be inserted as factors of the same model for electrophysiological analysis. To disentangle the “motor” modulation, that is, modulation by the force applied by the animals on the lever, from the “factors” modulation, that is, the force required, the expected reward, and the interaction between both, we have performed a two-step iterative general linear model (GLM). First, we considered a model in which the force applied (ForceApplied) could be explained by the amount of required force (Force), the amount of expected reward (Reward), the interaction between both factors (Force:Reward), and a residual part not explained by these factors (ResidualsForceApplied) as follows:
The goal of this first iteration was to extract the residual part ResidualsForceApplied, which was the part of the force applied not explained by the factors. This part was then used in the second iteration together with the force and reward factors. It allowed evaluation of the modulation of the firing rate by the force applied, after the modulation by the factors had been extracted from it. We defined the second iteration as follows:
ResidualsForceApplied represented the modulation by the force applied on the lever not explained by the force and reward factors. Force represented the modulation by the amount of force required, Reward the modulation by the size of expected reward, and Force:Reward by the modulation by the interaction between both. ResidualsFiringRate represented the part of the variance not explained by these variables. To minimize the probability of making type I errors under the null hypothesis and to compensate for the high risk of a familywise error because of multiple comparisons (78 neurons), we performed bootstrap analyses for the second iteration (Maris and Oostenveld, 2007; Lindquist and Mejia, 2015). This allowed us to compute p values without making assumptions about the distribution of the data. It consisted of randomly resampling the neuronal data to obtain replications of the same size as the original dataset. This procedure was performed 9999 times in the analysis for each neuron, each time with a different resampling. The likelihood ratio was extracted for each resampled dataset and compared with the one obtained from the original dataset. Then, if the original likelihood ratio fell above the 95th percentile (equivalent p value of 0.05), the neuron was considered to be significantly modulated by the factor of interest. The number of neurons modulated by the force applied and by the force and reward factors and their interaction were collected. For each neuron, a force selectivity index (FSI) and a reward SI (RSI) were estimated. These SIs were defined as follows:
In this formula, µx was the mean of the FiringRate during the cue-threshold period; SSx was the sum of the squares of the difference between the mean firing rate and the firing rate in an individual trial for each pair of condition described below; and dfx was the degree of freedom (number of trials − 1) for each pair of conditions described below (Peck et al., 2013). For each neuron, the FSI was computed by comparing the neuronal activity during trials in the high-force conditions (Fr and FR, represented by the subscript number 1) with the neuronal activity during trials in the low-force conditions (fr and fR, represented by the subscript number 2). A positive FSI indicated a stronger modulation in the high force conditions, whereas a negative index indicated a stronger modulation in the low-force conditions. In the same way, for each neuron, the RSI was computed by comparing the neuronal activity during trials in the large-reward conditions (fR and FR, represented by the subscript number 1) with the neuronal activity during trials in the small-reward conditions (fr and Fr, represented by the subscript number 2). A positive RSI indicated a stronger modulation in the large-reward conditions, whereas a negative index indicated a stronger modulation in the small-reward conditions.
Alignment on the reaction time
The previous analysis was also performed in a 150 ms period following the RT to assess the influence of movement initiation on STN neuronal activity. As for the cue-threshold period, the number of significantly modulated neurons was computed and the SIs were estimated during this period. The average spike density was also calculated aligned on the reaction time to determine whether the neural response was triggered by the cue onset or the movement initiation (see Fig. 3D). Because it was clearly triggered by the cue onset, the analysis described later was applied only on the cue-threshold period and with the neuronal activity aligned on the cue onset.
Relation between the anatomic localization and the selectivity indices.
Each recorded neuron was reported on tridimensional representation of the brain and potential correlations between their localization inside the STN and their capacity to encode the factors of the task (FSI and RSI) were investigated. Pearson correlations were performed contrasting the FSI or the RSI of each neuron with its position (in millimeters) in anteroposteriority, laterality, and depth.
Independence of subpopulations of neurons
The level of dependence between neurons belonging to subpopulations responding to the factors of the task was assessed using resampling methods. From the whole population of neurons (N), we defined the neurons selective for the amount of force (Nforce), the neurons selective for the amount of reward (Nreward), and the neurons selective for both (NFR). Then, we reassigned randomly the previously computed p values for force and reward to have a simulated population of neurons. This resampling was performed 20,000 times and allowed us to have the distribution of the number of neurons NFR found by chance. The position of our measured NFR on this distribution allowed us to determine the dependence between both populations (i.e., if the encoding of a factor was predictive or preclusive to the encoding of the other factor).
Neural decoding analysis
We performed a neural decoding analysis using the neural decoding toolbox developed by Meyers (2013). This analysis used a maximum correlation coefficient classifier method trained to discriminate, in our case, among the four conditions of the task or between the two levels of a task factor, and to compute the decoding accuracy. Each recorded cell activity was formatted as a sequence of average activity by bins of 150 ms sampled at 20 ms intervals (overlap, 130 ms) for each trial. For this population analysis, we first considered the whole population of 78 neurons and defined the optimal split factor (k = highest number of trials in each condition for each site). We decided to eliminate the 11 neurons with an insufficient number of trials in each condition for such analysis and to perform it on 67 neurons (sites) sharing at least 17 trials (k) per condition (4 × 17 = 68 data points). The following step consisted to randomly select from all the available data points of each site a population of 68 data points to shape a pseudopopulation of neurons (i.e., neurons recorded separately but treated as recorded simultaneously) with an equal number of data points. Then the data were normalized into z scores to allocate the same weight to each neuron and avoid the influence of a higher firing rate on the decoding method. The classifier was trained using k – 1 number of splits and was next tested on the remaining split. This procedure was repeated 50 times to increase the strength of the results, generating new splits and, consequently, new pseudopopulations. The results were then averaged over these 50 runs. To estimate the significativity of the classifier accuracy, a permutation test was performed by shuffling the labels and randomly assigned them to the conditions before rerunning it. This procedure was repeated 10 times to obtain a null distribution of the decoding accuracies. The times when the decoding accuracies were above what was considered chance level were considered to be statistically significant. The significance level was considered reached if the real decoding accuracies were greater than all the ones of the shuffle data in the null distribution for at least five consecutive significant bins. Always considering the whole population of neurons, when the decoding analyses of the force and reward factors were performed separately, we chose a k of 25 and 26, respectively, allowing us to consider 77 of 78 neurons and to remove only 1 cell. For the decoding analysis of the subpopulations of neurons modulated by the force or reward factors, we used different k values, adapted for each situation. For the force-modulated neurons (n = 19), we used k = 13, 18, and 17, respectively, to test the decoding of the condition, the reward factor, and the force factor, allowing us to consider the whole population − 1 (n = 18) to test the condition, and the whole population (n = 19) to test the factors. For the reward-modulated neurons (n = 15), we used k =12, 26, and 25 to test the decoding for the condition, the reward factor, and the force factor, respectively. It allowed us to consider the whole population (n = 15) in all cases.
Results
Behavioral results
Behavioral analyses were performed on trials completed while STN neurons were recorded (78 neurons; Monkey M, 30 neurons in 16 d of recording; Monkey Y, 48 neurons in 24 d of recordings).
Reaction times
Average RTs (i.e., the time to reach the lower threshold of the required force after the occurrence of cues) were computed from the correct trials only (Monkey M, 2337 correct trials; Monkey Y, 3942 correct trials; Fig. 2A). RTs were significantly shorter for the large-reward trials than for the small-reward trials in Monkey M (two-way ANOVA on RT z score: preward < 0.001, F(1,2333) = 95.9) and in Monkey Y (two-way ANOVA on RT z score: preward < 0.01, F(1,3938) = 7.34). Although there was a slight decrease in the high-force condition for both monkeys, the two-way ANOVA revealed that there was no significant difference in the RTs between the high-force trials and the low-force trials (Monkey M: pforce > 0.05, F(1,2333) = 2.85; Monkey Y: pforce > 0.05, F(1,3938) = 0.07). In both monkeys, there was no interaction effect between the required force level and the size of the expected reward on the RTs.
Behavioral performance of both monkeys. A, RTs of the monkeys in the four conditions of the task. Solid black lines, high force; dashed black lines, low force. The error bars represent the SEM. The stars indicate the influence of force and reward on the animal reaction time (two-way ANOVA: **p < 0.01, ***p < 0.001). B, Acceptance level of the animals in the four conditions of the task (fR, FR, fr, and Fr). The stars indicate a significant difference between the proportions of accepted trials on the total number of trials performed in a given condition (Pearson's χ2: *p < 0.05, ***p < 0.001). C, Mean of the force developed on the lever along the trial by the animals in the four conditions of the task. Black lines, High force; gray lines, low force; thick lines, large reward; thin lines, small reward. The dashed vertical line represents the occurrence of the visual stimuli.
Acceptance level
Both monkeys shared an acceptance level, ordered from the highest to the lowest, for the conditions fR, then FR, then fr, and finally Fr (Fig. 2B). For both monkeys, the four conditions were thus ranked in the same preference order. The size of the expected reward seemed to be more relevant than the level of effort required for them to decide whether to perform the task or not. In the most accepted fR conditions, monkeys decided to perform the action in 98.7% (Monkey M) and 98.9% (Monkey Y) of the trials. In contrast, in the least accepted Fr conditions, the monkeys only performed the action in 81.2% (Monkey M) and 87.7% (Monkey Y) of the trials. FR trials were accepted more frequently (Monkey M, 96.9%; Monkey Y, 96.3%) than fr trials (Monkey M, 86.0%; Monkey Y, 94.2%). The overall difference between the accepted trials and the rejected ones was highly significant for both monkeys (Monkey M: χ2 = 191.05, p < 0.001; Monkey Y: χ2 = 157.03, p < 0.001). Moreover, a 2 × 2 comparison revealed that each level of acceptance was different from the others (Monkey M: χ2 > 4.93, p < 0.05; Monkey Y: χ2 = 5.67, p < 0.05). These results show that the monkeys understood the task and valued each condition before deciding to perform the trial or not. Indeed, the effort to be made and the size of the expected reward contributed to compute the subjective value of each condition for both monkeys. As depicted in Figure 2C, for the same amount of force required, the average force applied by the animals was slightly different, depending on the expected/received reward in some periods. This result led us to consider the force applied as a variable in our analyses of the neuronal activity to isolate a reward or a force effect from motor response because of a mechanical modulation.
Electrophysiological results
STN neurons activity during the cue-threshold period
Our visuomotor task allowed us to explore how STN neurons integrated visual cues carrying effort- and reward-related information and compared them to motor-related ones. During the cue-threshold period, corresponding to the period in which the visual stimuli significance was integrated and the response developed, 10.3% of neurons (8 of 78) modulated their activity depending on the force applied by the animal on the lever, while 46.2% of neurons (36 of 78) modulated their activity depending on the task factors. Among these neurons, 52.8% of neurons (19 of 36) showed a “force effect,” which is a difference in activity between the high-force and low-force trials, and 41.6% of neurons (15 of 36) showed a “reward effect,” which is a difference in activity between large- and small-reward trials. It is important to note that only one cell belonged to both populations and that the group of neurons showing a force effect was independent of the one showing a reward effect (resampling method; equivalent p value = 0.065). The encoding of force was neither predictive nor preclusive to the encoding of reward and vice versa. On these 36 neurons, 4 showed an interaction effect (11.1%). The distribution of the force and reward SIs for each of the 78 neurons and the average spike density of the whole recorded population are shown in Figure 3. The overall distribution of the FSI during this period (Fig. 3A, green histogram) was significantly positive and not centered on 0 (Wilcoxon signed-rank test: V = 2279, p = 0.00023), and the RSI distribution showed the same tendency (Wilcoxon signed-rank test: V = 1921, p = 0.0584; Fig. 3A, red histogram). The window chosen for the analyis, between cue onset and threshold, included the initiation of the movement by the animal. To control for the influence of movement initiation on the STN neuronal activity, the same analysis was performed, but now aligned on the RT and the results compared with the one obtained for the cue-threshold period. During a period of 150 ms from the RT, 15.4% of neurons (12 of 78) modulated activity depending on the force applied by the animal on the lever, while 33.3% of neurons (26 of 78) modulated activity depending on the task factors, with 6 neurons present in both categories. The majority of neurons (76.9%, 20 of 26) showed a reward effect, and only 15.4% (4 of 26) showed a force effect and 11.5% (3 of 26) showed an interaction effect (Fig. 3C). The overall distributions of FSI and RSI during this period showed the same, but not significant, tendency to be positive in a majority of neurons during the cue-threshold period (Fig. 3C; FSI: Wilcoxon signed-rank test, V = 1860, p = 0.1115; RSI: Wilcoxon signed-rank test, V = 1904, p = 0.07).
Distribution of the FSI and RSI, average activity of STN neurons among the task conditions, and comparison of cue versus RT alignment. A, Scatter plots of force versus reward selectivity indices for each individual neuron during the cue-threshold period. FSIs > 0 indicate higher modulation in the high-force conditions. RSIs > 0 indicate higher modulation in the large-reward conditions. The color of the dots indicates the significance of a modulation (force, reward, or interaction effect) in the GLM analysis. Filled green circles represent the neurons showing a force effect. Unfilled red circles represent the neurons showing a reward effect. Black crosses represent neurons showing an interaction effect, and small gray dots represent neurons without modulation by the task factors (n. s. for non significant). The superimposed histograms represent the distribution of the FSI (green) and the RSI (red) of the 78 neurons. B, Average spike density (σ = 30) of the whole population (n = 78) of STN neurons. The horizontal dashed line represents the baseline activity, and the four solid color lines represent the four conditions of the task (purple, fR; orange, FR; green, fr; blue, Fr). The vertical dashed line represents the occurrence of the visual cues. C, Same representation as in A for a period of 150 ms from the reaction time. D, Average spike density (σ = 50) of the whole population with all conditions combined. The vertical dashed line represents the occurrence of the visual cues for the activity represented in blue, and the RT for the activity represented in gray. The activity is slightly higher (<1 Hz) when aligned on the RT but clearly triggered by the cue onset. For scaling reasons, a neuron with an RSI higher than 2 is not represented on the scatter plot in A an C.
The average spike density, aligned on cue onset or on RT (Fig. 3D), shows that even if slighty higher when aligned on RT, the response of STN neurons was clearly triggered by the occurrence of the visual cues. For this reason, we considered the cue-threshold period to be the most relevant to further analyze the activity of STN neurons, and this period will be the only one considered in the following analysis.
Distribution of the FSI and RSI of responding STN neurons
Among the 19 neurons showing a force effect, a significantly higher number of neurons (exact binomial test, p = 0.0007) were FSI+ (17 of 19; i.e., stronger response for the larger force) and the remaining ones FSI– (2 of 19; stronger response for the lower force). Conversely, among the 15 neurons showing a reward effect, a comparable number of neurons (exact binomial test, p = 0.61) were RSI+ (9 of 15) and RSI– (6 of 15). As illustrated in Figure 4A, the spike density of the 19 neurons showing a force effect reflects the dominance of the FSI+ neurons and their response after the presentation of the cues. We did not observe any difference in terms of average spike density or distribution of the RSI for these 19 neurons, and the spike density of the conditions sharing the same force required (fr/fR and Fr/FR) was comparable. It was not the case for the 15 neurons showing a reward effect.
Distributions of the FSIs and RSIs during the cue-threshold period and average spike density of STN neurons showing a force or a reward effect. Same representation as in Figure 3. A, Indices distribution and average spike-density for the neurons showing a reward effect. Left, Scatter plot of force selectivity versus reward selectivity indices for the neurons showing a force effect (n = 19; green filled circles). The black line represents the Pearson's correlation between the FSI and RSI of the 19 neurons. The gray arrow indicates the neuron taken as an example on the right panel of the figure. Middle, The average spike density shows the higher activity in the high-force conditions after the occurrence of the cues (materialized by the vertical line at time 0). Right, Raster plot of a cell showing a force effect. Each line represents a trial, and each dot represents the occurrence of a spike. The trials are sorted among the four conditions. In this example, the activity is higher in the high-force conditions than in the low-force conditions after the occurrence of the visual cues. B, Distribution and average spike density of indices for the neurons showing a reward effect. Left, Top, Scatter plot of force versus reward selectivity indices for the neurons showing a reward effect (n = 15; empty red circles). For scaling reasons, a neuron with an RSI higher than 2 is not represented on the scatter plot. The black line represents the Pearson's correlation between the RSI and FSI of the 15 neurons, revealing a significant correlation. The gray arrows indicate the neurons taken as example on the right panel of the figure. Middle, The average spike density of the separated populations of neurons showing a reward effect. Middle, Top, Average spike density of the neurons with a positive RSI (n = 9) showing higher activity in the large-reward conditions after the occurrence of the cues (materialized by the vertical line at time 0), but also decreasing response with the high force. Middle, Bottom, Average spike density of the neurons showing a negative RSI (n = 6) showing lower activity in the large-reward conditions after the occurrence of the cues (materialized by the vertical line at time 0), but also increasing slightly with the high force. Left, Bottom, Boxplot representing the average activity during the cue-threshold period and among the four conditions of the task of both subpopulations of reward-modulated neurons RSI+ and RSI–. The boxplots illustrate the influence of the force on the reward-modulated neurons. Only for RSI– is the effect of force significant. Purple, fR; orange, fR; green, fr; blue, Fr. Right, Raster plots of neurons showing a positive (top) and a negative (bottom) reward effect at the occurrence of the cues. The influence of the force on the reward-modulated neurons is visible at the population level and at the single-cell level.
Indeed, they were equally distributed between RSI+ and RSI– neurons (nine vs six neurons, respectively). However, we observed a significant negative correlation between the force and reward indices of these 15 neurons (Pearson correlation: r = −0.56, p = 0.028) showing that the higher the RSI, the lower the FSI will be; and the lower the RSI, the higher the FSI will be. This reveals that, even if not showing a force effect, most of the neurons showing a reward effect also integrate force value. Both subpopulations of reward-modulated neurons (RSI+ and RSI–) were observed separately and revealed interesting features. The boxplot (Fig. 4B, bottom left) and the average spike density along time (Fig. 4B, middle) show that the RSI+ and the RSI– neurons encoded the task conditions following the motivational ranking of the four task conditions (fR/FR/fr/Fr). Indeed, RSI+ neurons increased their activity with the most favorable conditions of the task. At the single-cell level, the raster shown in Figure 4B, top right, evidenced this pattern of activity. As a population, we observed a tendency of positive correlation between their average activity in the cue-threshold period and the task conditions (Pearson correlation: r = 0.23, p = 0.18; Fig. 4B, bottom left, “positive RSI”). On the other hand, RSI– neurons decreased their activity in the most favorable conditions of the task. As a population, we observed a significant negative correlation between their average activity in the cue-threshold condition and the task conditions (Pearson correlation: r = −0.44, p = 0.03; Fig. 4B, bottom left, “negative RSI”). At the single-cell level, the raster shown in Figure 4B, bottom right, evidenced this pattern of activity. As a whole, the reward-modulated neurons encoded the motivational value conveyed by the visual stimuli rather than only the size of the reward by increasing or decreasing their activity according to the task conditions and their subjective value.
Neural decoding analysis
We performed a neural decoding analyis (Meyers, 2013) based on the training of a classifier to discriminate among the four different conditions, between both reward conditions (r and R) and between both force conditions (f and F). This analysis allowed us to evaluate three new aspects of the STN neuronal activity. The results are depicted in Figure 5. First, by performing the training of the classifier at one time point and testing its capacity to decode the activity at different time points (Fig. 5A–C, left), we figured out whether the encoding of the condition, force, or reward information by STN neurons was static or dynamic. The dominance of the decoding accuracy confined along the main diagonal suggests that the representation of the condition and its factors was mainly sustained by a dynamic rather than a stationary code. The difference between these two representations is still a topic of interest, but dynamic codes have been described to support complex stimulus transformation, as reported previously in studies interested in the representation of cognitive problems (Crowe et al., 2010), observed actions (Lanzilotto et al., 2019), and the ability to solve tasks more generally (Meyers, 2018). The second and third aspects concern the temporal course of the decoding of information and the comparison of the decoding accuracy on selective and nonselective neurons. We observed, considering the whole neuronal population (Fig. 5, middle) that the information regarding the amount of reward was integrated before (Fig. 5B, middle; first significant bin: 160 ms after the occurrence of the cues, red curve) the information regarding the amount of force (Fig. 5C, middle: first significant bin: 360 ms after the occurrence of the cues, green curve). Moreover, we evaluated the decoding accuracy of different neuronal populations: the neurons showing a reward effect, the neurons showing a force effect, and the remaining neurons. Interestingly, the 15 neurons showing a reward effect (Fig. 5A, right, red curve) decoded the task conditions 180 ms after the occurrence of the cues, while the neurons showing a force effect significantly discriminate among the four conditions 700 ms after the occurrence of the cues. This main difference between both populations confirms our preceding analysis: the neurons showing a force effect were only involved in the encoding of the force, whereas the neurons showing a reward effect also integrated a force information, allowing them to significantly decode among the task conditions. This result was confirmed when we looked further in the decoding of the force by the reward-modulated neurons and vice versa. Indeed, even if late, the reward-modulated neurons showed an increase in the decoding accuracy after the occurrence of the cues (Fig. 5C, bottom, red curve) that the force-modulated neurons did not show for the amount of reward at this time (Fig. 6C, bottom, green curve).
Dynamic encoding of relevant information of the task along the trial. A, Results obtained following the training of a classifier at a time t1 (y-axis) and testing this classifier at a time t2 (x-axis) for the decoding of the task condition. Left, Bidimensional map of the decoding accuracy in which each pixel represents the decoding accuracy at t2 with a training of the classifier performed at t1. The higher decoding accuracy along the main diagonal shows the dynamic decoding of the task condition. The black lines indicate the occurrence of the visual cues. Middle, The black curve represents the decoding accuracy along the main diagonal, at lag 0 (when t1 = t2) for the whole population of recorded neurons. Right, Similar representation analyzing separately the neurons showing a force effect (green), a reward effect (red), and the remaining ones (gray). The thick lines at the bottom of the plots represent the significance of the decoding accuracy above the chance level (at 25%, four conditions). The time is the beginning of the first of five significant consecutive bins based on the same analysis performed with a shuffle of the condition labels. B, C, Same representation as in A for the decoding of the amount of reward (B) and the amount of force (C). The trials are pooled between the small-reward (fr and Fr) conditions and the large-reward (fR and FR) conditions for the decoding of the amount of reward. Inversely, they are pooled between the low-force (fr and fR) conditions and the high-force conditions (Fr and FR) for the decoding of the amount of force. The chance level represented by the black line is 50% in both cases.
Topography of the neuronal recordings in the subthalamic nucleus. The three bidimensional plots on the left represent the projections of each recorded cell from the midpoint of the interaural line. Top left, AP versus laterality, horizontal view. Top right, Depth versus laterality, coronal view. Bottom left, Depth versus AP, sagittal view. Right, Three-dimensional reconstruction of the cell distribution and theoretical boundaries of the STN based on the atlas of Saleem and Logothetis (2007). The filled circles represent the neurons recorded in Monkey Y, and the filled squares represent the neurons recorded in Monkey M. The ellipsoids on the bidimensional plots represent the 95% of the cell distribution for each population of cells. Green, Neurons showing a force effect (n = 19); red, neurons showing a reward effect (n = 15); black, remaining neurons (n = 45).
Localization of the recordings
The reconstruction of the electrode trajectories allowed us to extrapolate the location of each recorded neuron. The complete reconstruction along the three different planes is depicted in Figure 6. From the midpoint of the interaural line, the average coordinates of our recording were as follows: laterality: 4.95 ± 0.71 mm (minimum = 3.07 mm; maximum = 6.68 mm); anteroposteriority: 14.08 ± 0.92 mm (minimum = 12.86 mm; maximum = 16.01 mm); and depth: 11.29 ± 1.09 mm (minimum = 8.55 mm; maximum = 13.75 mm). Based on the reconstruction made from the atlas of Saleem and Logothetis (2007), the majority of the recorded neurons were located in the anterior half of the nucleus. We performed Pearson correlations to find a potential link between the strength of the response of STN neurons (FSI or RSI) and the coordinates of the location of neurons (laterality, anteroposteriority, and depth). We found that the neurons recorded more medially (Pearson correlation: r = −0.24, p = 0.034) and deeper (Pearson correlation: r = 0.24, p = 0.031) exhibited higher FSI. No significant correlation between the location of a neuron and the RSI was found.
Discussion
The present data brought new evidence about the functional properties of the STN neurons and their role in the integration of force, effort, and motivational information. Our task allowed us to extract and differentiate information about the following: (1) the encoding of force (i.e., the force developed physically on the lever); (2) the effort (i.e., the force requested on the lever in response to the green stimulus); and (3) the motivation to act (i.e., an integration of the effort and the reward size to compute the motivational value specific of a pair of visual stimuli). First, we found that STN neurons, at the single-cell level, were mainly involved in independent processes with cells significantly modulated by the effort (i.e., the force requested to develop on the lever) or by the reward size (i.e., the amount of reward the animal can get at the end of the trial). Second, these two populations exhibited different patterns of modulation, with the effort-modulated neurons mainly active when a high effort was required, whereas the reward-modulated neurons not only responded to the reward amount, but they also integrated, as a population, the motivational value of the stimuli. Third, the population of reward-modulated neurons was composed of neurons increasing or decreasing their activity in the most favorable condition of the task and exhibiting an activity according to the motivational ranking of the four task conditions. Fourth, the reward-modulated neurons seem first to encode the reward size and then to integrate the amount of force required.
STN neurons encode the effort to produce rather than the force developed
Our results revealed an interesting feature about the properties of the STN neurons. Indeed, during the cue-threshold period, when the animal must extract information from the cues and react to them accordingly, the proportion of neurons encoding the force required (low vs high) was higher than the proportion of neurons encoding the force developed. The work of Tan et al. (2013, 2015) showed similar evidence from recordings of the LFPs of STN of PD patients. They first showed a decreased power in the beta band and an increased power in the gamma band when the effort required increased (Tan et al., 2013). In a second study (Tan et al., 2015), the authors disambiguate the effort from the force, asking the patients to exert different levels of force on a lever with the index or the little finger. For the same effort, a lower force was produced if the little finger was used. They demonstrated that STN activity encoded the effort rather than the absolute force and suggested a role of the basal ganglia in determining the effort to be attributed to a response more than in the parametrization of the movement itself. This is in line with behavioral studies in humans showing that individuals used the sense of effort more than the proprioceptive feedbacks to evaluate the force generation (Jones and Hunter, 1983; Carson et al., 2002; Proske et al., 2004). Recording the potentials evoked by transcranial magnetic stimulation of the motor cortex in peripheral muscles used in their task, Carson et al. (2002) showed that the sense of effort was not based on central motor command and proposed that is was associated with the activity of structures upstream of the motor cortex. The notion of effort to invest in an action was the center of the task performed by PD patients in the study of Zénon et al. (2016), which showed a neural response to the effort cues in the 1–10 Hz band of the STN LFP. Moreover, and in line with our results, the authors highlighted that the responses observed were more informative of the level of effort rather than the actual quantity of force. Interestingly, in our data, the deeper and the more medial the recordings, the higher the FSI. It has been recently demonstrated (Stephenson-Jones et al., 2016) that a pathway between the medial STN and the habenula-projecting globus pallidus (GPh) was involved in signaling when an outcome was aversive or worse than expected (Stephenson-Jones, 2019). We could hypothesize that the neurons encoding a high effort to be produced, located on the medial border of the STN, projected on the GPh and transferred a negative signal to the lateral habenula. This hypothesis is supported by the fact that in rodents, a subpopulation of STN neurons could encode aversive reinforcers (Breysse et al., 2015). In a task similar to ours, Varazzani et al. (2015) reported a modulation of the noradrenergic neurons of the locus coeruleus (LC) by the task difficulty at the moment of the action. To date, no direct connections between the LC and the STN have been reported, but we could hypothesize about an indirect influence of LC effort-related activity on STN neurons through a prefrontal pathway. This last point is also supported by the fact that the force effect appeared later during the trial than the reward effect. It might well be possible that the reward-modulated neurons are directly sensitive to the cue information, while the force-modulated neurons are reflecting a more integrated process like action preparation at some point.
STN neurons encode the motivational value of the combined visual stimuli
Neural correlates between the activity of STN neurons and stimuli predictive of a reward or the reward itself have been previously shown in rodents (Baunez et al., 2002, 2005; Teagarden and Rebec, 2007; Lardeux et al., 2009, 2013; Breysse et al., 2015; Baunez, 2016) and in nonhuman primates (Matsumura et al., 1992; Darbaky et al., 2005; Espinosa-Parrilla et al., 2013, 2015). The population of reward-modulated neurons we recorded also integrated, as a population, information about the force required, as shown by the negative correlation between the RSI and FSI of these neurons, and their ability to decode the condition and not only the reward size. STN neurons are known to be directly interconnected with a number of prefrontal areas (Takada et al., 2001; Nambu et al., 2002; Haynes and Haber, 2013) with some degree of overlap between STN territories (Haynes and Haber, 2013; Nougaret et al., 2013). They would allow the gestion of conflict during decision-making by inhibiting the cortical activity through the STN–globus pallidus internal segment–thalamus–cortex pathway. This enables a control of impulsivity by allocating a temporal window necessary for the scrutiny of the different available options (Frank et al., 2007; Cavanagh et al., 2011). The role of STN in the control of impulsivity and decision-making has been largely documented in both rats and humans (for review, see Frank et al., 2007; Eagle and Baunez, 2010; Breysse et al., 2021). In our study, this subpopulation of STN neurons could send a forerunner information to the output structures of the BG or to the GPe regarding the estimation of the subjective reward value (i.e., also integrating the effort in the valuation of the reward). This information would help to improve the decision-making by promoting, slowing down, or stopping the execution of the action, as suggested by Isoda and Hikosaka (2008). This computation could be under the influence of dopamine neurons known to play a role in value-based behaviors in a similar paradigm (Varazzani et al., 2015). Another target of these STN neurons could be the ventral pallidum (VP) with whom it shares reciprocal connections (Haber and Knutson, 2010). The VP contains cells that display distinct reward modulations depending on the expected outcomes, the reward-positive and reward-negative types (Tachibana and Hikosaka, 2012). Moreover, because the reward-positive neurons combined expected reward values and expected costs, the authors argued that the VP neuronal activity is used for modulating impending motor actions. Considering the reciprocal connections between the STN and VP and the populations of positive and negative RSI neurons we found in the STN, we can hypothesize that these two structures would work together to update the value of a behavioral context and modulate a corresponding motor output. The temporal dynamic would be interesting to compare between the VP and STN. However, the present study shows that the encoding of the reward size was a fast process (180 ms) that occurs before integration with the force-related information. The fact that STN neurons are able to integrate both information in a sequential order is in line with the LFP recordings in PD patients tested in a similar task showing modulation of activity with regard to the net subjective value (Zénon et al., 2016). Interestingly enough, these comparable results were obtained with a simultaneous combined cue presentation here, while in the task used with the patients, the cue indicating the size of the reward was presented before the cue related to the effort to produce (Zénon et al., 2016). In monkeys, it has been reported (Espinosa-Parrilla et al., 2015) that STN neurons are sensitive only to the value of the outcome at its occurrence in the context of a choice. Here, we extend the precedent findings, showing the encoding of the motivational value of the visual stimuli by STN neurons, in the absence of the choice to be made. The differences in the conclusions could be partly explained by the differences between the task used here and the one used by Espinosa-Parrilla et al., (2013, 2015) in the fact that, in our task, the reward amount varied but its identity did not, and, second, that various levels of force were needed and lead to different efforts, implying a cost–benefit integration.
Limitations of our interpretations and future perspectives
The present study demonstrates new features on STN neuron properties and completes our previous findings on the activity of the GPe neurons (Nougaret and Ravel, 2018) and the tonically active neurons of the striatum (Nougaret and Ravel, 2015) in the same paradigm. Indeed, the integration of the motivational value of the visual stimuli was found only in the STN as a population, placing this structure as an essential node modulating motivated behaviors within the BG circuitry. In our study, there was no choice to be made between two options: the choice was to perform the action or not, and we recorded only few omission trials in each condition, making it difficult to study the decision of the monkey to take the action or not, unlike in the study recording STN LFPs in parkinsonian patients using a similar task (Zénon et al., 2016). Consequently, our results raised conclusions about the incentive motivation rather than decision-making about performing a motivated action. Moreover, to have a more complete view on how motivational information is processed within the BG, it would be of great interest to compare the properties of STN neurons with the ones of projection neurons of the striatum, the other main input structure of the circuit. Also, the STN is at the center of at least two main pathways within the basal ganglia, the indirect and the hyperdirect pathway, and our recordings did not allow us to identify whether the recorded neurons received mainly inputs directly from the cortex or indirectly through the striatum and the GPe. Complementary studies involving inactivation of specific pathways could help to shed light on the contribution of each cortical and subcortical inputs in goal-directed behaviors and on STN neuronal responses. In addition, the understanding of how STN neurons encode motivational information appears fundamental to comprehending the nonmotor neuropathologies involving dysfunctions of the BG, such as addiction and obsessive compulsive disorders, and the alterations of reward-based behaviors encountered in patients with Parkinson's disease. Today, the STN-DBS introduced by Limousin et al. (1995) is used worldwide to alleviate the motor symptoms in PD patients, but it also affects the cognitive and motivational deficits observed. Animal and clinical studies reported that STN-DBS can improve these nonmotor deficits but can also make them worse (Chaudhuri and Schapira, 2009; Castrioto et al., 2014), in some cases triggering an apathy that cancels the motor improvement observed in PD patients (Martinez-Fernandez et al., 2016). However, STN-DBS can also reduce the oscillations between hypodopaminergic and hyperdopaminergic states and diminish the compulsive use of dopaminergic medication and other forms of impulse control disorders observed in some PD patients (Lhommée et al., 2012; Eusebio et al., 2013). Interestingly, STN-DBS applied in parkinsonian patients performing a similar task to that used here increased their level of acceptance for trials involving a higher cost (Atkinson-Clement et al., 2019). This may be explained by either a faulty encoding of the effort or an increased motivation for the reward, which is in line with former studies showing an increased motivation for sweet food when an effort is required in a progressive ratio schedule of reinforcement (Rouaud et al., 2010), unlike when no effort is implied (Vachez et al., 2020). In contrast to what is reported with food reward, STN lesions or DBS reduce motivation for substances of abuse (e.g., cocaine, heroin, and alcohol; Baunez et al., 2005; Lardeux and Baunez, 2008; Rouaud et al., 2010; Pelloux and Baunez, 2017; Wade et al., 2017), suggesting it could be an interesting target for addiction treatment (Pelloux and Baunez, 2013). Beneficial effects of STN-DBS have indeed been shown on escalated heroin or cocaine intake (Wade et al., 2017; Pelloux et al., 2018). It was further shown that abnormal oscillatory activity within the STN might be associated with the escalated drug intake (Pelloux et al., 2018). Further work will thus be needed to understand more thoroughly how STN neuronal activity plays its role in motivational processes and how it could contribute to repair pathologic states.
Footnotes
This work was supported by Center National de la Recherche Scientifique, the Aix-Marseille Université, the Fondation de France (Parkinson's Disease Program Grant 2008 005902 to S.R.), and the Agence Nationale de la Recherche (ANR STNmotiv: ANR-09-MNPS-028-01 to C.B.).
The authors declare no competing financial interests.
- Correspondence should be addressed to Simon Nougaret at simon.nougaret{at}univ-amu.fr