Abstract
Some people are more willing to make immediate, risky, or costly reward-focused choices than others, which has been hypothesized to be associated with individual differences in dopamine (DA) function. In two studies using PET imaging, one empirical (Study 1: N = 144 males and females across 3 samples) and one meta-analytic (Study 2: N = 307 across 12 samples), we sought to characterize associations between individual differences in DA and time, probability, and physical effort discounting in human adults. Study 1 demonstrated that individual differences in DA D2-like receptors were not associated with time or probability discounting of monetary rewards in healthy humans, and associations with physical effort discounting were inconsistent across adults of different ages. Meta-analytic results for temporal discounting corroborated our empirical finding for minimal effect of DA measures on discounting in healthy individuals but suggested that associations between individual differences in DA and reward discounting depend on clinical features. Addictions were characterized by negative correlations between DA and discounting, but other clinical conditions, such as Parkinson's disease, obesity, and attention-deficit/hyperactivity disorder, were characterized by positive correlations between DA and discounting. Together, the results suggest that trait differences in discounting in healthy adults do not appear to be strongly associated with individual differences in D2-like receptors. The difference in meta-analytic correlation effects between healthy controls and individuals with psychopathology suggests that individual difference findings related to DA and reward discounting in clinical samples may not be reliably generalized to healthy controls, and vice versa.
SIGNIFICANCE STATEMENT Decisions to forgo large rewards for smaller ones due to increasing time delays, uncertainty, or physical effort have been linked to differences in dopamine (DA) function, which is disrupted in some forms of psychopathology. It remains unclear whether alterations in DA function associated with psychopathology also extend to explaining associations between DA function and decision making in healthy individuals. We show that individual differences in DA D2 receptor availability are not consistently related to monetary discounting of time, probability, or physical effort in healthy individuals across a broad age range. By contrast, we suggest that psychopathology accounts for observed inconsistencies in the relationship between measures of DA function and reward discounting behavior.
Introduction
Discounting is a natural phenomenon that describes the tendency to devalue rewards that are relatively delayed, uncertain, or require more effort than sooner, more certain, or less effortful ones. Individual differences in discounting in humans have been hypothesized to be strongly related to individual differences in dopamine (DA) function. Studies of human and nonhuman animals have reported that pharmacological effects on DA D2-like receptors (D2Rs) alter discounting (Salamone et al., 1996; St Onge et al., 2010; Koffarnus et al., 2011; Weber et al., 2016). Specifically, D2Rs are believed to regulate decisions to inhibit impulsive actions (Frank, 2005; Ghahremani et al., 2012; Robertson et al., 2015), such as choosing smaller-sooner/more-likely/less-effortful rewards. However, studies of the transient manipulation of the DA system do not clarify whether more persistent individual differences in decision making are also primarily mediated by differences in DA D2R expression.
Multiple studies have reported links between discounting behavior and forms of psychopathology that are associated with alteration in striatal DA function, including the following: drug addiction (MacKillop et al., 2011; Amlung et al., 2017), obesity (Amlung et al., 2016), schizophrenia and bipolar disorder (Ahn et al., 2011), attention-deficit/hyperactivity disorder (ADHD) (Amlung et al., 2016), and Parkinson's disease (PD) (Kaasinen and Vahlberg, 2017). While these studies suggest a common involvement of DA in discounting in disease, it leaves open questions about specific features and clinical range of influence between DA and discounting behavior.
Only a few studies have directly assessed associations between trait-like individual differences in DA function and discounting behavior. Several recent studies using PET suggest that reduced availability of DA receptors contributes to greater discounting (for a summary of DA PET studies of reward discounting, see Table 1). However, many existing studies are limited by small sample sizes (Button et al., 2013), a focus on only temporal discounting (Crunelle et al., 2014; Ballard et al., 2015; Cho et al., 2015; Joutsa et al., 2015; Oberlin et al., 2015; Smith et al., 2016) or a mixture of decision features which may or may not be dissociable (Treadway et al., 2012), use of radiotracers with limited visibility outside the striatum (e.g., [11C]raclopride), or assessment of individuals with psychopathology (that vary in DA and other neuromodulatory functions) (Crunelle et al., 2014; Ballard et al., 2015; Eisenstein et al., 2015; Joutsa et al., 2015; Oberlin et al., 2015). Although prior PET studies have largely focused on the striatum, DA neurons in the midbrain also project to the amygdala, hippocampus, thalamus, anterior cingulate, insula, and frontal and parietal lobes (Björklund et al., 1978; Berger et al., 1991). Accordingly, there may be subtle differences in how DA function uniquely accounts for different types of discounting across the brain in individuals who vary in DA status.
It remains unclear whether there exists a reliable association between individual differences in DA and discounting in healthy humans. Here, in two studies (one empirical and one meta-analytic), we sought to characterize the relationship between individual differences in DA and decision making in healthy human adults. In Study 1, we analyzed data from three samples of healthy adults (young adults, N = 25, and adult life-span, N = 84, N = 35). We estimated time, probability, and effort discounting of monetary rewards using multiple tasks that attempted to dissociate discounting of these three decision features and estimated DA D2R availability using PET imaging with two different radiotracers, [18F]fallypride and [11C]FLB 457, with complementary coverage of striatal and extrastriatal brain regions (Cropley et al., 2006). We analyzed data from multiple samples across a broad age range to examine the generality of effects across human adults. In Study 2, we performed a quantitative meta-analysis to examine the consistency of or variation in individual differences across PET imaging studies of DA and discounting in healthy human adults and clinical groups.
Materials and Methods
Study 1
Participants and procedures.
The data analyzed here were collected from three different samples at two different universities. They will be described as Samples 1–3. Sample 1 included 25 healthy young adults (ages 18–24 years; mean ± SD, 20.9 ± 1.83 years; 13 females) recruited from the Vanderbilt University community in Nashville, TN between 2012 and 2013. Sample 2 included 84 healthy adults (ages 22–83 years; mean ± SD, 49.4 ± 17.6 years; 48 females) recruited from the Greater Nashville, TN metropolitan area between 2013 and 2016. Sample 3 included 35 healthy adults (ages 26–79 years; mean ± SD, 47.7 ± 17.4 years; 30 females) recruited from the Greater New Haven, CT metropolitan area between 2015 and 2017. Data from Samples 1 and 2 were collected at Vanderbilt University, and data from Sample 3 were collected at Yale University (for descriptive statistics for each sample, see Table 2).
Screening criteria.
Across samples, participants were subject to the following exclusion criteria: any history of psychiatric illness on a screening interview (a Structural Interview for Clinical DSM-IV Diagnosis was also available for all subjects and confirmed no history of major Axis I disorders) (First et al., 1997), any history of head trauma, any significant medical condition, or any condition that would interfere with MRI (e.g., inability to fit in the scanner, claustrophobia, cochlear implant, metal fragments in eyes, cardiac pacemaker, neural stimulator, pregnancy, and metallic body inclusions or other contraindicated metal implanted in the body). Participants with major medical disorders, including diabetes, and/or abnormalities on a comprehensive metabolic panel, complete blood count, or EKG, were excluded. Participants were also excluded if they reported a history of substance abuse, current tobacco use, alcohol consumption >8 ounces of whiskey or equivalent per week, use of psychostimulants (excluding caffeine) more than twice at any time in their life or at all in the past 6 months, or any psychotropic medication in the last 6 months other than occasional use of benzodiazepines for sleep. Any illicit drug use in the last 2 months was grounds for exclusion, even in participants who did not otherwise meet criteria for substance abuse. Urine drug tests were administered, and subjects testing positive for the presence of amphetamines, cocaine, marijuana, PCP, opiates, benzodiazepines, or barbiturates were excluded. Premenopausal females had negative pregnancy tests at intake and on the day of the scan. There were minor differences in exclusion thresholds between Samples 1/2 and Sample 3 based on the location and full study protocol (e.g., a subset of subjects in Sample 3 also received an oral dose of d-amphetamine). For full screening details, see Smith et al. (2017).
PET imaging: [18F]fallypride data acquisition and preprocessing (Samples 1 and 2).
[18F]Fallypride, (S)-N-[(1-allyl-2-pyrrolidinyl)methyl]-5-(3[18F]fluoropropyl)-2,3-dimethoxybenzamide was produced in the radiochemistry laboratory attached to the PET unit at Vanderbilt University Medical Center, following synthesis and quality control procedures described by U.S. Food and Drug Administration IND 47,245. PET data were collected on a GE Discovery STE PET scanner (GE Healthcare). Serial scan acquisition was started simultaneously with a 5.0 mCi (185 MBq; Study 1 median specific activity = 5.33 mCi, SD = 0.111; Study 2 median specific activity = 5.32, SD = 0.264) slow bolus injection of DA D2/3 tracer [18F]fallypride (specific activity >3000 Ci/mmol). CT scans were collected for attenuation correction before each of the three emission scans, which together lasted ∼3.5 h with two breaks for subject comfort. Before the PET scan, T1-weighted magnetic resonance (MR) images (Turbo Field Echo [TFE] SENSE protocol; Act. TR = 8.9 ms, TE = 4.6 ms, 192 TFE shots, TFE duration = 1201.9 s, FOV = 256 × 256 mm, voxel size = 1 × 1 × 1 mm) were acquired on a 3T Intera Achieva whole-body scanner (Philips Healthcare).
PET imaging: [11C]FLB 457 data acquisition and preprocessing (Sample 3).
[11C]FLB 457, 5-bromo-N-[[(2S)-1-ethyl-2-pyrrolidinyl]methyl]-3-methoxy-2-(methoxy-11C) benzamide was synthesized as previously described (Sandiego et al., 2015) in the radiochemistry laboratory within the Yale PET Center in the Yale School of Medicine. PET scans were acquired on the high-resolution research tomograph (Siemens Medical Solutions). [11C]FLB 457 (median specific activity: 7.80 mCi/nmol) was injected intravenously as a bolus (315 MBq; average = 8.62 mCi, SD = 2.03 mCi) over 1 min by an automated infusion pump (Harvard Apparatus). Before each scan, a 6 min transmission scan was performed for attenuation correction. Dynamic scan data were acquired in list mode for 90 min following the administration of [11C]FLB 457 and reconstructed into 27 frames (6 × 0.5 min, 3 × 1 min, 2 × 2 min, 16 × 5 min) with corrections for attenuation, normalization, scatter, randoms, and dead time using the MOLAR (Motion-compensation OSEM List-mode Algorithm for Resolution-Recovery Reconstruction) algorithm (Carson et al., 2004). Event-by-event, motion correction (Jin et al., 2013) was applied using a Polaris Vicra optical tracking system (NDI Systems) that detects motion using reflectors mounted on a cap worn by the subject throughout the duration of the scan. Before the PET scan, T1-weighted MR images (MPRAGE protocol; TR = 2.4 s, TE = 1.9 ms, FOV = 256 × 256 mm, voxel size = 1 × 1 × 1 mm) were acquired on a 3T Trio whole-body scanner (Siemens Medical Systems). After decay correction and attenuation correction, PET scan frames were corrected for motion using SPM8 (Friston et al., 1994) with the 20th dynamic image frame of the first series serving as the reference image. The realigned PET frames were then merged and reassociated with their acquisition timing info in PMOD's PVIEW module to create a single 4D file for use in PMOD's PNEURO tool for further analysis.
Binding potential calculation.
We estimated D2 receptor availability as binding potential (BPND) using the simplified reference tissue model (SRTM) with the cerebellum as the reference region (Lammertsma and Hume, 1996) via two approaches: voxelwise and ROI-based (by fitting time activity curves). PMOD's PXMOD tool was used to estimate BPND voxelwise using a published basis function fitting approach (Gunn et al., 1997) (for average voxelwise BPND images from all three samples, see Fig. 1).
The set of ROIs did not completely overlap across samples due to differences in regional coverage of the radiotracers (Samples 1, 2, 3: midbrain, thalamus, amygdala, hippocampus, anterior cingulate cortex [ACC], and insula; Samples 1 and 2: ventral striatum, caudate, putamen). The midbrain was drawn in MNI standard space using previously described guidelines (Mawlawi et al., 2001; Dang et al., 2012a,b) and registered to PET images using the same transformations used in BPND calculation. All other ROIs were derived from the Hammers Atlas plus deep nuclei parcellation as produced from the parcellation of the T1 structural image of each subject in the PNEURO module of PMOD software. The PET data were registered to the T1 image for each subject and, thus, to the ROIs (all steps implemented in PNEURO module of PMOD software). BPND values from ROIs were obtained by fitting the SRTM to the PET time activity curve data from each ROI in the PKIN (kinetic modeling) module of PMOD using the cerebellum as the reference region. These ROI-based BPND values were then averaged across hemispheres. Recently, our laboratory and others have shown that many brain regions may be susceptible to partial volume effects in estimating BPND, especially in older adults as a result of age differences in gray matter volume (Smith et al., 2017). PVC increased estimated binding potential across adults of all ages while also increasing individual differences not related to age (Smith et al., 2017). Therefore, we used PVC values in all analyses presented here, with the exception of the midbrain for which we used uncorrected BPND for analysis, because it was not available in the Hammers Atlas in PNEURO. We shared both corrected and uncorrected values for all ROIs if others want to do additional analysis (these data can be accessed at https://osf.io/htq56/).
We extracted mean D2-like BPND from the midbrain (mean ± SD: 1.39 ± 0.356) for all samples because both [18F]fallypride and [11C]FLB 457 have demonstrated good SNR in this region (Ray et al., 2012; Narendran et al., 2014). We extracted mean striatal D2-like BPND from Samples 1 and 2 in the ventral striatum (uncorrected mean ± SD: 18.6 ± 3.30; PVC mean ± SD: 37.6 ± 8.43), caudate (uncorrected mean ± SD: 16.18 ± 3.52; PVC mean ± SD: 26.1 ± 5.54), and putamen (uncorrected mean ± SD: 22.8 ± 3.40; PVC mean ± SD: 33.0 ± 5.03). Because [11C]FLB 457 has poor SNR in the striatum compared with [18F]fallypride, we did not extract striatal BPND from Sample 3. We extracted mean D2-like BPND from all samples in the ACC (uncorrected mean ± SD: 0.732 ± 0.281; PVC mean ± SD: 0.912 ± 0.385), thalamus (uncorrected mean ± SD: 2.32 ± 0.622; PVC mean ± SD: 2.74 ± 0.638), amygdala (uncorrected mean ± SD: 2.191 ± 0.490; PVC mean ± SD: 3.10 ± 0.692), hippocampus (uncorrected mean ± SD: 1.05 ± 0.308; PVC mean ± SD: 1.40 ± 0.546), and insula (uncorrected mean ± SD: 2.12 ± 0.654; PVC mean ± SD: 2.35 ± 0.714). To avoid arbitrary delineations of larger cortical regions, cortical BPND associations were evaluated with whole-brain voxelwise analyses (discussed in Experimental design and statistical analysis).
Approval for the [18F]fallypride study protocol (Samples 1 and 2) was obtained from the Vanderbilt University Human Research Protection Program and the Radioactive Drug Research Committee. Approval for the [11C]FLB 457 study protocol (Sample 3) was obtained from the Yale University Human Investigation Committee and the Yale-New Haven Hospital Radiation Safety Committee. All participants in each sample completed written informed consent. Each samples' study procedures were approved in accordance with the Declaration of Helsinki's guidelines for the ethical treatment of human participants.
Reward discounting tasks.
All samples completed a temporal discounting task (N = 144), Samples 2 and 3 also completed a probability discounting task (N = 119), and Sample 2 also completed a physical effort discounting task (N = 84). All tasks were incentive-compatible (played for real cash earnings) and performed during fMRI scanning (Samples 1 and 2) or on a computer in a behavioral laboratory (Sample 3) on a separate visit from the PET imaging session as part of larger multimodal neuroimaging studies. The average number of days between a PET imaging session and performance on discounting tasks was similar between studies (Sample 1: 18.2 ± 12.5; Sample 2: 25.0 ± 18.4; Sample 3: 38.9 ± 27.3).
Temporal discounting task.
All three samples completed a temporal discounting task adapted from a previously used paradigm (McClure et al., 2004). On each trial, participants chose between an early monetary reward and a late reward. In Sample 1, the delay of the early reward was set to today, 2 weeks, or 1 month, whereas the delay of the late reward was set to 2 weeks, 1 month, or 6 weeks. In Samples 2 and 3, the delay of the early reward was set to today, 2 months, or 4 months, whereas the delay of the late reward was set to 2 months, 4 months, or 6 months. In all samples, the early reward magnitude ranged between 1% and 50% less than the late reward. Participants in Sample 1 played 84 (42 trials in two runs) trials of the temporal discounting task and participants in Samples 2 and 3 played 82 trials (41 trials in two runs). One participant in Sample 3 had missing data for this task, producing a total sample size of 143 participants with temporal discounting data across all samples.
Probabilistic discounting task.
Samples 2 and 3 completed a probabilistic decision-making task similar to commonly used two-alternative forced choice mixed gamble tasks (Levy and Glimcher, 2011). On each trial, participants chose between a smaller monetary reward with a higher probability and a larger reward with a lower probability. The probability of the higher probability reward was set to 50%, 75%, or 100%, whereas the probability of the lower probability reward was set to 25% or 50% lower. The higher probability reward magnitude ranged between 1% and 50% lower compared with the lower probability reward. Participants in Samples 2 and 3 played 82 trials of the probability discounting task. Data for this task were available for all participants, producing a total sample size of 119 participants with probability discounting data.
Effort discounting task.
The Effort Expenditure for Rewards Task (EEfRT) was adapted from an existing paradigm that used finger pressing as the physical effort required for earning a reward (Treadway et al., 2009). On each trial, participants chose between a smaller monetary reward available for a lower amount of physical effort (pinky finger button presses) and a larger reward available for a higher amount of effort. The effort required for the smaller reward was set as 35%, 55%, or 75% (of each participant's maximum press rate), whereas the effort required for the larger reward was set as 20% or 40% higher than the smaller reward (i.e., 55%, 75%, or 95%). The number of button presses required for each level of effort was individually determined based on an initial calibration procedure in which participants pressed a button with their pinky finger as many times and as rapidly as possible in a few short intervals. The smaller magnitude reward ranged between 1% and 50% lower than the larger reward. On half of the trials, after making a choice, participants were shown a 1 s “Ready” screen and then completed the button-pressing task. Participants in Sample 2 played 82 trials of the effort discounting task. No participant had missing data for this task, producing a total sample size of 84 participants with effort discounting data.
Computational modeling of reward discounting.
In addition to a simple calculation of the proportion of smaller magnitude (less delayed/higher probability/lower effort) reward choices, we used a computational model to estimate behavioral preferences. For each participant and each task, discounting was modeled with a hyperbolic discounted value function, SV = , where R represents the monetary reward magnitude, k represents the discount rate, and C represents either: (1) proportion of maximum finger press rate for effort, (2) odds against winning (1 − p(win))/p(win)) for probability, or (3) delay in days for time. Data were fit with a softmax as the slope of the decision function. Because k values are not normally distributed, we used natural log-transformed values Ln(k + 1). Past work from our laboratory has shown k values and simple proportion of smaller reward choices are highly correlated (Seaman et al., 2018). We report both scores for transparency.
Experimental design and statistical analysis.
To determine whether D2R availability in the midbrain, striatum, and extrastriatal regions were associated with discounting, we combined one sample of healthy young adults with two cross-sectional healthy adult life-span samples. We ran linear regressions between BPND and the proportion of sooner/higher probability/lower effort choices as well as k values. Regressions included control variables for age, sex, and study sample (using dummy coded variables for Samples 2 and 3 where appropriate). Standardized β coefficients are reported for these primary analyses. We corrected for multiple comparisons within each cost domain (time, probability, effort) for each region available for each combination of samples because not all samples were tested on all tasks or had BPND for all regions. We applied Bonferroni correction to p values as follows: midbrain = 0.05; striatal ROIs = 0.05/3 = 0.016; extrastriatal ROIs = 0.05/5 = 0.010. Previous work has documented associations between discounting and household income and education (de Wit et al., 2007; Reimers et al., 2009). Because we did not identify such associations between education or income with discounting in any task, we did not include these measures as covariates in regressions.
Additional exploratory ROI analyses examined whether associations between DA and discounting varied across age groups or study samples. Full evaluation of these effects required running 27 additional multiple regression analyses that evaluated main effects of D2R availability, sex, age, and study sample (as above in the primary analyses) in addition to interactions between age and D2R availability and study sample and D2R availability. Given the lack of specific hypotheses for these exploratory analyses, we applied a Bonferroni correction for multiple comparisons; only interactions that were significant at p < 0.00185 (i.e., 0.05/27 = 0.00185) are reported with follow-up within-group tests. Interactions are reported as unstandardized β coefficients. Full model outputs for all of these analyses are available on Open Science Framework (OSF) (https://osf.io/htq56/).
Exploratory voxelwise statistical testing of D2R availability was separately performed for each discounting task in each sample in MNI standard space. Because [11C]FLB 457 was acquired on a high-resolution scanner, which produced maps with lower local spatial correlation, we spatially smoothed these BPND maps with a 5 mm FWHM Gaussian kernel to increase spatial SNR (Christopher et al., 2014; Plavén-Sigray et al., 2017). Linear regressions examining the effect of proportion of sooner/higher probability/lower effort choices or Ln(k + 1) values on voxelwise BPND with age and sex as covariates were performed using FSL Randomize (version 2.9) within each sample. Threshold-free cluster enhancement (Smith and Nichols, 2009) was used to detect regions with significant correlations across the whole brain with nonparametric permutation tests (5000 permutations). Statistical maps were thresholded at p < 0.05.
Study 2
To identify research studies of interest, a PubMed search for the following terms (((dopamine) AND positron emission tomography) AND humans) AND (discounting OR impulsive choice) yielded 10 studies. Five of these studies included original analysis of the relationship between preferences in a discounting task and a PET measure of DA function and were included. An additional exhaustive search via Google Scholar identified three additional relevant and includable studies. Notably, six of the studies in the meta-analysis used tracers that bind to D2Rs for baseline receptor availability or DA release measures (Treadway et al., 2012; Ballard et al., 2015; Cho et al., 2015; Eisenstein et al., 2015; Joutsa et al., 2015; Oberlin et al., 2015), two used tracers that measure presynaptic DA uptake (Joutsa et al., 2015; Smith et al., 2016), and one used a tracer that binds to DA transporters (DATs) (Crunelle et al., 2014). The study measuring DAT reported methylphenidate occupancy after drug administration. To obtain the DAT BPND measure, we sign-flipped the correlation because DAT BPND is inversely related to methylphenidate occupancy. In addition to the present study (Study 1), which examined time, probability, and effort, one other study examined both time and probability discounting (Eisenstein et al., 2015), another study examined effort-based discounting (Treadway et al., 2012), and the remaining studies examined only time discounting. One of these studies used SPECT rather than PET and was included. If correlation coefficients were not reported, t statistics and degrees of freedom were used to generate correlation coefficients using the formula r = (Rosenthal and Rosnow, 2008). Because correlations are bound and can be skewed, they were Fisher r- to Z-transformed before meta-analysis. In the case of one study (Treadway et al., 2012), correlations between caudate D2R BPND and preferences for effort were originally reported as three within-task correlations (by probability condition). To approximate the full task correlation, we used the Fisher r to Z transformation for the three correlations and then averaged these values. Depending on the decision preference index reported (Ln(k), proportion smaller, area-under-the-curve, etc.), we sign-flipped Z scores so that more positive values reflected greater discounting (e.g., less willing to choose a larger, delayed/uncertain/effortful reward). One study did not assess or report linear correlations (Cho et al., 2015). A summary of these studies is presented in Table 1.
Because our group previously reported little to no correlation between time, probability, and effort discounting in Sample 2 (Seaman et al., 2016), we limited the meta-analysis to time-discounting measures only. Therefore, the meta-analysis included 7 studies with 14 correlation effects (including the effect of time discounting from the present study). The goal of the meta-analysis was to identify generalizable patterns that address the broader question of whether discounting is related to general striatal DA function. Because prior reports indicated positive associations within individuals between tracer targets, for example, D2 receptors and DAT (Volkow et al., 1998; Yang et al., 2004), D2 receptors and DA synthesis capacity (Berry et al., 2018), D2 receptors and DA release (Samanez-Larkin et al., 2013), DAT and DA synthesis capacity (Sun et al., 2012), and DAT and DA release (Volkow et al., 2002), we included all studies that reported a correlation with a striatal region. It should be noted that indices of any one of these radiotracer targets alone may not be reflective of general DA function but contribute to, and interact within, complex spatiotemporal circuits that impact dopaminergic synapses. If a study reported multiple striatal regions, we used the reported t statistics and p values to select only the region with the largest effect size. Because this resulted in inconsistent ROIs (with 6 effects in the whole striatum, 6 in the ventral striatum, 2 in the caudate, and 2 in the putamen), we compared the correlation between time discounting in the present study with D2R availability in the whole striatum. BPND for the whole striatum was calculated as a volume-weighted average of the caudate, putamen, and ventral striatum PVC BPND values. Included effects from the present study controlled for age, sex, and study sample. Replacing the whole striatum value with the largest substriatal effect size value in our study (ventral striatum) did not change the pattern of results.
Meta-analytic effects were derived using the metafor R package (Viechtbauer, 2010) in JASP (version 0.8.5.1) using random effects with restricted maximum likelihood (JASP Team, 2018) to help account for between-study variance. An initial meta-analysis across all studies evaluated whether the common correlation (intercept) was significantly >0 (p < 0.05). Because the study samples included groups with psychopathology and radiotracers that bind to different dopaminergic targets, we ran additional meta-analytic models to evaluate whether effect sizes depended on the interaction of these terms. We dummy-coded study populations as either belonging to a group that is characterized by addiction, healthy controls, or any other psychopathology or disease. We coded the following as addiction: pathological gambling, methamphetamine users, and non–treatment-seeking alcoholism. Other psychopathology samples included obesity, PD, and treatment-naive ADHD samples. Radiotracer targets were D2Rs, including baseline and release measures, DA synthesis capacity, or DAT. We used the Q statistic to test the null hypothesis that the common true correlation is zero and I2 values to assess significance due to variance explained by heterogeneity of the effects (Borenstein et al., 2011). Model fit quality statistics are reported for the intercept model and the interaction model, along with each of the interaction main effect terms alone. We evaluated publication bias and study precision asymmetry with visual inspection of a funnel plot and Egger's test (p < 0.05).
Results
Study 1
Discounting across studies
Average behavioral measures of time and probability discounting did not differ between samples (time: F(2,140) = 1.63, p = 0.200; probability: F(1,117) = 0.009, p = 0.925), facilitating our ability to combine samples for analysis. Simple choice proportions (e.g., smaller-sooner/total number of choices) were highly correlated with computationally estimated discount rates Ln(k + 1) for time (r(141) = 0.829, p < 0.001), probability (r(117) = 0.798, p < 0.001), and effort (r(82) = 0.830, p < 0.001). A previous publication documented a lack of associations between time, probability, and effort discounting within a subset of Sample 2 (N = 75), with the exception of a modest significant correlation between time and effort discounting using the proportion choice variable but not using the discounting parameters from the hyperbolic models (Seaman et al., 2018). Across the samples included here, we also observed a general lack of associations between discounting across the tasks. Once again, the only exceptions were significant correlations within Sample 2 between time and effort discounting using both the proportion choice variables (r(82) = 0.27, p = 0.014) and, here, a significant correlation between the proportion choice variable for time discounting and the discounting model parameter (Ln(k + 1)) for effort discounting (r(82) = 0.27, p = 0.012). However, any associations or lack of associations with behavioral measures of effort discounting should be viewed with caution given that most participants selected a high proportion of larger/high-effort choices creating a ceiling effect that restricted the range of values.
Age effects on discounting and D2R availability
Samples 2 and 3 included adults of all ages. Age was not reliably associated with reward discounting of time (r(141) = 0.049, p = 0.563), probability (r(117) = −0.007, p = 0.947), or effort (r(82) = 0.116, p = 0.293). Age was negatively correlated with BPND in the midbrain (r(142) = −0.442, 95% CI [−0.565, −0.300], p < 0.001), caudate (r(107) = −0.409, 95% CI [−0.555, −0.240], p < 0.001), putamen (r(107) = −0.350, 95% CI [−0.505, −0.173], p < 0.001), anterior cingulate (r(142) = −0.316, 95% CI [−0.456, −0.161], p < 0.001), and insula (r(142) = −0.437, 95% CI [−0.560, −0.294], p < 0.001) but not in the ventral striatum (r(107) = 0.083, 95% CI [−0.106, −0.267], p = 0.389), amygdala (r(142) = −0.145, 95% CI [−0.301, 0.019], p = 0.083), hippocampus (r(142) = −0.130, 95% CI [−0.287, 0.034], p = 0.121), or thalamus (r(142) = −0.125, 95% CI [−0.283, 0.039], p = 0.136). Correlations between age and discounting within Sample 2 were previously reported (Seaman et al., 2018). Correlations between age and BPND for Samples 2 and 3 were previously reported (Dang et al., 2016; Smith et al., 2017).
Discounting and D2R availability
We did not identify associations between D2-like BPND in the midbrain and discounting across Samples 1, 2, and 3 or the striatum and discounting across Samples 1 and 2 (Table 3). We identified a modest positive correlation between probability discounting and D2R availability in the hippocampus (Ln(k + 1): β = 0.197, SE = 0.110, t(114) = 2.06, p = 0.042). However, the correlation did not survive correction for multiple comparisons. No associations were identified between discounting and any of the other ROIs in the primary analyses (Table 3; Fig. 2).
Exploratory evaluation of possible interactions between age and D2R availability or study sample and D2R availability in predicting discounting revealed four significant interactions after controlling for multiple comparisons. The significant interactions revealed that the association between D2R availability and effort discounting varied across age in the midbrain (chose low effort: β = −0.0195, p = 0.00006, Ln(k + 1): β = −0.0496, p = 0.0015) and ventral striatum (chose low effort: β = −0.00005, p = 0.0009, Ln(k + 1): β = −0.002, p = 0.0002). Follow-up analyses examined simple effects within younger adults (ages 18–30), middle-aged adults (ages 31–57), and older adults (ages 57–83) from a tertile split of ages. In the midbrain, there was a negative association between D2R availability and discounting within older adults (chose low effort: r(32) = −0.50, p = 0.002, Ln(k + 1): r(32) = −0.46, p = 0.006) but nonsignificant associations in younger adults (chose low effort: r(13) = 0.23, p = 0.41, Ln(k + 1): r(13) = 0.25, p = 0.36) and middle-aged adults (chose low effort: r(33) = 0.27, p = 0.12, Ln(k + 1): r(33) = 0.12, p = 0.48). In the ventral striatum, there was a positive association between D2R availability and discounting within younger adults (chose low effort: r(13) = 0.67, p = 0.007, Ln(k + 1): r(13) = 0.76, p < 0.001) but nonsignificant associations in middle-aged adults (chose low effort: r(33) = 0.10, p = 0.57, Ln(k + 1): r(33) = 0.20, p = 0.26), and older adults (chose low effort: r(32) = 0.18, p = 0.31, Ln(k + 1): r(32) = −0.16, p = 0.38). No age × D2R availability interactions reached corrected levels of significance in any other ROI for effort discounting and in any ROI for time and probability discounting. No study sample by D2R availability interactions reached corrected levels of significance in any ROI for any task. See OSF for complete model output and figures (https://osf.io/htq56/).
Voxelwise analysis of binding potential maps did not reveal any significant correlations with discounting. Unthresholded statistical maps can be viewed/downloaded from NeuroVault (https://neurovault.org/collections/ZPFBVXPK/).
Study 2
Meta-analysis: DA PET studies of reward discounting
An initial meta-analysis across all studies of temporal discounting did not identify a significant common correlation between discounting and kinetic measure of DA function (Omnibus test of model coefficients, Cochran's Q = 1.03, p = 0.310, I2 = 84.7%; βintercept = −0.167, SE = 0.164, Z = −1.02, Akaike information criterion [AIC] = 28.6).
Alternatively, a model that included the interaction between psychopathology group and radiotracer target provided a better fit than the common correlation model (without interaction terms) and accounted for the heterogeneity of effects (Omnibus test of model coefficients, Cochran's Q = 35.2, p < 0.001, I2 = 37.15%, AIC = 19.8). Inspection of the coefficients suggested that psychopathology alone had a greater impact on the model than radiotracer target: βHealthy,D2-receptor/intercept = −0.088, SE = 0.124, Z = −0.708, p = 0.479, βDA synthesis capacity = −0.479, SE = 0.357, Z = −1.34, p = 0.180; βDAT = −0.034, SE = 0.410, Z = −0.084, p = 0.933; βAddiction = −0.676, SE = 0.215, Z = −3.14, p = 0.002; βOther Psychopathology = 0.720, SE = 0.317, Z = 2.27, p = 0.023; βOther Psychopathology, DA synthesis capacity = 0.605, SE = 0.566, Z = 1.07, p = 0.285.
A follow-up model with the radiotracer target interaction term alone provided a worse fit (Omnibus test of model coefficients, Cochran's Q = 2.59, p = 0.273, I2 = 83.9%, AIC = 28.4). However, the follow-up model with the psychopathology term alone provided the best model fit compared with all other meta-analysis models (Omnibus test of model coefficients, Cochran's Q = 35.7, p < 0.001, I2 = 31.8%, AIC = 14.3). Again, inspection of the coefficients suggested that psychopathology alone had a greater impact on the model, regardless of radiotracer target: βHealthy/intercept = −0.138, SE = 0.110, Z = −1.26, p = 0.207, βAddiction = −0.616, SE = 0.202, Z = −3.05, p = 0.002, βOther Psychopathology = 0.793, SE = 0.199, Z = 3.99, p < 0.001. A forest plot of the psychopathology model is provided in Figure 3. Plotted values indicate Pearson correlation coefficients for display purposes only. Visual inspection of asymmetry in a funnel plot of effects from the psychopathology model (Fig. 2) and Egger's test (Z = −2.24, p = 0.025) indicated some potential publication bias associated with differences between studies reporting effects in specific psychopathology groups. Egger's test did not indicate the presence of publication bias in the common correlation model (Z = −1.80, p = 0.072) or full interaction model (Z = −1.56, p = 0.119).
The nature of the psychopathology group effect was that healthy individuals showed a nonsignificant, small, negative correlation between DA and discounting, the addiction groups showed a significant and stronger negative association, and the other psychopathology groups showed a stronger positive association relative to healthy controls. To facilitate comparison of group effects with past and future studies, we converted estimated coefficient Z values back to Pearson correlation coefficients. The correlation for the healthy group was r = −0.137, 95% CI [−0.339, 0.076], the correlation for the addiction group was r = −0.638, 95% CI [−0.796, −0.399], and the correlation for the other psychopathology group was r = 0.575, 95% CI [0.319, 0.753]. Including additional data from studies using effort and probability discounting measures did not change the pattern of results (for additional data and figures shared on OSF, see https://osf.io/htq56/).
Discussion
Here, we examined whether time, probability, and effort discounting of monetary rewards were related to individual differences in DA function in humans. We found that preferences for shorter time delays, higher probability, and lower physical effort were generally uncorrelated with DA D2R availability across brain regions in healthy adults.
A meta-analysis comparing correlations between discounting and striatal DA function failed to detect a correlation >0. Consistent with Study 1, DA and discounting in healthy groups were unrelated. However, there was heterogeneity dependent on psychopathology, with addiction showing a strong negative relationship to DA. Together, these findings suggest that individual differences in D2Rs are not reliably associated with discounting in healthy adults. Despite numerous past findings suggesting a role for DA in reward discounting behavior, the present findings raise questions about the specific role of D2Rs in discounting.
The difference in correlations between healthy adults and clinical groups in the meta-analysis suggests that individual differences may depend on alterations in striatal DA function. In addictions, striatal D2R expression is diminished (Volkow et al., 2009; however, for discussion of mixed findings in pathological gambling potentially due to specific facets of the disorder, see Potenza, 2013). This lowered striatal D2R expression may not be compensated by other features of the DA system, such as synthesis capacity, release, reuptake, or metabolism, which also become dysregulated in addictions (Volkow et al., 2009). As a result, it is possible that effects on temporal discounting emerge when the system is dysregulated. Dysregulation in different features of the DA system may contribute to nonlinear individual differences. The inverted-U hypothesis, for example, has been invoked to characterize individual difference associations between DA and cognition (Vijayraghavan et al., 2007; Cools and D'Esposito, 2011). In this case, changes in D2Rs may shift the relative balance in extracellular DA binding with D1-like receptors. Studies have proposed similar inverted-U associations between striatal DA and trait-level sensation-seeking (Gjedde et al., 2010) or fMRI reward signals (Dreher et al., 2008), and cortical DA and delay discounting (Smith and Boettiger, 2012; Elton et al., 2017). The present meta-analytic results revealed little to no association in the healthy range and positive and negative correlations in psychopathology associated with disrupted DA function. An inverted-U relationship driven by dysregulation of striatal DA may account for the differential associations between discounting and D2Rs between healthy and clinical groups. Future studies of individuals with a broad range of disruptions in DA are needed to properly test this hypothesis.
Importantly, the measures of baseline D2R availability were static and cannot describe temporal changes in DA signaling related to reward cues. Potentially, individual differences only emerge as a result of temporal dynamics of DA midbrain spiking or DA release (which may also be affected by psychopathology). For example, phasic changes in rodents' striatal DA release vary with discounting behavior (Moschak and Carelli, 2017) and subjective value (Schelp et al., 2017). Phasic changes might better explain individual differences in human reward discounting. For example, value-related fMRI activation linked to the decision process may better capture individual difference associations with baseline DA.
The striking difference in meta-analytic correlation effects between healthy controls and individuals with psychopathology suggests that individual difference findings in clinical samples cannot be reliably generalized to healthy controls, and vice versa. Disruption of brain function as a result of addiction, ADHD, obesity, and PD is not limited to a striatal DA abnormality and is more widespread across systems. Alterations in the DA system may interact with changes to broader neural systems. For example, one model of addiction suggests multiple cognitive and motivational corticostriatal circuits interact and compensate for disruptions in glutamatergic and GABAergic prefrontal signaling (Volkow et al., 2011). Disruptions to these circuits may affect the relationship between DA and discounting in addiction (MacKillop et al., 2011). In the context of reward processing, DA release in the striatum impacts cholinergic (Wang et al., 2006), glutamatergic, and GABAergic signaling (Alexander and Crutcher, 1990; Karreman and Moghaddam, 1996). Changes in these other systems may moderate effects of D2Rs on discounting, although future studies with direct measures of these system interactions are needed to evaluate this possibility.
Two of the samples in our empirical analysis included age ranges wider than most PET studies of DA. Although age was negatively correlated with D2R availability, we did not observe age-related associations with discounting in any task. Although prior studies described age differences in discounting (Green et al., 1999; Simon et al., 2010), the lack of an association in the present study is consistent with a recent study of >23,000 adults which did not identify a correlation between age and time discounting (Sanchez-Roige et al., 2018). Well-documented age-related D2 receptor loss with no changes in discounting behavior is complementary evidence that individual differences in discounting are not likely to be D2-mediated in healthy adults. Controlling for main effects of age did not substantially change any of the results of the primary analyses, suggesting that overall the broad age range of our samples did not account for the lack of effects. However, exploratory analysis of age by D2R availability interactions revealed that associations between D2R availability and effort discounting varied across adulthood such that associations were more positive in younger adulthood (particularly in the ventral striatum) and more negative in older adulthood (particularly in the midbrain, where the signal primarily reflects autoreceptors). If replicated, this pattern might suggest that changes in the mesolimbic DA system with age have differential impact on effort-based decision making. However, it should be noted that these analyses are based on Sample 2 (the only sample that included the effort task), so the within-group analyses of effects are based on relatively few participants. Future research with larger samples across adulthood are needed to better assess the reliability of these effects.
There are several weaknesses of the present studies. Because the finger-pressing requirement for the effort task was not very difficult for participants, additional studies that elicit broader individual differences in preferences are needed to better evaluate associations between D2 receptors and effort discounting.
In the empirical study, we included data from two radiotracers, which have different kinetic properties. Because of this, tracer is confounded with other sample differences. However, in many regions, only one tracer contributed data. Furthermore, in primary analyses, we included sample as a covariate and observed no significant interactions between sample and D2R availability in predicting discounting.
The meta-analysis included data from multiple studies using tracers with complementary coverage, but our empirical study was limited to D2Rs. Future studies may benefit from comparing multiple measures in the same individuals, for example, D2Rs and DAT, the latter of which have been more consistently associated with altering discounting behavior (Wade et al., 2000; van Gaalen et al., 2006; Koffarnus et al., 2011). Although the meta-analysis included one DAT, two DA synthesis, and two DA release effects, radiotracer target did not impact the overall effects in the present analyses, and importantly, analyses restricted to D2 receptors did not impact results. However, given the limited number of effects for most DA targets, it is difficult to systematically evaluate potential variation across the DA system.
Although the meta-analysis included studies with subject samples varying broadly in clinical status, there was often only one effect per diagnostic group. Importantly, effects from the other psychopathology group that included ADHD, obesity, and PD should be interpreted with caution. Although these groups are similar in that they are impacted by alterations in DA function, there are differences in how DA is dysregulated in each of them (e.g., presynaptic synthesis capacity, DA reuptake, postsynaptic receptor expression) (Madras et al., 2005; Benton and Young, 2016; Kaasinen and Vahlberg, 2017). Grouping of addictions might present issues with respect to illness duration because alterations to DA can exhibit different immediate and long-term changes with drug use (Volkow et al., 2009).
Our meta-analytic results were restricted to temporal discounting, but they were not impacted by the inclusion of correlations for probability and effort discounting tasks. Unfortunately, there were too few of these different task associations to properly evaluate potential differential effects. Further, the absence of a strong relationship between time, probability, and effort discounting in the empirical data complicates our ability to generalize preferences across tasks. It is possible that the meta-analytic effects observed for time discounting may be different if a greater number of effects for probability and effort were observed. For example, gamblers discount over time but exhibit risk-insensitive preferences (Holt et al., 2003), suggesting that probability and time discounting may be different in addiction. Thus, to better characterize specific diagnostic groups affected by alterations in DA function, more studies are needed to evaluate associations with various forms of discounting.
The present findings indicated that individual differences in D2R availability are not consistently correlated with trait-level individual differences in reward discounting. Our combination of a relatively large empirical study with a meta-analysis adds confidence to the findings and avoids the common weakness of human PET studies, especially individual difference studies, that typically lack statistical power. Future studies specifying the relationship between baseline DA function, temporal dynamics of DA release, and discounting will likely provide additional insight into how dopaminergic control of signaling influences decision preferences in healthy individuals.
Footnotes
This work was supported by National Institute on Aging Pathway to Independence Award R00-AG042596, National Institute on Aging Grant R01-AG044838, and National Institute on Drug Abuse Grant R21-DA033611. Some of the results reported in this manuscript were presented in a poster at the Society for Neuroeconomics (2017). We thank Kevin S. LaBar for comments on portions of the manuscript.
The authors declare no competing financial interests.
- Correspondence should be addressed to Jaime J. Castrellon, Center for Cognitive Neuroscience, Duke University, Box 90999, Durham, NC 27708. jaime.castrellon{at}duke.edu
This is an open-access article distributed under the terms of the Creative Commons Attribution License Creative Commons Attribution 4.0 International, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.