Abstract
Multiple lines of evidence link economic choices to the orbitofrontal cortex (OFC), but other brain regions may contribute to the computation and comparison of economic values. A particularly strong candidate is the basolateral amygdala (BLA). Amygdala lesions impair performance in reinforcer devaluation tasks, suggesting that the BLA contributes to value computation. Furthermore, previous studies of the BLA have found neuronal activity consistent with a value representation. Here, we recorded from the BLA of two male rhesus macaques choosing between different juices. Offered quantities varied from trial to trial, and relative values were inferred from choices. Approximately one-third of BLA cells were task-related. Our analyses revealed the presence of three groups of neurons encoding variables offer value, chosen value, and chosen juice. In this respect, the BLA appeared similar to the OFC. The two areas differed for the proportion of neurons in each group, as the fraction of chosen value cells was significantly higher in the BLA. Importantly, the activity of these neurons reflected the subjective nature of value. Firing rates in the BLA were sustained throughout the trial and maximal after juice delivery. In contrast, firing rates in the OFC were phasic and maximal shortly after offer presentation. Our results suggest that the BLA supports economic choice and reward expectation.
SIGNIFICANCE STATEMENT Economic choices rely on the orbitofrontal cortex (OFC), but other brain regions may contribute to this behavior. A strong candidate is the basolateral amygdala (BLA). Previous results are consistent with a neuronal representation of value, but the role of the BLA in economic decisions remains unclear. Here, we recorded from monkeys choosing between juices. Neurons in the BLA encoded three decision variables: offer value, chosen value, and chosen juice. These variables were also identified in the OFC. The two areas differed in the proportion of cells encoding each variable and in the activation timing. In the OFC, firing rates peaked shortly after offer presentation; in the BLA, firing rates were sustained and peaked after juice delivery. These results suggest that the BLA supports choices and reward expectation.
Introduction
Economic choices entail assigning and comparing the subjective values of different goods. Valuation and decision are thought to involve the orbitofrontal cortex (OFC; Wallis, 2011; Padoa-Schioppa and Conen, 2017), but other brain regions may also contribute to these processes. In this respect, the basolateral amygdala (BLA) is a strong candidate. Anatomically, the amygdala is interconnected with prefrontal regions, including orbital and medial prefrontal areas (Porrino et al., 1981; Amaral and Price, 1984; Morecraft et al., 1992; Ghashghaei and Barbas, 2002; Cho et al., 2013). BLA lesions impair performance in goal-directed behavior following reinforcer devaluation (Málková et al., 1997; Blundell et al., 2001; Setlow et al., 2002; Fiuzat et al., 2017; Hart and Izquierdo, 2017), with effects similar to those observed following lesions of the central OFC (Gallagher et al., 1999; Rudebeck et al., 2013). In humans, bilateral amygdala lesions affect gambling and performance in trust games (De Martino et al., 2010; van Honk et al., 2013). In nonhuman primates and rodents, numerous studies have examined BLA activity in the context of reinforcement learning (Ono et al., 1995; Uwano et al., 1995; Parkinson et al., 2001; Setlow et al., 2002; Balleine et al., 2003; Corbit and Balleine, 2005; Chau et al., 2015). For example, Costa et al. (2016) found that amygdala lesions disrupted learning under deterministic and stochastic reinforcement. In addition, BLA neurons were found to encode the valence of positive or negative outcomes, or of outcome-predicting cues (Garavan et al., 2001; Paton et al., 2006; Zhang et al., 2013). Other studies found neuronal activity in the BLA was linearly related to reward magnitude (Bermudez and Schultz, 2010; Jenison et al., 2011) and context adapting (Paton et al., 2006; Saez et al., 2017). Most relevant here, Grabenhorst et al. (2012, 2016) and Zangemeister et al. (2016) showed that BLA responses during a saving–spending task reflected the decision outcome. Together, these results suggest that the BLA might contribute to the mental processes underlying economic choices. However, the behavioral tasks used in most of these studies did not include a choice. Hence, it remains unclear what decision variables may be represented in the BLA during economic decisions, and how the encoding of these variables compares with that found in the OFC.
To shed light on these issues, we recorded from the BLA of macaques engaged in a juice choice task. In this behavioral paradigm, animals choose between two juices offered in variable amounts. In each session, the relative subjective value of the two juices is inferred from the choice pattern and used to interpret neuronal activity. As in previous studies of the OFC (Padoa-Schioppa and Assad, 2006) and anterior cingulate cortex (ACC; Cai and Padoa-Schioppa, 2012), we examined the activity of individual cells from the BLA in relation to a large number of variables potentially encoded in this area. Our analyses revealed that neurons in the BLA encoded the identity and values of offered and chosen goods. Specifically, different groups of neurons encoded the value of individual juices (offer value), the binary choice outcome (chosen juice), and the subjective value of the chosen good (chosen value). These results closely resemble those previously obtained for the OFC (Padoa-Schioppa and Assad, 2006). One notable difference between the two areas was that significantly more neurons in the BLA encoded the chosen value, whereas the encoding of offer value and chosen juice was comparatively weaker. In addition, modulations in the BLA were sustained throughout the trial and peaked after juice delivery. In contrast, neuronal activity in the OFC was phasic, peaked shortly after the offer, and presented a secondary peak at juice delivery. Our results suggest that the BLA might participate in the neural processes underlying economic choices. The sustained time course of BLA activation also suggests an interpretation in terms of reward expectation.
Materials and Methods
All experimental procedures conformed to the NIH Guide for the Care and Use of Laboratory Animals and were approved by the Institutional Animal Care and Use Committee at Washington University in St. Louis.
Experimental procedures.
The choice task and the procedures for neuronal recordings were similar to those described in previous studies (Padoa-Schioppa and Assad, 2006; Cai and Padoa-Schioppa, 2012). In brief, two adult male rhesus monkeys (Macaca mulatta; monkey H, 10 kg; monkey I, 9 kg) participated in the experiments. After a period of chair training and familiarization, monkeys underwent a surgical procedure under general anesthesia to implant a head post and an oval recording chamber (main axes, 30 and 50 mm). The chamber was placed on the left hemisphere in monkey H and on the right hemisphere in monkey I. The chamber was centered on 25 mm anteroposterior and ±10 mm mediolateral, with the longer axis parallel to a sagittal plane. Presurgery and postsurgery structural MRIs were used to guide neuronal recordings.
During the experiments, the monkey sat in an electrically insulated enclosure in front of a computer monitor (57 cm distance). The gaze direction was monitored with an infrared video camera (EyeLink, SR Research). The choice task was controlled through a custom software (MonkeyLogic) written in MATLAB (MathWorks). In each session, the animals chose between two juices labeled A and B (with A preferred) offered in variable amounts. Figure 1a illustrates the trial structure. At the beginning of each trial, the animal maintained central fixation for 1.5 s. Two offers, represented by sets of color squares, were then presented simultaneously on the two sides of the fixation point. The offers remained on display for a randomly variable delay (1–2 s), at the end of which the central fixation point was extinguished (go signal). The animal indicated its choice with a saccade, and after an additional 0.75 s delay, the chosen offer was delivered. A typical session lasted ≥200 trials. Different pairs of offers were presented pseudorandomly. For each pair of offers, the spatial positions (left/right) were counterbalanced across trials. Across sessions, we used a variety of juice types (14 different juices; 25 different juice pairs).
Extracellular recordings were conducted using single tungsten electrodes (FHC; BLA and OFC in monkey H and OFC in monkey I) or linear arrays (V-probe, Plexon; BLA in monkey I). Electrodes were guided through a custom-made 1 mm grid. Distances from the top of the grid to the surface of the brain were carefully measured. Postsurgical MRI scans were used to estimate the travel distance necessary to reach the amygdala and OFC, and to reconstruct the recording coordinates. For each animal, we also performed a few dedicated sessions to map the transition between gray and white matter and to register these measures with those obtained from MRI scans. Recording locations were occasionally confirmed with an MRI scan performed with one electrode in place. Neuronal activity was amplified, bandpass filtered (300 Hz to 6 kHz; Lynx 8, Neuralynx), and acquired at 40 kHz (Power 1401, CED). Spikes were detected online and saved to disk for offline sorting (Spike2, CED). Amygdala recordings were centered on the basolateral nucleus (Fig. 2); OFC recordings were centered on area 13.
Analysis of choice patterns.
All analyses were performed in MATLAB (MathWorks). Unless otherwise indicated, the analyses conducted here were as described in previous studies (Padoa-Schioppa and Assad, 2006; Cai and Padoa-Schioppa, 2012).
The quantities of juices A and B offered on any given trial are referred to as qA and qB. In every session, animals presented a quality–quantity trade-off. If the two juices were offered in equal amounts, the animal generally chose juice A (by definition). However, if the qB/qA ratio was sufficiently large, the animal chose juice B consistently. Choice patterns were analyzed with a probit regression, as follows: where choice B = 1 if B is chosen and 0 otherwise, and Φ(X) is the cumulative distribution function of the standard normal distribution. From the fitted parameters, we derived measures for the relative value ρ = exp(−a0/a1) and the sigmoid steepness η = a1. Intuitively, the relative value ρ is the quantity that makes the animal indifferent between 1A and ρB. The steepness η, also termed inverse temperature, is inversely related to choice variability.
Neuronal classification.
Neuronal data were analyzed in seven time windows aligned with different behavioral events: preoffer (0.5 s prior the offer onset), postoffer (0.5 s after the offer onset), late delay (from 0.5 to 1 s after the offer onset), prego (0.5 s before the go cue), reaction time (from the go cue to the saccade start), prejuice (0.5 s before the juice delivery), and postjuice (0.5 s after the juice delivery; Fig. 1b). An “offer type” was defined by two offered quantities (e.g., [1A:2B]); a “trial type” was defined by an offer type and a choice (e.g., [1A:2B, B]). For each cell and each time window, we averaged spike counts across trials for each trial type. A “neuronal response” was defined as the firing rate of one cell in one time window as a function of the trial type. Trial types with fewer than two trials were discarded.
For each neuron and each time window, we conducted a three-way ANOVA (factors: offer type × position of A × movement direction). As in previous studies, we set a significance threshold of p < 0.001. As described below, a large majority of neurons were not affected by spatial factors. We thus conducted a one-way ANOVA (factor: trial type). Responses that passed the significance criterion (p < 0.001) were included in subsequent analyses. Neurons that passed this criterion in one or more time windows were referred to as “task-related” (Table 1).
We conducted a series of analyses to identify the decision variables encoded in the BLA. As in previous studies, we defined a large number of variables that neurons could conceivably encode. These included variables associated with a single juice (offer value A, offer value B, etc.), variables reflecting the subjective value of the chosen and unchosen juices (chosen value, other value, value difference, etc.), variables defined by the quantity as opposed to the value (chosen number, total number, etc.), and a variable defined by the binary choice outcome (chosen juice). In total, we examined 19 variables, which are defined in Table 2. For each neuronal response, we performed a linear regression separately on each variable. We thus obtained the regression slope and the R2 value. If the regression slope differed significantly from zero (p < 0.05), the variable was said to “explain” the response. If the variable did not explain the response, we set R2 = 0. Any given response could be explained by >1 variable. In this case, we identified the variable providing the best explanation (highest R2 value). For some analyses, we collapsed variables offer value A and offer value B (into variable offer value A|B) and variables chosen value A and chosen value B (into variable chosen value A|B). The collapsed variable was deemed to explain a given response if at least one of the two original variables explained the response. In this case, the collapsed variables was assigned the highest of the two R2 values obtained for the original variables. If neither variable explained the response, the collapsed variable was assigned R2 = 0.
To identify a small number of variables that best accounted for the whole population, we used two procedures for variable selection—namely, stepwise and best-subset. Both procedures were originally designed for multilinear regressions in the presence of multicollinearity (Dunn and Clark, 1987; Glantz and Slinker, 2001) and were adapted for the analysis of neuronal populations encoding different variables (Padoa-Schioppa and Assad, 2006). The stepwise procedure is iterative. In the first run, the variable explaining the largest number of responses per time window is selected and all of the responses explained by that variable are removed from the dataset. The procedure is then repeated on the residual dataset until newly selected variables explain less than a minimum percentage of responses. In this study, we set the minimum at 2%. (In other studies, we used 5% or 1%; here, we found that results were most stable when we imposed the 2% criterion.)
Importantly, the stepwise procedure does not guarantee optimality. In contrast, the best-subset procedure is exhaustive. For k = 1, 2, 3, …, the procedure examines all subsets of k variables, computes the number of responses explained by each subset, and identifies the subset that explains the maximum number of responses. A variant of this analysis examines the total R2 value explained across responses for each subset. For each k, the explanatory power of the variables selected by the best-subset procedure was higher than that of all the other subsets. To establish whether this inequality was statistically significant, we conducted a post hoc analysis. For each selected variable (variable X), we assessed whether the marginal explanatory power was statistically higher than that of other nonselected variables (variable Y). For each pair of variables (Table 3), nX was the number of responses explained by X, not explained by Y, and not explained by other selected variables; nY was the number of responses explained by Y and not explained by either X or other selected variables. We then performed a binomial test to examine whether the inequality nX > nY was statistically significant. We repeated the same procedure comparing the total R2 values of the responses exclusively explained by variable X or Y.
After we identified the variables encoded in the amygdala, we assigned each neuronal response to the variable providing the highest R2 value. To assess whether this neuronal representation was categorical in nature, we examined the R2 value provided by the linear regressions. Consider two variables, X and Y. For each response, we computed the difference ΔR2 = R2X − R2Y. We then examined the distribution of ΔR2 across the population of responses classified as encoding one of these two variables. A bimodal distribution suggests categorical representation (Padoa-Schioppa, 2013). Bimodality was tested with Hartigan's dip test. A recently developed clustering procedure (Onken et al., 2019) could not be applied due to high session-to-session variability in offer types and choices.
To classify neurons (as opposed to responses), we proceeded as follows. First, we focused on selected variables. Second, for each variable and each time window, we calculated the signed R2 value where the sign was that of the regression slope. Third, for each variable, we computed the sum(R2) as the total R2 value across time windows. The encoded variable was that providing the highest ‖sum(R2)‖, and the sign of the encoding was that of sum(R2).
Analysis of U-shaped responses.
Many neurons in the BLA encoded the chosen value. We examined whether their firing rates reflected the subjective nature of value. This analysis closely resembled that conducted in previous studies (Padoa-Schioppa and Assad, 2006; Cai and Padoa-Schioppa, 2012).
For each response in the dataset, we separated trials in which the animal chose juice A or juice B. For each group of trials, we regressed firing rates on the amount of juice chosen by the animal (linear fit). We thus obtained two slopes, βA for juice A and βB for juice B. If both slopes differed significantly from zero (p < 0.01) and had the same sign, the response was said to be “U-shaped.” Importantly, this procedure identified U-shaped responses in an unbiased way.
If U-shaped responses encode the chosen value, the slope ratio βA/βB should provide a neuronal measure for the relative value of the two juices (ρneuronal = βA/βB). Furthermore, this measure should reflect the subjective nature of value and should be indistinguishable from the behavioral measure obtained from the choice pattern (ρbehavioral ☰ ρ; see above). To test this prediction, we took advantage of the fact that, for any given juice pair, ρbehavioral varied to some extent from session to session. We thus performed an ANCOVA using the neuronal measure log(ρneuronal) as a dependent variable, the behavioral measure log(ρbehavioral) as a predictor, and grouping data by the juice pair. Data recorded from different monkeys and with different juice pairs were grouped separately because relative values generally depend on the subject and on the juice pair. We used the full ANCOVA model, and we included only juice pairs with ≥10 U-shaped responses.
Results
Two monkeys performed a juice choice task (Fig. 1a,b). In each session, the animal chose between two juices labeled A and B, with A preferred. Offers were represented by two sets of colored squares, and the monkey indicated its choice with a saccade. The offered quantities and the spatial configuration of the offers varied pseudorandomly. Choice patterns presented a quality–quantity trade-off (Fig. 1c). Measures for the relative value of the juices (ρ) and for the sigmoid steepness (η) were derived from a probit regression (see Materials and Methods; Eq. 1).
Neuronal encoding of decision variables in the amygdala
We recorded a total of 1132 neurons from the BLA (432 cells from monkey H, 700 cells from monkey I; Figs. 1d, 2). We analyzed firing rates in seven time windows aligned with different behavioral events (see Materials and Methods). A “neuronal response” was defined as the activity of one cell in one time window.
Visual inspection of individual responses revealed several interesting facts. First, a sizeable fraction of neurons appeared modulated by the offer type. In many cases, neuronal responses seemed to encode the variable chosen value. Figure 3a illustrates one example. This response, plotted against the offer type, had a characteristic U shape (dictated by the offer types included in the experiment). Plotting the firing rate against the variable chosen value revealed a close-to-linear relation. Figure 3b illustrates another example. In this case, the variable chosen value was encoded with a negative slope. Compared with previous findings for the OFC, responses encoding the chosen value were very prevalent in the amygdala. However, we also observed neurons encoding other variables. For example, the activity of the cell in Figure 3c increased as a function of the quantity of juice A offered and did not vary with the quantity of juice B. Plotting the firing rate against the variable offer value A revealed a close-to-linear relation. Similarly, the response in Figure 3d seemed to encode the variable offer value B. Figure 3e illustrates another example. In this case, firing rates were close to binary—high when the animal chose juice A and low when it chose juice B. Thus, this response seemed to encode the variable chosen juice.
For a quantitative analysis, we proceeded in steps. First, for each cell and each time window, we conducted a three-way ANOVA (factors: offer type × position of A × movement direction), and we imposed a significance threshold of p < 0.001. As detailed in Table 1, 409 of 1132 (36%) cells were significantly modulated by the offer type in at least one time window. In contrast, fewer cells were modulated by the spatial configuration of the offers (29/1132 = 3%) or by the movement direction (111/1132 = 10%). We then conducted a one-way ANOVA (factor: trial type). When all time windows were pooled (except the preoffer), 863 neuronal responses from 398 neurons passed the statistical criterion (p < 0.001; Table 1). Subsequent analyses focused on this population.
As in previous studies, we defined a large number of candidate variables potentially encoded by the population. These included variables associated with a single juice (offer value A, offer value B, etc.), a variable reflecting the binary choice outcome (chosen juice), variables reflecting the relative value of the two juices (chosen value, other value, etc.), and variables reflecting the numerosity (chosen number, total number, etc.; 19 variables total; see Materials and Methods and Table 2). For each response, we performed a linear regression on each variable, and we obtained the slope and the R2 value. If the slope differed significantly from zero (p < 0.05), the variable was said to “explain” the response. Figure 4 illustrates the results obtained for the whole population. For each time window, Figure 4a indicates the number of cells explained by each variable. In this plot, each response may contribute to >1 bin (because any response could be explained by >1 variable). In contrast, Figure 4b indicates the number of responses for which any given variable provided the best explanation (highest R2 value). Thus, in this plot, each response contributes to ≤1 bin. Inspection of Figure 4b reveals that the variable chosen value was the most dominant. In addition, smaller populations of responses were best explained by variables offer value A|B, chosen value A|B, or chosen juice.
Collectively, the 19 variables examined in our analysis explained 804 of 863 (93%) responses passing the ANOVA. To identify a small subset of variables that could best account for the whole population, we conducted a stepwise analysis and a best-subset analysis (see Materials and Methods). In the first iteration, the stepwise procedure selected the variable chosen value, which explained 605 of 863 (70%) responses (Fig. 5a,b). Once these responses were removed, the residual dataset was almost exclusively concentrated on variables offer value A|B, chosen value A|B, and chosen juice (Fig. 5a). In the second and third iterations, the stepwise procedure selected variables offer value A|B and chosen juice. The three selected variables collectively explained 759 responses, corresponding to 88% of task-related responses and 94% of responses explained by all 19 variables. Variables selected in subsequent iterations did not reach the 2% criterion (Fig. 5a,b). Note that because offer value A|B and chosen juice were both correlated with chosen value A|B, once the former two variables were selected, most of the responses best explained by the latter variable were removed from the dataset (Fig. 5a).
Importantly, the stepwise procedure is path-dependent and does not guarantee optimality. We thus conducted a best-subset analysis (see Materials and Methods). In one variant of this analysis, variables were selected for the maximum number of responses explained. By this criterion, the best subset of three variables included chosen value, offer value A|B, and chosen value A|B. In another variant of the analysis, variables were selected for the maximum total R2 value (see Materials and Methods). By this criterion, the best subset of three variables included chosen value, offer value A|B, and chosen juice (Fig. 5c). To gauge the significance of these observations, we conducted a post hoc analysis comparing the marginal explanatory power of each variable in the best subset with that of other, nonselected variables (see Materials and Methods; Table 3). The explanatory power of chosen value was significantly higher than that of any competing variable (all p < 0.005). The explanatory power of offer value A|B was significantly higher than that of chosen value A|B, both for the number of responses explained (p = 0.002) and for the total R2 value (p = 0.03). In contrast, the explanatory power of chosen juice was statistically indistinguishable from that of chosen value A|B, both for the number of responses explained (p = 0.5) and for the total R2 value (p = 0.45).
These results indicated unequivocally that neurons in the amygdala encoded variables chosen value and offer value A|B. Conversely, they did not fully disambiguate between variables chosen juice and chosen value A|B. However, we noted that the latter is a collapsed variable (see Table 2) and thus has a statistical advantage. Conversely, the explanatory power of variable chosen juice was significantly higher than that of either variable chosen value A or chosen value B taken alone (Table 3, bottom rows). Furthermore, the present study aimed to contrast the results obtained for the amygdala with those obtained for the OFC, and we intended to be cautious in identifying differences between areas. Thus, we provisionally concluded in favor of the variable chosen juice. We classified each neuronal response in the population as encoding one of the three selected variables (offer value A|B, chosen value, and chosen juice). Each response was assigned to the variable providing the highest R2 value. We thus identified 165 offer value A|B responses, 588 chosen value responses, and 110 chosen juice responses. Figure 5d illustrates the distribution of R2 values obtained for each population.
We further analyzed the nature of this neuronal representation by computing, for each pair of variables and for each relevant response, the difference in R2 value obtained for the two variables. We then examined the distribution of ΔR2 (see Materials and Methods). For variables chosen value and offer value A|B, the distribution of ΔR2 presented a significant dip near zero (p = 0; Fig. 6b), indicating that the two groups of responses were categorically distinct. Similarly, for variables chosen value and chosen juice, the distribution of ΔR2 presented a significant dip near zero (p = 0; Fig. 6c). In contrast, for variables offer value A|B and chosen juice, the dip in the distribution of ΔR2 did not reach significance (p = 0.2; Fig. 6c; all p values from Hartigan's dip test).
In summary, our analyses indicated the presence of a large and categorically distinct group of neuronal responses encoding the variable chosen value. Other task-related responses appeared to form smaller and possibly noncategorical clusters centered on variables offer value A|B and chosen juice (or possibly chosen value A|B). In principle, a larger dataset could resolve the current ambiguity regarding these smaller clusters (i.e., the encoded variables and the categorical nature of the representation). For the rest of this study, we conservatively classified neuronal responses in three groups defined by variables offer value A|B, chosen value, and chosen juice. A spatial reconstruction of the recording locations (Fig. 2) did not reveal any clear topographic organization of the different groups of cells within the BLA.
U-shaped responses reflect the subjective nature of value
Many neuronal responses in the amygdala encoded the chosen value. Importantly, this variable reflected the relative value of the juices, which could vary to some extent from session to session. We thus conducted a specific analysis to assess whether variability in chosen value responses matched the variability observed at the behavioral level.
For each response, we examined separately trials in which the animal chose juice A and juice B. For each group of trials, we regressed firing rates against the chosen quantity, and we obtained the regression slopes, βA and βB. Responses for which both slopes differed significantly from zero (p < 0.01) were identified as U-shaped (see Materials and Methods). Figure 7a illustrates one example. If indeed this response encoded the chosen value, βA should be proportional to the value of juice A, βB should be proportional to the value of juice B, and the ratio βA/βB should equal the relative value of the two juices. Furthermore, this neuronal measure of relative value (ρneuronal = βA/βB) should be indistinguishable from the corresponding measure obtained from behavioral choices through the probit regression (ρbehavioral).
For the response in Figure 7a, we measured ρbehavioral = 2.2 and ρneuronal = 2.3 ± 1.0, consistent with the prediction. In itself, this observation did not rule out that this cell encoded some physical property of the juices, such as the sugar content. The key question was whether session-to-session variability in ρbehavioral was matched by variability in ρneuronal. To address this issue, we considered the whole population of U-shaped responses. We performed an ANCOVA (full model) regressing log(ρneuronal) on log(ρbehavioral) and grouping responses by the monkey and by the juice pair (Fig. 7b). We imposed that ≥10 data points be available for each juice pair. The analysis showed that the neuronal measure, log(ρneuronal), depended significantly on the behavioral measure, log(ρbehavioral) (p = 0.0004). Furthermore, an examination of the slopes and intercepts obtained for each juice pair revealed that, in most cases, the relation between log(ρneuronal) and log(ρbehavioral) was statistically indistinguishable from identity (Fig. 7c).
The results illustrated in Figure 7 demonstrate that U-shaped responses in the amygdala reflected the subjective nature of value. Indeed, if these neurons encoded any physical property of the juices, the ratio βA/βB would be constant and would not covary with the behaviorally measured relative value of the two juices. In other words, regression lines in Figure 7b would be horizontal. Session-to-session fluctuations in ρbehavioral can be seen as forms of natural devaluation and revaluation. The fact that the relation between ρneuronal and ρbehavioral is indistinguishable from identity means that U-shaped responses incorporate this internal variability.
Activity profiles and comparison with OFC
We next examined the time course of activation for neurons encoding the variables identified in the amygdala. Each cell was classified according to the sum(R2) across time windows. We pooled offer value cells associated with the two juices and chosen juice cells associated with the two juices. For both groups of cells, we refer to the encoded juice as “juice E” and to the other juice as “juice O.” For chosen juice cells, we defined juice E as that eliciting higher firing rates. Thus, our dataset included 55/16 offer value cells with positive/negative encoding, 214/38 chosen value cells with positive/negative encoding, and 35 chosen juice cells. We examined the activity profiles of these populations over the course of the trial.
For each chosen value cell, we divided trials into tertiles according to the chosen value (low, medium, and high), and we computed the activity profile for each tertile. We then averaged the profiles across neurons for each population, separately for positive and negative encoding. Focusing on cells with positive encoding (Fig. 8a), we noted that their activity became significantly modulated by the chosen value ∼200 ms after offer presentation. This modulation remained sustained throughout the delay and after the go signal, and the activity further increased shortly before juice delivery. Similarly, the activity of cells with negative encoding (Fig. 8b) became modulated by the chosen value within 250 ms after the offer. This modulation remained sustained throughout the trial until after juice delivery.
For offer value cells, we divided trials into tertiles according to the offer value of juice E. Focusing on the population with positive encoding (Fig. 8c), we found that the activity became significantly modulated by the encoded variables ∼200 ms after offer presentation. This modulation persisted throughout the delay, after the go signal, until after juice delivery. (The population of offer value cells with negative encoding was too small, resulting in noisy activity profiles, and was not analyzed further in this study.)
For chosen juice cells, we divided trials according to the chosen juice. Inspection of the activity profiles (Fig. 8d) reveals that this population reflected the choice outcome (i.e., the chosen juice) within 250 ms of the offer. These neurons maintained a sustained modulation throughout the delay, and this modulation further increased upon juice delivery.
The time profiles observed in Figure 8a–c appeared somewhat different from what was observed in previous studies of the OFC, where average firing rates were lower, and where offer value and chosen value cells presented more transient modulations (Padoa-Schioppa, 2013). Importantly, previous studies of the OFC were conducted in different animals. To confirm that differences in activity profiles revealed differences between brain regions as opposed to differences between individual animals, we collected a new dataset from the OFC. Specifically, we recorded the activity of 716 neurons (364 from monkey H; 352 from monkey I). Using the same procedures described above and in previous studies, we identified task-related responses with an ANOVA (p < 0.001), and we classified neurons according to the variables characteristic of the OFC—namely, offer value, chosen value, and chosen juice (Table 4). We then examined the activity profiles for the various populations, dividing trials into tertiles for offer value cells and chosen value cells and according to the choice outcome for chosen juice cells.
At the population level, the picture emerging from the new OFC data (Fig. 9) closely resembled that described previously (Padoa-Schioppa, 2013). In particular, both chosen value cells (Fig. 9a) and offer value cells (Fig. 9c) presented a largely phasic modulation that emerged ∼200 ms after the offer and mostly dissipated within 500 ms. Conversely, chosen juice cells (Fig. 9d) revealed the choice outcome starting ∼250 ms after the offer and throughout the rest of the trial.
Comparing the two areas, one notable difference was that firing rates in the BLA were generally higher than in the OFC. For example, the preoffer activity (baseline) of chosen value cells with positive encoding was ∼25 spikes/s (sp/s) in the BLA (Fig. 8a) and <10 sp/s in the OFC (Fig. 9a). Similarly, the preoffer activity of offer value cells with positive encoding was ∼17 sp/s in the BLA (Fig. 8c) and ∼7 sp/s in the OFC (Fig. 9c). Statistical analyses confirmed that the difference in baseline (preoffer activity) between the two areas was statistically significant at the population level. This result held true in each monkey (both p < 10−20, Kruskal–Wallis test; Fig. 10).
To further examine the encoding of decision variables over time and in the two areas, we analyzed firing rates in 250 ms sliding time windows shifted by 50 ms. For each cell and each time window, we regressed the firing rate against the encoded variable. Each regression provided a slope and a p value. We defined LP = −log10(p), which essentially quantified the strength of the encoding. Figure 11 illustrates the results obtained for the two groups of cells (offer value, chosen value) with positive encoding in the two areas. For each neuron, we defined the tuning latency as the first time for which p < 0.01. As illustrated in Figure 12, most tuning latencies were <500 ms.
Confirming the picture emerging from the population firing rates (Figs. 8, 9), it appeared that most chosen value cells in the BLA remained significantly tuned throughout the trial (Fig. 11b). In contrast, the tuning of most chosen value cells in the OFC appeared phasic (Fig. 11d). To quantify this effect, we focused on cells with a tuning latency of ≤250 ms, and we assessed how many of them were still tuned 1 s after the offer onset. We found that the proportion of chosen value cells presenting sustained tuning was significantly higher in the BLA than in the OFC (p = 0.002, χ2 test). This trend held true in each animal (p = 0.008 and p = 0.089). In contrast, no such difference across areas was found for offer value cells (all p > 0.5, χ2 test). In both the BLA and OFC, approximately equal numbers of offer value cells presented phasic versus sustained tuning.
To summarize, quantitative comparisons of the results obtained for BLA and OFC revealed several differences. First, mean firing rates were overall higher in the BLA than in the OFC (Fig. 10). Second, the proportion of task-related cells encoding the chosen value was significantly larger in the BLA than in the OFC (BLA, 252/398 = 64%; OFC, 93/231 = 41%; Pearson's χ2 test; p < 10−5). The higher prevalence of chosen value cells in the BLA compared with the OFC held true even when we considered separately the data for the two animals (p < 10−4, Mantel–Haenszel test). Third, the encoding of chosen value was more sustained in the amygdala compared with the OFC.
Discussion
Economic choice behavior entails the computation and comparison of subjective values. A large number of studies link these mental processes to the OFC. In contrast, the role of other areas remains uncertain. In this respect, arguably the strongest candidate is the amygdala. Together with the central OFC (Gallagher et al., 1999; Rudebeck et al., 2013) and the mediodorsal thalamus (Mitchell et al., 2007; Izquierdo and Murray, 2010), the amygdala is the only brain region where lesions affect performance in reinforcer devaluation tasks (Málková et al., 1997; Blundell et al., 2001; Setlow et al., 2002; Fiuzat et al., 2017; Hart and Izquierdo, 2017). Because values driving goal-directed behaviors are closely related to values driving economic choices (O'Doherty, 2014; Padoa-Schioppa and Schoenbaum, 2015), the known effects of amygdala lesions suggest that this area might also play a role in economic decisions. Supporting this notion, previous neurophysiology results are broadly consistent with a representation of value in the BLA (Schoenbaum et al., 1998, 2003; Sugase-Miyamoto and Richmond, 2005; Paton et al., 2006; Belova et al., 2008; Bermudez and Schultz, 2010; Roesch et al., 2010; Jenison et al., 2011; Grabenhorst et al., 2012; Peck et al., 2013; Leathers and Olson, 2017). At the same time, most previous studies of the BLA did not involve economic choices and/or did not contrast neuronal activity in the BLA with that recorded in the OFC. To fill these important gaps, we examined the activity of neurons in the BLA while monkeys performed a juice choice task. Approximately one-third of cells were task-related. Of these, 90% encoded one of the three decision variables: offer value, chosen value, or chosen juice. As a caveat, the explanatory power of chosen juice was statistically indistinguishable from that of variable chosen value A|B.
Intriguingly, the variables identified in the BLA are the same previously identified in the OFC (Padoa-Schioppa and Assad, 2006). In contrast, neurons in the dorsal and ventral ACC were found to encode different sets of variables (Cai and Padoa-Schioppa, 2012). Aside from the caveat mentioned above, the BLA and OFC appeared to differ in two important ways. First, the majority—∼2/3—of task-related cells in the BLA encoded the chosen value. In contrast, here and in other studies, cell counts for the three groups of cells in the OFC were more balanced, with 1/3–1/2 encoding the chosen value (Padoa-Schioppa, 2013; Xie and Padoa-Schioppa, 2016). Second, the two areas differed for the time course of this signal. Specifically, chosen value cells in the BLA presented high firing rates and a sustained modulation throughout the trial. In contrast, chosen value cells in the OFC had lower firing rates, and their modulation was generally phasic. In this respect, current measures for the OFC replicate previous findings (Padoa-Schioppa, 2013).
The implications of our results are potentially profound. The proposal that economic decisions are generated in a neural circuit within the OFC was partly motivated by the presence of different groups of neurons in this area capturing both the input (offer value) and the output (chosen juice, chosen value) of the decision process (Padoa-Schioppa and Conen, 2017). The current results broaden this perspective, suggesting that neuronal populations in the BLA may play an active role in the decision. However, more work is necessary to test this hypothesis. For one, the preponderance of chosen value responses and the time course of activation—sustained throughout the trial and peaking after juice delivery—suggest a role of the BLA in reward expectation and possibly consummatory behavior, as opposed to decision-making per se. Moreover, the lines of evidence implicating the OFC in economic decisions include the correlation between choice variability and neuronal variability in each of the three groups of cells (Padoa-Schioppa and Conen, 2017), the fact that disrupting neuronal activity in the OFC disrupts choices (Kuwabara et al., unpublished observations), and the fact that one can predictably bias choices through low-current electrical stimulation (Ballesta et al., unpublished observations). Similar experiments should be conducted on the amygdala to ascertain its possible role in economic decisions.
Aside from whether or how BLA participates in the decision process, the fact that neurons in this area encode subjective values deserves some comment. Numerous previous results are consistent with a neuronal representation of economic value in this area (Kahn et al., 2002; Gottfried et al., 2003; Schoenbaum et al., 2003; Belova et al., 2008; Bermudez and Schultz, 2010; Jenison et al., 2011; Peck et al., 2013; Leathers and Olson, 2017). However, the behavioral paradigms used in most previous studies did not provide an operational measure of value. Thus, previous work did not examine whether contextual variability in subjective values measured behaviorally was matched at the neuronal level. In this respect, the present results are noteworthy. In particular, the identity between the neuronal and behavioral measure of relative values demonstrates that neurons in the BLA reflect the subjective nature of values as opposed to any physical property of the goods. Broadening the perspective, neuronal signals encoding the chosen value have been found in numerous brain regions, including the OFC (Padoa-Schioppa and Assad, 2006; Sul et al., 2010), dorsolateral and ventrolateral prefrontal cortex (Cai and Padoa-Schioppa, 2014), ventromedial prefrontal cortex (Strait et al., 2014), pregenual cingulate cortex (Amemori and Graybiel, 2012), dorsal and ventral ACC (Cai and Padoa-Schioppa, 2012; Hosokawa et al., 2013), and dorsal and ventral striatum (Lau and Glimcher, 2008; Cai et al., 2011). It may be noted that only a few studies have conclusively shown that neuronal signals reflected the subjective nature of values. With this caveat, the present results extend previous findings and contribute to the understanding that value signals are widely distributed and likely inform a large number of neural and cognitive processes.
Footnotes
This research was supported by the National Institutes of Health (Grant R01-MH104494 to C.P.-S.). We thank Alessandro Livi, Weikang Shi, and Manning Zhang for comments on the manuscript.
The authors declare no competing financial interests.
- Correspondence should be addressed to Camillo Padoa-Schioppa at camillo{at}wustl.edu