Abstract
The importance of face memory in humans and primates is well established, but little is known about the neurotransmitter systems involved in face recognition. We tested the hypothesis that face recognition is linked to dopamine (DA) activity in fusiform gyrus (FFG). DA availability was assessed by measuring D1 binding potential (BP) during rest using PET. We further assessed blood-oxygen-level-dependent (BOLD) signal change while subjects performed a face-recognition task during fMRI scanning. There was a strong association between D1 BP and BOLD activity in FFG, whereas D1 BP in striatal and other extrastriatal regions were unrelated to neural activity in FFG. These results suggest that D1 BP locally modulates FFG function during face recognition. Observed relationships among D1 BP, BOLD activity, and face-recognition performance further suggest that D1 receptors place constraints on the responsiveness of FFG neurons.
SIGNIFICANCE STATEMENT The importance of face memory in humans and primates is well established, but little is known about the neurotransmitter systems involved in face recognition. Our work shows a role for a specific neurotransmitter system in face memory.
Introduction
Face processing is mediated by a dedicated functional anatomy (Haxby et al., 2000). Research has established a face-specific network that includes inferior occipital cortex (Gauthier et al., 2000) and fusiform gyrus (FFG; Sergent et al., 1992; Kanwisher et al., 1997) for processing invariant aspects of faces (Gobbini and Haxby, 2007). These regions interact with others (e.g., cingulate cortex, amygdala) to support operations such as new-face encoding, familiar-face recognition, and processing complex meaning in faces (Haxby et al., 2000). Although the neuroanatomy of episodic face recognition is well established, little is known about the neurotransmitter systems involved. Here, we tested the hypothesis that DA plays a principal role in face recognition by assessing associations between DA D1 binding potential (BP) in several brain regions and FFG blood-oxygen-level-dependent (BOLD) activity during face recognition.
Evidence from patient studies, animal research, pharmacologic challenges, specific genetic polymorphisms, and molecular PET and SPECT imaging studies implicate DA in multiple cognitive domains (Bäckman et al., 2006, 2010; Bäckman and Nyberg, 2013; Schultz, 2007a,b). Although most of this work has linked DA to working-memory performance (Williams and Goldman-Rakic, 1995; Luciana and Collins, 1997; Cohen et al., 2002; Cools et al., 2007; Dodds et al., 2009; Landau et al., 2009; Fischer et al., 2010; Cools and D'Esposito, 2011), PET research has associated DA with episodic-memory performance, including verbal and spatial recall, word recognition, and paired-associate learning (Bäckman et al., 2000; Erixon-Lindroth et al., 2005; Cervenka et al., 2008, 2010; Takahashi et al., 2008). Most PET studies linking DA to episodic memory have used markers of striatal and extrastriatal D2 receptor binding. Rodent work indicates that hippocampus D1-receptor activation during encoding is critical to establishing persistent episodic-like memories (Rossato et al., 2009; Bethus et al., 2010); however, there is little corresponding human research in episodic memory focused on D1 receptors.
Based on rodent pharmacoimaging, Knutson and Gibbs (2007) asserted that striatal DA D1 activity largely determines the magnitude of the BOLD signal. Similar human DA–BOLD links have been demonstrated using presynaptic DA markers and markers of D1 and D2 receptor binding during both working memory and episodic memory performance (Landau et al., 2009; Nyberg et al., 2009; Bäckman et al., 2011; Rieckmann et al., 2011a). Of relevance to the current study, Schott et al. (2008) observed associations between DA release and BOLD activity in ventral striatum during reward-related learning. Distributed DA-mediated reward circuitry is known to overlap with the face-specific network (Gurevich and Joyce, 1999; Martinezet al., 2003; Schultz, 2007a,b). One objective of the current study was to assess relationships between D1 receptor binding and BOLD activity in FFG during face recognition.
Evidence suggests a role for DA in episodic face recognition. Pharmacological data indicate that mesolimbic DA projections extend beyond hippocampus to parahippocampal regions including FFG (Tessitore et al., 2002; Kim et al., 2010). In one study (Kim et al., 2010), subjects underwent fMRI during categorization-task performance after taking either placebo or l-DOPA (a precursor to DA and other catecholamines). These investigators found l-DOPA-related activation increases in one face network region, FFG, suggesting that exogenous DA may modulate FFG BOLD activity. Other studies demonstrate that long-term potentiation mechanisms linked to episodic-memory formation depend on tonic DA-induced hippocampal activity that stimulates ventral tegmentum (VTA), promoting DA release in parahippocampal structures including FFG (Schultz, 2007a,b; Lisman et al., 2011).
We assessed whether DA plays a role in modulation of FFG BOLD activity during face recognition and if this modulation is related to face-recognition performance. We measured FFG D1 receptor BP using PET and BOLD using fMRI during face recognition. Discriminant validity of FFG D1–BOLD associations was examined by assessing relationships of D1 BP in striatal and extrastriatal regions to FFG BOLD signal amplitude. We further assessed the relationships between BP and BOLD in other face-specific regions (i.e., anterior cingulate cortex, amygdala, and inferior occipital gyrus). Finally, we examined relationships among FFG D1 BP, FFG BOLD, and face-recognition performance. Neurocomputational models (Li et al., 2001, 2010) suggest that increased neural signal gain with DA input yields more distinct memory representations and improved performance. Therefore, we tested whether the degree to which BOLD is modulated by D1 BP in FFG (i.e., the BOLD:DA ratio) predicts face-recognition performance.
Materials and Methods
Subjects.
Twenty right-handed subjects (ages 22–30 years, M = 25.2, SD = 2.2; 10 females) were recruited by local newspaper ads in the Stockholm metropolitan area. All participants reported that they were nonsmokers with no history of drug or alcohol abuse, significant neuropsychiatric disorders, or brain damage. Written informed consent was obtained from the participants and all procedures, including [11C] SCH23390 dosage levels, were approved by the Ethics and Radiation Safety Committees of the Karolinska Institute, Stockholm, Sweden. Two participants were excluded from subsequent analyses for behavioral accuracy that was >2.5 SDs below the sample mean (overall Maccuracy = 61%; outlier 1 Maccuracy = 14%; outlier 2 Maccuracy = 18%).
Procedure.
Participants first completed health and cognitive screening. They then underwent a PET measurement lasting 61 min to acquire data for D1 BP calculation. On a subsequent occasion, not >1 week after the PET scanning, subjects were shown 24 face pictures outside of the MR scanner and instructed to encode them for later retrieval. Approximately 24 min later, during fMRI scanning, subjects were shown the set of 24 previously encoded faces intermixed with a set of 24 new faces in a yes/no recognition test
fMRI task.
Before fMRI scanning, outside of the scanner, subjects were exposed to 24 faces (half male, half female) during an intentional encoding session with the instructions “You will now see a number of faces. Your task is to remember them for a later test.” Approximately 24 min after conclusion of intentional face encoding, subjects performed a face-recognition task during fMRI scanning. In this task, they alternated between sets of face-recognition trials and sets of sensorimotor button-press trials. During the sensorimotor trials, subjects pressed a button each time a cross-hair or circle appeared in the middle of the screen. On each face-recognition trial, a target or distractor face was presented for 2 s followed by a 500 ms intertrial interval during which a cross-hair appeared on the screen. Subjects pressed a forefinger-response button to indicate that they recognized the face from the previously encoded set and they pressed a right middle-finger button to indicate that they did not recognize the face. During the sensorimotor trials, the letter “x” or a circle was presented for 2 s every 0.5 s and subjects pressed a button with either finger when one of these target objects appeared.
There were eight face-recognition blocks and eight sensorimotor blocks. In each face-recognition block, subjects viewed six faces, half of which had been presented previously. On the sensorimotor blocks, subjects responded to the letter on half the trials and responded to the circle on half the trials. Each block took 15 s and there was an interblock interval of 1.25 s. The face-recognition and sensorimotor blocks were in turn grouped into two sets of four blocks each and presented in two separate runs. There were four dummy scans added at the beginning of each run to accommodate initial magnetization. Therefore, each run took 140 s and the entire procedure lasted ∼5 min including instructions.
fMRI scanning and BOLD analysis.
Whole-brain imaging data were acquired on a 1.5 TE Signa Echospeed MR-scanner (GE Medical Systems) using a standard circular 1-channel head coil. T1-weighted 3D-SPGR images (TR = 24 ms, TE = 6 ms, flip angle = 35°) were acquired for anatomical coregistration in 124 contiguous 1.5 mm coronal slices (image resolution = 256 mm × 256 mm × 186 mm, voxelsize = 0.9 mm × 0.9 mm × 1.5 mm). Functional images were acquired using a T2*-sensitive gradient-echo EPI sequence (TR = 2.5 s, TE = 40 ms, flip angle = 90°). The image volumes had a field of view of 220 mm × 220 mm, an in-plane resolution of 3.44 mm × 3.44 mm, and contained 32 horizontal, 4-mm-thick slices with a 0.5 mm gap in between each slice. All images were acquired interleaved. During the fMRI session, 104 image volumes were obtained across the two scanning runs. All data processing was performed using SPM2 (Wellcome Department of Cognitive Neurology, UK). Functional images were first spatially realigned to the first volume in each time series. Inspection of movement parameters generated during the spatial realignment showed that no participant had moved >3 mm or 3° in any direction during task performance. Volumes were then normalized to a standard T1 template. Normalized images were spatially smoothed with a Gaussian kernel of 12 mm full-width at half-maximum. High-frequency noise was removed using a band-pass filter at 128 Hz. The hemodynamic response function acted as the low-pass filter.
Face-recognition versus sensorimotor BOLD effects were modeled using a box-car function convolved with a canonical hemodynamic response function. Whole-brain analyses were performed using directional t tests of face-recognition versus sensorimotor periods. Average β values were then extracted from regions of interest (ROI), which comprised 4 mm radius spheres centered around peak activation (t-score) within anatomical brain regions of the WFU Pickatlas (Maldjian et al., 2003). Face-specific ROIs included anterior cingulate, amygdala, FFG, and inferior occipital gyrus.
PET scanning and DA D1 analysis.
PET data were collected on an ECAT Exact HR 47 system (Siemens) in 3D mode with trans-axial resolution of 3.8 mm full-width at half-maximum and a field of view of 4.5 mm radially 20 mm from center. Transmission measurements were collected over 10 min with 3 rotating 68Ge-68Ga sources. Then, 300 MBq of the [11C]SCH23390 radioligand was rapid-bolus injected into the left antecubital vein. Emission data were collected over the following 51 min in 13 time frames of increasing duration.
ROIs were manually delineated on each individual's T1-weighted image separately for each hemisphere using the Human Brain Atlas software (Roland et al., 1994). ROIs derived for subsequent analyses included: FFG, ventral striatum, caudate, putamen, hippocampus, amygdala, anterior cingulate cortex, orbital frontal cortex, medial prefrontal cortex, and dorsolateral prefrontal cortex (Rieckmann et al., 2011b). These regions were segmented into gray matter, white matter, and CSF to minimize partial-volume effects. Subsequent analyses with unilateral ROIs did not produce different results (i.e., DA–BOLD relationships); therefore, we aggregated unilateral ROIs across hemispheres.
BP calculation.
Time-activity curves (TACs) were calculated from the PET images. For TAC generation, radioactivity was plotted versus time and corrected for decay rate. D1 receptor availability was measured as the BP of the [11C]SCH23390 radioligand. BP is defined as the ratio at equilibrium of specifically bound radioligand to that of nondisplaceable radioligand in tissue (Innis et al., 2007) and calculated using the simplified reference tissue model with cerebellum as the reference region (Lammertsma and Hume, 1996) and corrected for partial-volume effects (Meltzer et al., 1990). Therefore, BP may be thought of as density of available D1 receptors indexed by the extent of radioligand occupancy of postsynaptic D1 receptors.
Results
BOLD activity in fusiform gyrus
BOLD signal contrast between the face-recognition and button-press tasks indicated suprathreshold voxels in FFG during face-recognition performance (Fig. 1).
D1–BOLD relationships
We tested whether D1 BP predicts BOLD signal variability in FFG during face recognition by performing a linear regression of FFG D1 BP estimates and their corresponding BOLD parameter estimates. The standardized correlation coefficient revealed a positive relationship between these variables (β = 0.62; R2 = 0.38, p < 0.007; Fig. 2; regression diagnostics did not identify any unduly influential data points; Belsley et al., 1980). No such relationships were observed in any other face-specific network regions (i.e., anterior cingulate cortex, amygdala, inferior occipital gyrus; all p > 0.05).
To determine whether the relationship between FFG D1 BP and BOLD was unique to DA activity in FFG, we computed corresponding correlations for D1 BP in limbic (anterior cingulate, amygdala, hippocampus), striatal (ventral striatum, caudate, putamen), and prefrontal (e.g., orbito-, dorsolateral, medial) brain regions with FFG BOLD. None of these associations outside of FFG approached significance (all p > 0.10; Fig. 3).
Relationships to performance
Overall, d′ scores indicated that subjects performed well at the group level (M = 1.25; SD = 0.43). Among individuals with high d′ scores (i.e., those subjects whose face-recognition scores were above the median), most of their data points fell above the regression line in the DA-BOLD regression analysis suggesting that, for these subjects, there was greater BOLD activity relative to D1 receptor availability compared with low d′ performers (i.e., those subjects whose performance was below the median). Indeed, d′ was significantly related to individuals' residual scores (r = 0.49, p < 0.04). This relationship indicates that higher BOLD activity relative to D1 receptor availability was associated with better face recognition. It is worth noting that neither BOLD nor D1 receptor availability alone was associated with individual differences in performance (p > 0.05). These results are consistent with recent work on the physiological basis of the BOLD response, which suggests an important role for DA in modulation and the activation necessary for distinct memory representations and accurate performance (Knutson and Gibbs, 2007; Karlsson et al., 2009; Li et al., 2009, 2010). On this view, higher neural activity (as approximated with BOLD in the present study) is achieved with lower levels of neural input (as measured by D1 BP in the present study), resulting in better memory performance compared with the case in which the same high activation levels must be achieved with higher levels of neural input (Li et al., 2010)
We formally tested this relationship by first calculating the proportion of subjects' standardized BOLD β values relative to their standardized DA BP values (i.e., their BOLD/BP ratio). Figure 4 illustrates that this ratio significantly predicted face-recognition performance (β = 0.48, R2 = 0.23, p = 0.04). A Mahalanobis distance analysis identified two outliers in this analysis. These persons were discarded and a separate FFG BOLD:BP ratio model predicting d′ was run. This model was also significant (β = 0.57, R2 = 0.32, p = 0.02).
Discussion
We examined whether D1 BP predicts neural activity in FFG during episodic face recognition. A strong relationship between D1 BP and BOLD activity in FFG during face recognition was observed. This is the first demonstration of a role of D1 receptors in face recognition and the first to indicate a D1–BOLD association in FFG.
We observed a region-specific effect wherein D1 BP mediated FFG BOLD activity. This result is consistent with previous work on DA-BOLD links in striatal and associated midbrain regions and their anterior extensions to prefrontal cortex (Schott et al., 2008; Landau et al., 2009; Nyberg et al., 2009; Bäckman et al., 2010; Cools and D'Esposito, 2011; Rieckmann et al., 2011a). Such results are thought to reflect a social cognition network for processing emotional and reward-related information conveyed in face stimuli (cf. Siessmeier et al., 2006; Schultz, 2007a,b). Similarly, relationships that we observed between D1 BPs in ventral striatum and anterior cingulate with FFG BOLD, whereas not significant, were probably not trivial. Further research is needed to assess the extent to which distal modulation of FFG can occur during face-recognition processing. The present findings do, however, show that DA affects posterior regions by local modulation of BOLD activity (cf. Jabbi et al., 2013). These results occurred, despite the nearly fivefold reduction in D1 receptors compared with the striatum (Rieckmann et al., 2011b). This link between FFG D1 BP and BOLD suggests that dopamine can act locally to influence face-specific regions within a social cognition network.
The specific D1–BOLD association in FFG might reflect the neural demands of the face-recognition task that participants were engaged in during BOLD imaging. Face recognition is known to depend upon FFG function (Kanwisher et al., 1997; Henson et al., 2000). Tasks emphasizing other aspects of face processing (e.g., attractiveness) might elicit D1 BP-BOLD associations in other face-relevant regions (Schultz, 2007a,b). Further research is needed to fully understand the region-specific nature of the associations observed here.
How might DA function at the FFG site to modulate the BOLD response during face recognition? In the mesolimbic DA system, cell bodies extending from VTA to temporal lobe structures as far as FFG might synthesize DA for downstream function during face processing. Similar to DA mechanisms that have been illustrated in midbrain (Knutson and Gibbs, 2007), local DA FFG neurons could function by a process wherein, in response to face-recognition demands, local DA release activates postsynaptic D1 receptors, changing their polarity and increasing their metabolic activity, thereby giving rise to local increases in BOLD activity (Zonta et al., 2003; Hutchison et al., 2013; Hubbard et al., 2015). This metabolic activity, in turn, could stimulate astrocytic mechanisms that mediate neurovascular interactions, which give rise to the BOLD signal. This mechanism is known to lead to expansion of local vascular lumen, as well as increased cerebral blood flow and velocity (Takano et al., 2006; Rossi, 2006). Subsequently, presynaptic DA transporters reuptake unbound DA leading to reversal of postsynaptic polarities, reductions in FFG neural activity, and the accompanying BOLD response (Knutson and Gibbs, 2007). Therefore, the mechanism whereby DA modulates BOLD activity by constraining postsynaptic neural responsiveness in distributed cortical regions such as FFG might resemble its modulatory influences during adaptive cognition (see Schultz, 2007a,b).
DA is a diverse neurotransmitter the functions of which range from somatomotor coordination (Carlsson et al., 1958; Schultz et al., 1989) to motivational (Faure et al., 2005), reward-learning (Schott et al., 2008), and cognitive (Williams and Goldman-Rakic, 1995; Inase et al., 1997; Fallon and Cools, 2014) functions (Schultz, 2007a,b). Such functional diversity might be undergirded by reward timing in brain regions critical to the task at hand. Isomorphism between face and reward processing has been suggested by studies demonstrating attractiveness-related variability in face-network regions such as amygdala, which is highly responsive to both attractive and disfigured faces (Kampe et al., 2001; Aharon et al., 2001; Senior, 2003; Liang et al., 2010). Therefore, although D1 BP is associated with FFG BOLD in episodic face recognition in general, it might be linked to neural activity in amygdala and other reward-related regions in processing facial attractiveness.
Our data support the hypothesis that D1 BP modulates BOLD activity by placing constraints on the responsiveness of postsynaptic neurons (Choi et al., 2006; Knutson and Gibbs, 2007). However, they do not rule out other possible mechanisms through which BOLD activity constrains DA activity. For example, similar to mechanisms posited in striatum (Bäckman and Nyberg, 2013), the BOLD response in FFG could modulate DA activity by phasically inhibiting GABAergic neurons. In this view, increased DA activity results from, rather than leads to, increased FFG BOLD activity. A third possibility is that DA and BOLD activity mutually influence one another through the mechanisms discussed above. Finally, we cannot rule out a role for serotonin in this context insofar as its postsynaptic availability is also reflected in the affinity of SCH23390 (Ekelund et al., 2007). However, for several reasons, such potential influences are likely to be negligible (Andersen et al., 1992; Rieckmann et al., 2011b).
We observed higher face-recognition performance for individuals who showed greater BOLD response relative to their DA availability (i.e., those with a higher BOLD–BP ratio). Conversely, those with low BOLD responses relative to DA availability performed less accurately. Similar to prefrontal activity during working memory, subjects for whom higher BOLD activation is attained with lower levels of DA neural input achieve higher gains in the sigmoidal function relating DA input to activation (Li et al., 2001; Li et al., 2010). As has been shown with behavioral associations, larger gain should yield higher neural signal relative to background noise, more distinct memory representations, and higher memory performance (Li and Sikström, 2002; Li et al., 2009). Conversely, subjects for whom higher BOLD activity can only be reached with high levels of DA input should achieve lower gains, less distinct memory representations, and reduced memory performance (Li et al., 2010; Nyberg et al., 2012).
In sum, strong regional associations between D1 BP and BOLD activity in FFG were observed, whereas D1 BP in striatal and other extrastriatal regions were unrelated to BOLD activity in FFG. These results suggest that D1 BP modulates neural function in FFG during face recognition. Relationships among DA, BOLD activity, and performance further suggest that the extent to which DA availability drives FFG neural activity is linked to face-recognition accuracy.
Footnotes
This work was supported by the University of Texas at Dallas Faculty Development Leave Program, the National Institutes of Health (Grants R01AG029523 and R01AG047972 to B.R.); the Swedish Research Council, Swedish Council for Working Life and Social Research, Swedish Brain Power, an Alexander von Humboldt Research Award, and a donation from the af Jochnick Foundation (L.B. and H.F.), and the Torsten and Ragnar Söderberg's Foundation (L.B. and L.N.). Significant technical and conceptual portions of this work were completed while B.R. was in residence at the Aging Research Center of the Karolinska Institute in Stockholm, Sweden. We thank Saranya Sundaram, Brooke Gomez, and Monroe Turner for technical support.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Bart Rypma, School of Behavioral and Brain Sciences, Green Hall, GR41, University of Texas at Dallas, 800 West Campbell Road, Richardson, TX 75080. bart.rypma{at}utdallas.edu