Abstract
Previous advances in magnetic resonance imaging allow the analysis of blood oxygen level-dependent signals in real time, thus opening the possibility of feeding an index of these signals back to scanned human participants. However, it is still not known to what extent different cortical networks may differ in their sensitivity to such internally generated neurofeedback (NF). Here, we compare NF efficacy across six cortical regions including: early and high-order visual areas and the posterior parietal lobe, a prominent node of the default mode network (DMN). Our results reveal a consistent difference in NF activation across these areas. Sham controls ruled out a role of attention/arousal in these effects. These differences are suggestive of a relationship to the relative reliance on intrinsic information, moving from early visual cortex (lowest) to the DMN (highest). Interestingly, the visual parahippocampal place area showed NF activation closer to the DMN node. The results are compatible with the notion of the DMN as an intrinsically oriented system.
Introduction
The “default mode network” (DMN) is a large-scale system that shows a consistent reduction in activity when subjects engage in externally oriented tasks (Raichle and Snyder, 2007). We and others have hypothesized that this network may in fact be specialized for intrinsically oriented functions that are detached from the immediate external environment (Gusnard et al., 2001; Golland et al., 2007; Buckner et al., 2008; Preminger et al., 2011). However, it should be noted that the specialization trend toward reliance on intrinsic information is not a categorical distinction of the DMN. For example, within the visual system, as one moves along the cortical hierarchy, there is a tendency for brain activation to be dissociated from the external physical stimulation and become closer to the perceptual inner state of the individual (Treisman and Kanwisher, 1998; O'Craven and Kanwisher, 2000; Avidan et al., 2002; Grill-Spector and Malach, 2004; Lee et al., 2007).
Thus, an important characteristic of cortical functional specialization appears to be a differential link to extrinsic versus intrinsically derived signals. Although most studies that explore this issue were based on carefully controlled stimuli and tasks, it occurred to us that a complementary perspective could be provided by the newly introduced method of magnetic resonance (MR)-based neurofeedback (NF; MR-NF). The NF method is based on feeding a representation of brain activations back to the participants during the MR scanning (Yoo and Jolesz, 2002; Weiskopf et al., 2004b; Goebel and Linden, 2014).
Here, we hypothesize that if indeed the DMN is a network tuned to intrinsically derived information, it should be more amenable to the NF method compared to the extrinsically oriented visual system. It should be noted that this prediction is diametrically opposite to what is typically found during external stimulation, in which the DMN network shows a reduced response to external stimuli while the visual system is positively activated by such stimuli (Shulman et al., 1997; Buckner et al., 2008; Singh and Fawcett, 2008; Preminger et al., 2011). Furthermore, within the visual system proper, if indeed high-order visual areas are more dependent on the inner state of the individual, then we should expect them to also show a higher NF sensitivity compared to the more externally linked early visual areas.
To test our hypothesis, we conducted a within subject comparison of MR-NF activations across six ROIs, chosen to represent networks of presumably different links to internal information. These included a prominent node of the DMN [located in the inferior parietal lobe (IPL)] and five visual regions: early visual cortex and the high-order category selective regions lateral occipital complex (LO), extrastriate body area (EBA), fusiform face area (FFA), and parahippocampal place area (PPA) (Grill-Spector and Weiner, 2014).
Our results reveal that the DMN-related IPL showed significantly higher NF activations compared to the majority of visual areas, whereas early visual cortex showed the lowest activations. These results highlight the usefulness of MR-NF as a new mapping tool and are compatible with the hypothesis tying the DMN network to internally related information.
Materials and Methods
Subjects.
Eight healthy subjects (four females; mean age ± SD, 28.3 ± 2.06; range, 26–32) participated in the experiment. All participants were right-handed, had normal hearing and normal or corrected-to-normal vision, and provided written informed consent for their participation. The Hertzog Hospital Ethics Committee approved the experimental protocols.
Tasks and stimuli.
The experiment was conducted on 2 consecutive days (Fig. 1). On the first day, subjects performed an external visual localizer task (9.5 min). On the following day, the subjects performed an imagery localizer task (5.5 min) for each of the target ROIs, after which they underwent a real-time (rt) fMRI NF run for that ROI (5.5 min). The imagery and NF runs for the different ROIs were counterbalanced between the subjects.
Localizer.
To localize the visual regions of interest to be enhanced during training (see below), subjects performed two localizer tasks, one externally driven and the other internally driven. The externally driven visual localizer was a visuomotor categorization task that consisted of 9 s blocks of pictures from five different categories (faces, houses, bodies, tools, and textures) separated by rest periods (6 s). The internally driven localizer was an imagery task that consisted of blocks of imagery with intermittent rest periods (40 and 20 s, respectively). Before the scans, subjects were given instructions for the imagery tasks that later repeated as the suggested initial cognitive strategy for the NF run. Strategies included the following: for the IPL, episodic memory or future simulation (“recall a memory from your personal past or simulate a future situation”; Andrews-Hanna et al., 2010); for the PPA, imagery of a familiar path and/or buildings; for the FFA, imagery of familiar faces (Ishai et al., 2000; O'Craven and Kanwisher, 2000); for the EBA, imagery of full bodies or body parts (Blanke et al., 2010); for the LO, imagery of tools or their manipulation (Zhang et al., 2004); and for the early visual cortex, imagery of texture patterns (Kosslyn et al., 1999). Following the localizer experiment, a target ROI was defined based on real-time analysis of the BOLD activation during the localizer runs. The targeted ROIs were compared to those from a more conventional visual localizer experiment conducted on all of the subjects, which allowed verification of the imagery localizer using the more conventional category selectivity (Hasson et al., 2002) and DMN (Golland et al., 2007; Preminger et al., 2011) localization criteria (for details, see Definitions of ROIs, below). In cases where the imagery localizer failed to reveal a consistent activation (particularly in the case of the EVA), the ROI was based on the conventional localizer map.
Neurofeedback training.
After defining the six target ROIs for each subject, the six MR-NF (Weiskopf et al., 2004b; DeCharms et al., 2005; Johnston et al., 2010) experiments were run counterbalanced within the same scanning session. The experiments were run following a similar paradigm to our previous NF study of frontal cortex activation (Harmelech et al., 2013). The NF run constituted a 5.5 min scan that included blocks of elevation and rest (Fig. 1). For the elevation condition in each experiment, participants were given an initial strategy that corresponded to the one used in the imagery localizer. However, their task was to increase the pitch of the feedback tone as much as possible, and they were encouraged to modify their strategy if that produced a higher feedback tone in the NF experiment. The subjects were also informed of the existence of a delay of ∼6 s in the feedback (resulting from the hemodynamic lag). Each block lasted 40 s and was followed by a 20 s rest period, both marked by brief (<1 s) auditory cues. During each run, subjects were provided with ongoing auditory feedback, a tone (2000 ms in length) every time point, whose pitch indicated the level of activity in the target ROI out of seven levels. For each time point, the baseline was calculated by extracting the minimum and maximum values from a sliding window of the last 10 time points and subtracting them to calculate the dynamic activation range. The activation range was multiplied by 1.1, in case the activation level at a given time point exceeded that range. The relative level of activity was calculated as the ratio between the average ROI value for the given time point minus the minimum value and the baseline [given time point value minus min/1.1(max minus min)]. Both the auditory cues and the real-time feedback were provided via Confon HP-SC 01 headphones (by MR Confon). Auditory feedback was chosen after a few pilot experiments with different types of feedback (two types of visual, auditory, and bimodal) were done. The feedback that yielded the best results was the auditory feedback. The subjects also reported such auditory feedback to be less interfering with their mental strategy, while being salient enough as feedback and for monitoring.
Reports.
After the imagery localizer and NF scan, the subjects were asked to rate the similarity of their thought content during the imagery localizer run and the neurofeedback on a scale of 1 to 5, 1 being very similar and 5 being very different.
Sham feedback control.
Six of the eight subjects participated in a sham feedback control experiment. The experiment was identical to the NF experiment, except that feedback was sampled from a different (irrelevant) ROI (V1) from the target ROI. All subjects underwent two runs, one for each of the two most successfully activated ROIs (the IPL and the PPA). In each run they were given the exact same instructions and strategy as in the NF experiment, only this time they were unknowingly given feedback according to their activity in V1.
Imaging setup.
The scans were performed on a 3T Trio Magnetom Siemens scanner at the Weizmann Institute of Science, Rehovot, Israel. Three-dimensional T1-weighted anatomical images were acquired with high-resolution 1 mm slice thickness (3D MP-RAGE sequence; TR, 2300 ms; TE, 2.98 ms; 1 × 1 × 1 mm voxels). BOLD contrast was obtained with gradient echoplanar imaging sequence (TR, 2000 ms; TE, 30 ms; flip angle, 90°; FOV, 240 mm; matrix size, 80 × 80; scanned volume, 35 axial slices of 4 mm thickness, no gap, 3 × 3 × 4 mm voxel, anterior commissure–posterior commissure).
Data analysis and preprocessing.
Real-time fMRI data were analyzed with the Turbo BrainVoyager and BrainVoyager software packages (Brain Innovation) and with complementary in-house software. The first two images of each functional scan were discarded to allow for T1-equilibration effects. The functional images were superimposed on 2D anatomic images and incorporated into the 3D data sets through trilinear interpolation. The cortical surface in a Talairach coordinate system (Talairach and Tournoux, 1988) was reconstructed for each subject from the 3D MP-RAGE scan. Data were corrected for slice-dependent time shifts for each volume and 3D head motion. Motion parameters generated in the latter process were used later as nuisance regressors in the general linear model (GLM) and correlation analyses. fMRI images from the localizer tasks and the NF session were spatially smoothed with a Gaussian kernel with a full width at half maximum (FWHM) of 4 mm. Temporal filtering of linear trends was performed to account for low-frequency noise (e.g., scanner drift). All regressors were modeled as boxcar functions convolved with the hemodynamic response function. A hemodynamic lag of 3–6 s was assumed for each subject. The analysis was performed independently for the time course of each individual voxel. After computing the coefficients for all regressors, a Student's t test between coefficients of different conditions (e.g., PPA elevation vs rest) was performed. Multisubject analysis was based on a random-effects GLM. The multisubject functional maps were projected on an inflated or unfolded Talairach normalized brain. Significance levels were calculated, taking into account the minimum cluster size and the probability threshold of a false detection of any given cluster. This was accomplished by a Monte Carlo simulation (cluster-level statistical threshold estimator in BrainVoyager) using the combination of individual voxel probability thresholding (α was set at 0.05 throughout). Spatial smoothing with a Gaussian kernel with a FWHM of 4 mm was used for individual subjects and 8 mm for the multisubject map.
Definitions of ROIs.
All ROIs were defined individually for each subject in two stages. In the first stage (day 1), the regions were defined according to an external conventional visual localizer as the activated voxels located within 15 mm of the multisubject activity center, in the following GLM contrasts: PPA, defined using the contrast “houses > faces”; FFA, defined using the contrast “faces > houses”; EBA, defined using the contrast “body parts > textures”; LO, defined using the contrast “tools > textures”; early visual, defined using the contrast “textures > rest”; IPL, defined as the top inactivated region.
In the second stage (day 2), the target regions for the neurofeedback were defined as the regions of overlap (r = 15 mm) between the ROIs defined by the external conventional visual localizer and those defined by the imagery-strategy localizer (Table 1), except for cases of failure of imagery activation, where the ROI was based exclusively on the conventional visual localizer.
Statistical testing for ROI analysis.
To compare the different ROI activations for the different conditions (neurofeedback vs imagery, neurofeedback vs sham, imagery vs sham), and given the relatively small sample size, nonparametric Friedman's tests were used. P values were corrected for multiple comparisons using the Benjamini–Hochberg correction (Hochberg and Benjamini, 1990).
Results
After each NF experiment, participants were requested to rate the similarity of their thought content between the imagery localizer and the NF on a scale of 1 (very similar) to 5 (very different). According to the reports, the original strategy and the one the participants used for upregulation of the DMN-related IPL and the PPA were very similar (mean, 1 ± 0); for the EBA, FFA, and LO they were fairly similar (1.4 ± 0.18, 1.4 ± 0.18, 1.9 ± 0.22, mean ± SE); for the early visual cortex, the strategy they reported using was somewhat different (mean, 2.5 ± 0.19).
The imagery-localizer experiment revealed differential activations associated with the different imagery and thought contents. These activations, as expected, corresponded to the well-known visual and DMN functionalities. An example of the BOLD activations produced by the six different strategies, and the locations of chosen target ROIs, are depicted on an unfolded cortex in Figure 2A (localizer column) for a representative participant. The borders of category selective areas, early visual cortex, and default mode network, defined by the conventional visual localizer experiment, are depicted in color contours. The regions selected as targets for the MR-NF experiment are indicated by a red circle. As can be seen, the imagery localizer produced robust activation mainly in the IPL, EBA, and PPA. Using each of the ROIs as a target in a separate scan, participants then conducted a MR-NF experiment. The results for the NF-MR activation for the same participant are depicted in Figure 2B (neurofeedback column). The maps of NF activation are shown on an unfolded cortex in the left column, and the BOLD signal time courses are shown in the right column superimposed on the protocol indicating periods of NF and rest. The maps were ordered, top to bottom, according to the level of NF activation for each ROI, as measured by the GLM β values [IPL, 4 ± 0.32, p < 0.001; PPA, 2.12 ± 0.52, p < 0.01; EBA, 0.59 ± 0.39, nonsignificant (NS); FFA, 0.007 ± 0.56, NS; LO, −0.4 ± 0.57, NS; early visual cortex, −1.9 ± 0.59, NS]. As can be seen, the participant was quite successful in activating some of the target ROIs using the feedback signal. However, there was a striking difference in the amplitude and reliability of these activations across ROIs. In particular, the DMN IPL showed the highest activation, whereas early visual cortex showed essentially no NF activation.
This striking differential activation was consistent across participants. Figure 3 depicts summary group results, showing averaged event-related BOLD activation (A) and averaged activation (GLM β values; B). As can be seen, again the IPL DMN region showed the highest NF activation, which was significantly higher than in the visual regions. High-order visual regions (except for the LO) showed significantly higher activations than early visual cortex. Interestingly, the place-related PPA, located near the medial temporal lobe, showed a significantly higher NF activation compared to the FFA, EBA, and LO regions. Also noteworthy is the observation that the NF was a significantly more effective “activator” than the imagery strategy by itself (p < 0.05 for all ROIs, except for LO and early visual cortex). We ran a sham feedback control (see Materials and Methods) on six of the same subjects that participated in the NF experiment. The results showed significantly lower activation levels compared to the NF (IPL, 1.9 ± 0.44, p < 0.05; PPA, 1.28 ± 0.25, p < 0.05). Furthermore, comparing the sham to the imagery strategy failed to show a significant difference (IPL, p = 0.7; PPA, p = 0.6).
Discussion
Our results reveal a striking differentiation in the ability of participants to use NF signals to activate different cortical areas. Specifically, subjects showed significantly higher NF activations of the IPL (a DMN node) than regions in the visual cortex. In contrast, subjects were largely unsuccessful in using NF to activate early visual cortex. Category selective visual areas showed a mild level of NF activation, with the intriguing exception of the PPA (Figs. 2, 3).
It could be argued that the enhanced activation we found during the NF was not a result of the feedback proper, but a more general effect, for example, a greater arousal and motivation of the participants when undergoing the NF procedure compared to simple imagery. However, our sham control, in which participants were given a “wrong” feedback, i.e., derived from a different ROI (V1), argues against this interpretation. Thus, we show a significantly weaker activation in the sham compared to the true NF procedure. Furthermore, the activation during the sham was similar to the imagery condition, indicating that, in this case, participants could not use the feedback signal to enhance the activation. Thus, we believe that the significantly higher activations the NF produced compared to the imagery strategy by itself were not due to enhanced motivation or attention (which are also present during the sham condition), but rather due to the neurofeedback itself.
It could be argued that the stronger activation in the DMN and PPA is due to the better ability of participants to choose an optimal strategy for these areas compared to, e.g., early visual cortex. Indeed, one has a clear introspective intuition that it is easier to think about an episodic memory than to imagine a complex visual texture. However, it is important to distinguish between the apparent ease in conjuring a self-generated image from the objective information we have about the optimal stimuli for an area. For example, in a NF experiment targeting olfactory representations, one may be well informed about the fact that the optimal strategy for this task should be imagining different smells, yet fail in performing the NF experiment since olfactory imagery is notoriously difficult to achieve (Stevenson and Case, 2005). Such discrepancy between objective information and subjective performance may actually be a reflection of the present findings; i.e., subjectively, it is far easier to self-activate the DMN compared to early visual cortex. However, objectively, our knowledge of the optimal stimuli that drive area V1 is undoubtedly more certain than that of the currently debated involvement of the DMN in memory function (Hinds et al., 2013). Furthermore, our knowledge of the face selectivity of the FFA is quite likely more detailed, given the extensive literature, compared to the PPA, yet the ability of subjects to self-activate the latter was far superior to the former, again matching an introspective feeling that it is more difficult to conjure up vivid faces than to remember topographical information.
Differential activation within the visual system
Regarding differential effects within high-order visual areas, the main difference we found was a preferential activation of the PPA compared to the other areas (EBA, FFA, and LO; see Results; Fig. 3). The reason for the preferential activation of the PPA is intriguing, but not fully clarified yet. One possibility is that the PPA may in fact be a system that has closer functional links to memory systems. The PPA is anatomically closer to medial temporal lobe structures whose link to DMN functionality has been suggested in previous work (Kahn et al., 2008). Furthermore, functional properties such as selectivity for topographical representations (Kanwisher et al., 1997) and selectivity to associative networks (Bar and Aminoff, 2003) may also be compatible with such tighter links to medial temporal lobe and DMN structures. An alternative possibility may be related to the fact that in ecological daily life, the need to self-activate topographical representations (for example, in the course of planning a navigation route) may be more adaptive than the need to conjure faces. Further research is needed to clarify this finding. Another significant result observed was the failure of subjects to use NF to activate early visual cortex. This reduced ability is in line with our overall hypothesis that the success of NF activation reflects the cortical specialization for processing internally generated information compared to externally derived information. However, it is important to emphasize that our results certainly cannot be taken as evidence against top-down effects in early visual cortex (Brefczynski and DeYoe, 1999; Gandhi et al., 1999; Davidesco et al., 2013). Our results do not rule out such influences, but rather suggest that they play a relatively minor role in NF-based activations.
Potential translational impact
It should be noted that besides adding a new functional distinction to mapping of cortical networks, our results have potential translational impact. There has been a growing interest in using self-generated brain activation as a communication tool for paralyzed individuals such as “locked-in” patients (Laureys et al., 2005; Owen et al., 2006), or even to control devices in experimental brain–machine interface (BMI) paradigms (Cohen et al., 2014). Such approaches are critically dependent on the ability of the patients to self-activate their cortical areas. From this perspective, our study highlights the DMN and PPA regions as optimal candidates for such communication and BMI strategies. Furthermore, the paradigm illustrated in the present work, of comparing NF activation across a large number of cortical areas, could be extended to provide a comprehensive NF-based activation “atlas” that could serve as an important tool for optimizing the use of self-induced brain activation for communication purposes.
Relation to previous MR-NF research
Although the MR-NF method is fairly new, a number of studies have applied it in the human cortex. Target areas were selected on the basis of either anatomical [e.g., anterior cingulate (Weiskopf et al., 2003), anterior insula (Caria et al., 2007), inferior frontal gyrus (Rota et al., 2009)] or functional (e.g., motor and visual imagery; Yoo and Jolesz, 2002; Weiskopf et al., 2004a) criteria. This method was applied and shown to produce strong activations resulting in significant brain-behavior changes in chronic pain patients (deCharms et al., 2005), emotion regulation (Johnston et al., 2010; Linden et al., 2012), motor control and Parkinson's disease (Goebel et al., 2010; Subramanian et al., 2011), and visual perceptual learning (Shibata et al., 2011; Scharnowski et al., 2012). However, to the best of our knowledge, no systematic comparison of the effectiveness of NF activations across several cortical areas has been reported so far. It should be noted that although we show that some brain regions can be self-regulated more than others in a single NF session, it remains an open issue whether these results pertain also to multiple sessions.
Two important aspects should be considered when comparing our results to previous studies. First, whereas our NF approach revealed a particularly weak NF regulation in area V1, two previous studies demonstrated NF training effects in V1 (Shibata et al., 2011; Scharnowski et al., 2012). A consistent difference between our study and these reports relates to the participants' attempt to use an explicit conscious strategy to activate V1. In the studies by Shibata et al. (2011) and Scharnowski et al. (2012) it appears that the NF signal may have exerted a direct influence on the training procedure, in absence of a concrete awareness of the participants about the training targets. These results raise the intriguing possibility that in areas that are less directly accessible to self-related representations, such as V1, network plasticity may be mediated via direct reward associations of which the participants are not aware.
A second important aspect to note is that, at least for V1, a NF approach based on multivariate decoding, may provide a superior performance (Shibata et al., 2011). This raises the possibility that a multivariate decoding that is less susceptible to feedback noise could possibly lead to a different sensitivity profile in our study. This possibility cannot be ruled out at present, and future studies using multivariate mapping across a range of cortical areas should resolve this issue.
Relation to imagery
One can envision the MR-NF approach as an optimized version of mental imagery tasks. Indeed, as our imagery localizer indicates, the imagery strategy corresponded well to the MR-NF strategy subjects used (except for the early visual cortex, probably due to lack of success in upregulation of this region). A substantial body of literature exists concerning the impact of imagery in the visual regions reported here, as well as self-generated activations of the DMN. Thus, significant activations of the FFA and PPA were reported to be associated with imagining faces and places, respectively (Ishai et al., 2000; O'Craven and Kanwisher, 2000; Soddu et al., 2009). Similarly, activations in LO have been demonstrated to be associated with imagery of tool use and objects (Zhang et al., 2004; Konkle and Oliva, 2012). Activations in the EBA were found in imagery of full and upper human bodies (Blanke et al., 2010) and imagery of tool use (Tomasino et al., 2012). However, to the best of our knowledge, none of these studies compared the magnitude of activations across these areas. An interesting issue concerns imagery effects in area V1. Although the robustness of such activations have been debated (Roland and Gulyas, 1994; Kosslyn and Thompson, 2003; Pylyshyn, 2003), previously, more sensitive approaches using multivariate classifier approaches appear to show informative signals related to imagery content in early visual cortex (Kamitani and Tong, 2005; Slotnick et al., 2005). However, again, the levels of activations were not compared to other visual cortical areas. Finally, despite the difficulty in activating the DMN above resting baseline using external stimuli, the application of “stimulus-free” paradigms, in which participants self-activate the DMN, was shown to result in robust and maintained activations (Preminger et al., 2011; Kucyi and Davis, 2014). Similarly, we showed in a previous study that the anterior cingulate cortex, which may possess functional links to the DMN (Damoiseaux et al., 2006; Park et al., 2012), can be robustly and consistently activated during NF, even leading to long-term restructuring of its spontaneous (resting-state) connectivity (Harmelech et al., 2013).
Footnotes
This work was supported by the EU FP7 VERE project, ICORE program (ISF 51/11) grants, and the Helen and Martin Kimmel award to R.M. We thank Nahum Stern, Fanny Atar, and Dr. Edna Haran-Furman for their assistance in the imaging setup and rtfMRI data collection, and Michal Harel for helpful input.
The authors declare no competing financial interests.
- Correspondence should be addressed to Rafael Malach at the above address. rafi.malach{at}weizmann.ac.il