Abstract
Spatial navigation is a multisensory process involving integration of visual and body-based cues. In rodents, head direction (HD) cells, which are most abundant in the thalamus, integrate these cues to code facing direction. Human fMRI studies examining HD coding in virtual environments (VE) have reported effects in retrosplenial complex and (pre-)subiculum, but not the thalamus. Furthermore, HD coding appeared insensitive to global landmarks. These tasks, however, provided only visual cues for orientation, and attending to global landmarks did not benefit task performance. In the present study, participants explored a VE comprising four separate locales, surrounded by four global landmarks. To provide body-based cues, participants wore a head-mounted display so that physical rotations changed facing direction in the VE. During subsequent MRI scanning, subjects saw stationary views of the environment and judged whether their orientation was the same as in the preceding trial. Parameter estimates extracted from retrosplenial cortex and the thalamus revealed significantly reduced BOLD responses when HD was repeated. Moreover, consistent with rodent findings, the signal did not continue to adapt over repetitions of the same HD. These results were supported by a whole-brain analysis showing additional repetition suppression in the precuneus. Together, our findings suggest that: (1) consistent with the rodent literature, the human thalamus may integrate visual and body-based, orientation cues; (2) global reference frame cues can be used to integrate HD across separate individual locales; and (3) immersive training procedures providing full body-based cues may help to elucidate the neural mechanisms supporting spatial navigation.
SIGNIFICANCE STATEMENT In rodents, head direction (HD) cells signal facing direction in the environment via increased firing when the animal assumes a certain orientation. Distinct brain regions, the retrosplenial cortex (RSC) and thalamus, code for visual and vestibular cues of orientation, respectively. Putative HD signals have been observed in human RSC but not the thalamus, potentially because body-based cues were not provided. Here, participants encoded HD in a novel virtual environment while wearing a head-mounted display to provide body-based cues for orientation. In subsequent fMRI scanning, we found evidence of an HD signal in RSC, thalamus, and precuneus. These findings harmonize rodent and human data, and suggest that immersive training procedures provide a viable way to examine the neural basis of navigation.
Introduction
Head direction (HD) cells, examined extensively in rodents, code orientation information by firing when the animal assumes a particular facing direction (for review, see Taube, 2007). These cells have been found in a number of regions, including retrosplenial cortex (RSC) and anterior dorsal thalamic nuclei (ADN). Although human functional imaging studies have reported HD coding in retrosplenial complex, a more extensive region of posteromedial cortex not limited to anatomical RSC (Baumann and Mattingley, 2010; Marchette et al., 2014), none have observed a contribution of the thalamus to this signal.
In rodents, HD cells increase in firing rate when the animal assumes a particular orientation. When the head is kept stationary, however, HD firing rates are maximal for the first 100 ms before stabilizing over several seconds to less than one-half of their initial firing rates (Shinder and Taube, 2014). The preferred firing direction of these cells is determined by both external visual, and internally generated, body-based, cues. Under normal conditions, the preferred firing direction of HD cells is governed strongly by visible landmarks (Taube et al., 1990a,b). Lesioning the RSC (Clark et al., 2010) or postsubiculum (PoS; Goodridge and Taube, 1997) reduces the influence of landmark position on HD cell firing, suggesting that these regions code visual orientation cues. ADN HD cells remain sensitive to facing direction, however, through self-motion cues. In contrast, removal of vestibular input completely disrupts orientation coding in the ADN, even in the presence of intact RSC and PoS (Stackman and Taube, 1997), suggesting that idiothetic information is critical for HD coding in this region. Accordingly, a key role of the ADN may be to integrate body-based and visual orientation cues (Clark et al., 2010).
Human fMRI studies examining HD coding in virtual environments (VE) have used visual, but not body-based, cues during training to differentiate orientation. Accordingly, effects observed in RSC (Baumann and Mattingley, 2010; Marchette et al., 2014), and subiculum (Chadwick et al., 2015) may reflect coding of visual cues for facing direction. Body-based cues, such as proprioceptive feedback or motor efference copies, provide important information regarding one's heading in the environment, and can support navigation in the absence of visual input (eg, in path integration; Etienne and Jeffery, 2004). A task in which both body-based and visual information can be used to differentiate orientation, therefore, may be more likely to recruit the thalamus.
A second open question pertains to the reference frame in which HD is coded. Marchette et al. (2014) found that whereas RSC coded for HD in reference frames anchored to local environmental features, there was no evidence for HD coding in a global reference frame. The task demands may explain this effect because participants made directional judgments regarding proximal objects, rather than global landmarks, which may have promoted the use of local reference frames. Successfully navigating complex real world environments, however, requires the integration of disparate locales into a cohesive cognitive map (Wolbers and Wiener, 2014), which would benefit from orientation coding in a global reference frame (Vass and Epstein, 2013).
In the current experiment, we asked whether there was evidence of thalamic HD computations when body-based cues were provided during learning of an environment, and whether HD can be coded in a global reference frame. To achieve this, participants learned a VE comprising four distinct buildings while wearing a head-mounted display, requiring them to physically turn around to change orientation. In a subsequent one-back task performed during fMRI scanning, stimuli were presented that cued different HDs. We predicted adaptation in the RSC and the thalamus, associated with repeating HD across buildings. Moreover, we analyzed the BOLD response over multiple repetitions of the same HD to test whether this signal plateaus, as has been demonstrated in rodents.
Materials and Methods
Participants
Nine healthy volunteers (2 female; age range: 19–32), all right-handed, with normal or corrected-to-normal vision, and no history of psychiatric disorder, participated in the experiment at the UC Santa Barbara Brain Imaging Center. The study received ethical approval from the local ethics committee.
Stimuli
A novel VE was created using 3D Studio Max comprising four unique, separate, enclosed rooms or “galleries” (2 square galleries, 1 circular, and 1 hexagonal) linked by a handrail (Fig. 1a). Four global landmarks (a cityscape, a bridge, and 2 different colored mountain ranges; Fig. 1b) were positioned at the cardinal HDs (0°, 90°, 180°, and 270°), and the environment had a black and white striped floor. Each gallery housed a unique category of artwork (geometric shapes, abstract art, color blocks, or animals), and contained four unique paintings (16 in total in the VE). To further disambiguate the galleries, each one contained a unique wall texture (Fig. 1c). Within each gallery, the four paintings were arranged so that each one occupied a position concordant with a global landmark, effectively meaning that individual paintings occupied a specific HD.
For the one-back test phase administered during MRI scanning, individual paintings were displayed with a partial view of the background wall texture (Fig. 1d).
Procedure
Training phase.
The experiment comprised training and scanned test phases. During training, participants familiarized themselves with the VE, and learned the specific paintings associated with different HDs, in advance of the one-back task in the scanner. To provide an immersive learning experience and to provide full body-based cues about self-rotation, participants wore a head-mounted display. The VE was projected to this display with motion trackers detecting head only or head and body rotations, which then provided an updated visual input so that the participant experienced full 360° exploration within the VE. In contrast to rotational movements, forward/backward translational movements within the VE were produced solely by forward and backward motions of a joystick (ie, no translations of the body). Before exploration of the VE, participants were made aware that they would be tested on their knowledge of the paintings' orientations in the galleries relative to the global landmarks. They were advised to use all environmental cues available (ie, the orientation of the black and white striped floor, and the relative positions of the four global landmarks) to monitor their orientation while transitioning between, and entering, the galleries. Critically, once inside a gallery, participants had no visual access to the outside world. After entering a gallery, they were transported automatically to a marker in the center of the room (Fig. 1c). During this time, the participant's movement was confined to rotating in place to view the four items on the walls. Participants pressed a button to indicate when they had finished studying the paintings, at which point the pictures were removed and two doors appeared so that they could exit the room. It is important to note that, because of the layout of the environment, the egocentric bearings of the outside landmarks differed in each gallery. For example, across the four galleries, the cityscape was located egocentrically to the left, right, front, and behind the participant. Participants were required, therefore, to develop allocentric knowledge of HD in the VE.
After each circuit of the environment, participants were tested on the knowledge of their facing direction relative to the global cues when facing each of the 16 paintings. Participants were seated in front of a computer with a joystick, and presented with one of the paintings with a text prompt beneath it asking them to point in the direction of a specific global landmark. If the participant made a mean average pointing error >10° across trials, they were returned to the VE for a further training phase. Training continued until the participant reached criterion before moving onto the test phase, which was the fMRI component of the task.
Test phase.
After reaching criterion in the training phase (ie, <10° mean average pointing error), participants performed a one-back task while undergoing MR scanning. In this task, the 16 paintings were presented individually, and participants were required to recall their facing direction associated with the painting in the current trial and compare it to their facing direction associated with the painting in the preceding trial. Given that participants were required to remain motionless in the scanner, it was our assumption that the visual cues would lead to a recapitulation of the information encoded with them during learning in the VE (ie, an encoding specificity effect; Tulving and Thomson, 1973; for review, see Watrous and Ekstrom, 2014). Several models of memory propose that associated event details are reinstated during retrieval of an episode (Norman and O'Reilly, 2003), and evidence for this cortical reinstatement has been demonstrated via stimulus-specific patterns of activity in fMRI (Johnson and Rugg, 2007), and electroencephalography (Wimber et al., 2012). In the current study, therefore, we assumed that the presentation of a visual cue to a specific HD would lead to reinstatement of the body-based cues about facing direction that were available during learning. Participants pressed one button if the facing directions differed over consecutive trials (non-repeat HD), and an alternate button if they were identical (repeat HD). Items were presented for 5 s, and participants were told to respond as quickly and accurately as possible during this time. Trial order was controlled so that for both non-repeat, and repeat HDs, images were drawn from a different gallery (ie, a different category of stimuli) on each trial. Any effects associated with the one-back, therefore, could not be explained by repetition of a semantic/visual category. A block comprised 32 trials, with each of the 16 items presented twice, serving once as a non-repeat HD trial, and once as a repeat HD trial (ie, 16 non-repeat HD, and 16 repeat HD trials per block).
Before this test phase, participants performed a short practice run outside the MR scanner. This practice was limited to one block of 16 trials, and the correct answer was provided after every trial. Importantly, the practice run, covered only a small portion of the potential transitions between stimuli during test, hence it was impossible for participants to learn the correct responses from feedback. During MRI scanning, feedback was not provided, and intertrial intervals varied randomly between 6 and 8 s. Participants viewed the test items, back-projected from a LCD projector via a mirror, mounted on the head-coil, angled at 45°. Responses were made using an MR-compatible button-box. Each scanning run comprised one block lasting ∼7 min; the experiment consisted of eight runs in total.
MR imaging parameters
Imaging data were acquired using a 3T MRI scanner (Siemens Trio), with a standard receiver head coil. Twenty-five contiguous axial slices (3 mm thickness), tilted to cover target brain areas (eg, RSC, thalamus), were acquired using a T2*-weighted gradient echo planar imaging sequence (TR/TE = 1340/30 ms; flip angle = 90°, voxel size = 3 mm isotropic, FoV = 192 × 192 mm). Before the functional scans, a high-resolution, whole-brain T1-weighted anatomical scan was acquired for each participant (TR/TE = 15/4.2 ms, flip angle = 20°, voxel size = 0.9 mm isotropic, FoV = 240 × 240 mm).
fMRI data preprocessing and analysis
Data analyses were conducted using FSL v5.0 (FMRIB). For each run, the EPI data were skull-stripped (Smith, 2002), realigned to the middle functional volume of the run (Jenkinson et al., 2002), high-pass filtered (100 s) to remove low-frequency scanner artifacts, smoothed with a 5 mm FWHM Gaussian kernel, and transformed to the participant's high-resolution anatomical image using boundary-based registration (Greve and Fischl, 2009), before linear registration, with 12 degrees of freedom, to the standard-space template (Montreal Neurological Institute, n = 152).
Two separate general linear models (GLMs) were constructed to test: (1) brain regions sensitive to repetition of HD, and (2) the profile of the BOLD response over multiple repetitions of stimuli cueing the same facing direction. First, to examine regions of the brain sensitive to repetition of HD information, three explanatory variables (EVs) were entered into FMRIB's improved linear model of the fMRI time-course data from each of the eight functional runs. These comprised: (1) correct responses to non-repeat HD trials, (2) correct responses to repeat HD trials, and (3) all incorrect responses. This model replicates previous analysis strategies (Baumann and Mattingley, 2010), and benefits from having an equal number of trials in the non-repeat HD and repeat HD regressors (mean number of correct non-repeat HD trials per run = 15.78, SD = 0.42; mean number of correct repeat-HD trials per run = 15.50, SD = 0.86). To examine the profile of BOLD response over multiple repetitions of HD, a second GLM was implemented in which the repeat HD regressor was separated according to whether a trial comprised a repetition of HD associated with the preceding trial only (one-back repeat HD), or the preceding two trials (two-back repeat HD; mean number of correct one-back repeat HD trials per run = 9.74, SD = 1.35; mean number of correct two-back repeat HD trials per run = 5.76, SD = 1.25). Although the two-back repeat HD EV models those trials in which the preceding two trials comprised the same HD, the participant was required only to make a one-back decision (ie, compare the HD on the current trial with respect to the immediately preceding trial). There was no difference, therefore, in mnemonic demand between the one-back and two-back repeats modeled here. The second model comprised four EVs: (1) correct responses to non-repeat HD trials, (2) correct responses to one-back repeat HD trials, (3) correct responses to two-back repeat HD trials, and (4) all incorrect responses. In both GLMs, the regressors were convolved with a double-gamma HRF and the temporal derivative added. Each participant's eight runs were combined using a fixed-effects model, the resulting images thresholded at Z > 2.3 to identify contiguous clusters, and a cluster extent of p < 0.05 applied to control FWE.
Given our a priori hypotheses regarding the role of the RSC and thalamus in HD coding, we examined effects in anatomical regions-of-interest (ROIs). The RSC anatomical ROI was created using FreeSurfer 5.0. The participant's T1 anatomical image was resampled to 1 mm isotropic resolution, segmented, the gray matter skeleton extracted, and parcellated into separate cortical regions (Destrieux et al., 2010). The RSC was defined as the posterior–ventral part of the cingulate gyrus, providing a mask slightly larger than BA29/30 (ie, RSC proper; Fig. 1e). The RSC masks were then resampled to 0.9 mm isotropic resolution, and subsequently warped to the standard space template (MNI-152), using the same transformation matrix parameters from the registration of the participant's T1 image to standard space in the functional analyses (Fig. 1f). FMRIB's Integrated Registration and Segmentation Tool (FIRST; Patenaude et al., 2011) was used to segment automatically the thalamus bilaterally in each participant's T1 structural image. The subject-specific thalamic ROIs were then warped to standard space. In our paradigm, different HDs were associated with different landmarks. The recovery of orientation information, therefore, could be interpreted as the recall of landmark representations, and adaptation effects in the RSC and/or thalamus interpreted as reflecting memory for these items. As a control analysis, we tested for adaptation effects also in bilateral hippocampus, a central node for episodic memory (Squire et al., 2004), where one might expect to observe memory-related adaptation effects (Kremers et al., 2014). Subject-specific hippocampal ROIs were identified using FMRIB's FIRST, before being warped to standard space.
For the first GLM, FSL's Featquery was used to extract the average percentage signal change value associated with: (1) “correct responses to non-repeat HD trials > baseline,” and (2) “correct responses to repeat HD trials > baseline,” in the bilateral RSC and thalamic anatomical ROIs. Given the size of our fMRI sample, it is possible that the data may violate the assumptions of normality, and that the results could be driven by outliers. To control for these possibilities, the percentage signal change data were first rank transformed using the Align Rank Transform (ART) tool (for details, see Wobbrock et al., 2011). Briefly, this method ranks the data according to a specific effect of interest after having “stripped” from it the contributions of other factors in the model (eg, any other main effects, and interactions). Once the data have been aligned and ranked, it can then be submitted to regular parametric ANOVA, with separate models used to examine each individual effect. For example, a two-way ANOVA comprising two main effects and an interaction term would require three separate ANOVA models, two in which the data have been ranked according to each main effect (with the contribution of the other main effect and interaction term removed from these data), and one for the interaction term, where the data have been ranked after the contribution of the main effects have been removed. For our ROI analysis, the resulting rank values were submitted to repeated-measures ANOVAs for the RSC, thalamus, and hippocampus comprising the factors Hemisphere (left; right) and Repetition (non-repeat HD; repeat HD). To examine whether there was further adaptation in the BOLD signal over repeats of the same HD (ie, our second fMRI GLM), the percentage signal change values associated with: (1) “correct responses to one-back repeat HD trials > baseline,” and (2) “correct responses to two-back repeat HD trials > baseline” were extracted from the anatomical ROIs, aligned, ranked, and entered into separate ANOVAs, for the RSC, thalamus, and hippocampus comprising the factors Hemisphere (left; right) and Repetition (one-back repeat HD; two-back repeat HD).
Following the targeted anatomical ROI analysis, whole-brain, group-level contrasts were implemented to identify other brain regions coding HD. These contrasts were performed using FSL's threshold-free cluster enhancement (TFCE; Smith and Nichols, 2009), with 5 mm variance smoothing (recommended for small sample sizes), using the Randomize tool (Winkler et al., 2014). The TFCE algorithm computes a voxelwise score that reflects both a voxel's height, and the sum of the spatially contiguous voxels supporting it (Smith and Nichols, 2009, their Eq. 1). The TFCE score, therefore, reflects both the strength of the signal at a given voxel, and its cluster extent. A benefit of this method is that it removes the need for the experimenter to select an arbitrary Z threshold for the group-level cluster statistic. Randomize is a permutation-based method allowing for the analysis of imaging data that, due to small sample size, may not adhere to parametric assumptions (eg, normal distribution). Combined with TFCE, FSL's Randomize constructs a study-specific null distribution of TFCE scores using permutations of the design matrix. For a one-sample t test design matrix (comprising a vector of ones), Randomize flips the signs of this matrix (ie, 1 to −1), and permutes all possible combinations of this design (in our case = 29 = 512 permutations). Voxelwise TFCE scores are then computed for each permutation, and the resulting values combined to form a null distribution. This distribution can then be thresholded to control the FWE, for example at the 95th percentile (ie, p < 0.05), and this value then used to threshold the original non-permuted TFCE image. A second benefit of this method, therefore, is that the null distribution is constructed using the observed data. For the first fMRI model, a contrast was implemented to examine brain regions showing repetition suppression associated with repeats, compared with non-repeats, of HD (“non-repeat HD > repeat HD,” ie, where in the brain is there a significant reduction in activity for repeats relative to non-repeats of HD?). The reverse of this contrast was modeled also to examine regions of cortex showing significantly increased BOLD response associated with repeats of HD relative to non-repeats (ie, repetition enhancement; Vass and Epstein, 2013). In the second fMRI GLM, we contrasted one-back repeat HD and two-back repeat HD trials to test whether either of these conditions showed a stronger repetition suppression effect. To localize more precisely effects within the thalamus, we used a stereotactic atlas of this region comprising 53 separate thalamic structures (Morel, 2007), generated in 3D (Krauth et al., 2010), and nonlinearly warped to the MNI-152 template (Jakab et al., 2012).
Results
Behavioral
To check whether behavioral performance was equivalent for all orientations and repetition conditions, accuracy and reaction time (RT) data for the one-back task were submitted to an Orientation (0°, 90°, 180°, 270°) × Repetition (non-repeat HD; repeat HD) repeated-measures ANOVA. Participants performed nearly at ceiling on the task (Fig. 2a), and the ANOVA revealed only a trend toward better performance for the non-repeat HD, relative to the repeat HD trials (F(1,8) = 4.49, p = 0.07, ηp2 = 0.36). There was no effect of orientation, or Orientation × Repetition interaction (F values <2.48, p values >0.07).
Behavioral priming has been used to examine the underlying mental representations of facing direction (Marchette et al., 2014). Specifically, stimuli cueing repetitions of HD resulted in faster RTs than non-repeats. This was thought to result from the RT cost of having to switch between different mental representations of facing direction when heading direction was not repeated. Consistent with previous data, we observed a main effect of Repetition (F(1,8) = 5.89, p = 0.04, ηp2 = 0.42), reflecting longer RTs for non-repeat HD trials (mean = 1596 ms, SEM = 118 ms), than for repeat HD trials (mean = 1502 ms, SEM = 121 ms). There were no other main effects, or interactions (F values <1.54, p values >0.22) suggesting that the four different facing directions were equally well learned.
It is possible that the sample size was too small to detect any differences in accuracy and/or RT between the different orientations. To test whether this null effect resulted from a lack of power, we repeated the task (outside of the MRI scanner) in a separate sample of 24 participants (14 female; mean age = 24.2, range: 20–31). The behavioral procedure for the additional participants was the same as reported for the fMRI task with the exception that the intertrial interval was adjusted to 2 s; target image presentation time was maintained at 5 s.
Mean accuracy and RTs for the one-back task were submitted to Orientation (0°, 90°, 180°, 270°) × Repetition (non-repeat HD; repeat HD) repeated-measures ANOVAs. For accuracy scores, we observed a main effect of Repetition, (F(1,23) = 8.23, p = 0.01, ηp2 = 0.26), modulated by a significant Orientation × Repetition interaction (F(3,69) = 3.18, p = 0.03, ηp2 = 0.12); there was no main effect, however, of Orientation (F(3,69) = 1.28, p = 0.29, ηp2 = 0.05). Simple main effects revealed no differences in accuracy across the different orientations for non-repeat, and repeat HDs (F values <2.35, p values >0.09). The Orientation × Repetition interaction resulted from significantly greater accuracy for non-repeat HD relative to repeat HD trials in the 180° orientation (F(1,23) = 10.13, p = 0.004). A similar pattern of data were evident for 0° and 90° orientations, however, these comparisons did not reach statistical significance (F(1,23) = 4.20, p = 0.05; F(1,23) = 4.02, p = 0.06, respectively); there was no difference in the level of accuracy between non-repeat and repeat HD trials for the 270° orientation (F(1,23) = 1.81, p = 0.19; Fig. 2b).
As in the fMRI study, RTs were faster for repeat versus non-repeat trials (main effect of Repetition: F(1,23) = 17.39, p < 0.001, ηp2 = 0.43). In addition, we observed a main effect of Orientation (F(3,69) = 12.90, p < 0.001, ηp2 = 0.36), qualified by an Orientation × Repetition (F(3,69) = 6.79, p < 0.001, ηp2 = 0.23) interaction. To further explore these effects, we tested whether RT differed across different orientations for non-repeat, and repeat HD conditions. For the non-repeat HD condition, responses for stimuli cueing 180° and 270° orientations were quicker than for stimuli cueing 0° and 90° (all t values > 2.93, p values < 0.008). For the repeat HD condition, RTs were faster for stimuli cueing the 270° orientation relative to all other orientations (all t values > 4.39, p values < 0.001). Furthermore, simple main effects comparing RT between non-repeat and repeat HD conditions for each individual orientation revealed significantly quicker responses for repeat HD trials in 0°, 90°, and 270° orientations (all F values > 8.21, p values < 0.01). Although there was no statistically significant difference in RTs between non-repeat, and repeat HD conditions for the 180° orientation (F(1,23) = 0.18, p = 0.67), the pattern of data were in the same direction with faster responses for repeat HD trials (mean = 1498 ms, SEM = 61 ms) compared with non-repeat HD trials (mean = 1516 ms, SEM = 62 ms).
In summary, in a separate behavioral experiment we found that, in terms of accuracy, performance was again nearly at ceiling and was matched across the four different orientations in the VE. For RT, there was a small benefit in RT for stimuli cueing the 270° orientation. This effect, however, was not consistent across both repetition conditions, and RTs were faster for repeat trials in all four orientations. Together, these results confirm the effects of the fMRI experiment and demonstrate the behavioral advantage of orientation priming, which was not driven by a particular orientation.
fMRI: ROI analysis
In the following section, we detail the analysis of: (1) percentage signal change values associated with non-repeat, and repeat HD trials; (2) percentage signal change values associated with one-back repeat HD and two-back repeat HD; and (3) the difference in percentage signal change values between non-repeat HD and one-back repeat HD trials versus the difference between one-back repeat HD and two-back repeat HD trials. For all analyses, we used participant-specific anatomical ROIs comprising the left and right RSC, left and right thalamus, and left and right hippocampus; results are described in this order.
In RSC there was a significant main effect of Repetition, with non-repeat HD trials associated with greater percentage signal change than repeat HD trials (F(1,8) = 22.73, p = 0.001, ηp2 = 0.74; Fig. 3a). Repetition condition interacted with Hemisphere (F(1,8) = 10.26, p = 0.013, ηp2 = 0.56), and follow-up nonparametric Wilcoxon signed-rank tests showed a trend toward greater activity associated with repeat-HD trials in right, relative to left RSC [Z = 1.84, p = 0.066 (two-tailed), r = 0.61]. In the thalamic ROIs, we observed a similar pattern of data (Fig. 3b), with significantly greater activity for non-repeat HD trials relative to repeat HD trials (F(1,8) = 20.16, p = 0.002, ηp2 = 0.72). There was no effect of Hemisphere, and this factor did not interact with Repetition (F values <2.80, p values >0.12). In contrast, in the hippocampus there were no significant effects (F values <1.14, p values >0.30).
Rodent data have shown that the firing rate of HD cells reaches a stable plateau after ∼4 s when the animal is held stationary (Shinder and Taube, 2014). To determine whether similar dynamics might be present in humans, we asked whether the profile of the BOLD response changes over multiple repetitions of the same HD. For both the RSC (Fig. 3c) and thalamus (Fig. 3d), there was no evidence that percentage signal change values differed according to Hemisphere (left; right), Repetition (one-back repeat HD; two-back repeat HD), or a Hemisphere × Repetition interaction (F values <1.14, p values >0.31). Likewise, percentage signal change values in the hippocampus did not differ according to these factors (F values <2.63, p values >0.13)
To provide stronger support for the notion that the BOLD signal plateaus over repetitions of stimuli cueing the same HD, we compared the difference in activity between the non-repeat HD and one-back repeat HD trials with the difference between the one-back repeat HD and two-back repeat HD trials. In both the RSC (Fig. 3e) and thalamus (Fig. 3f), nonparametric Wilcoxon signed-rank tests revealed significantly greater adaptation when HD was repeated for the first time (“non-repeat HD − one-back repeat HD”), compared with subsequent repetitions of HD [“one-back repeat HD − two-back repeat HD”'; RSC: Z = 2.07, p = 0.02 (one-tailed), r = 0.69; thalamus: Z = 2.07, p = 0.02 (one-tailed), r = 0.69]. Consistent with the results of the non-repeat HD versus repeat-HD comparison reported above, in the hippocampus there was no difference in the degree of adaptation between “non-repeat HD − one-back repeat HD” and “one-back repeat HD − two-back repeat HD” [Z = −0.53, p = 0.30 (one-tailed), r = 0.18].
To summarize, in anatomical ROIs comprising the RSC and thalamus, significant repetition suppression was observed following repeats of HD, relative to when HD direction was changed. Moreover, during a second repetition of the same HD, the BOLD signal remained suppressed but did not show any evidence of further attenuation. There was no evidence of adaptation, however, in the hippocampus associated with repeated HD.
fMRI: whole-brain analysis
To determine whether regions of cortex outside our predefined ROIs were sensitive to repetition of HD, we performed a whole-brain analysis. Using a voxelwise threshold of p < 0.05 (FWE-corrected), for the non-repeat HD > repeat HD contrast we found a large cluster of activity, centered on the left RSC, which encompassed a number of brain regions, including the right RSC, bilateral thalamus, precuneus, and putamen, and a smaller cluster in the temporal pole. Given the large spatial extent of the first cluster, the image was thresholded at p < 0.01 (FWE-corrected) to see in which regions the effect of this contrast was strongest. Supporting the ROI analysis, we observed separate clusters in left RSC (which included a local maximum in the right RSC) and left and right thalamus. Moreover, there was activity also in the precuneus (Fig. 4; Table 1). The reverse of this contrast (“repeat HD > non-repeat HD”; ie, repetition enhancement) was not associated with any regions of significantly increased BOLD response, at either threshold.
Using the Morel (2007) thalamic atlas, we identified the peak coordinate of the left thalamus cluster in the ventral division of the ventral lateral posterior nucleus. As can be seen in Figure 5, this cluster spanned a number of other thalamic subregions including the dorsal division of the ventral lateral posterior nucleus, the central lateral nucleus, the lateral dorsal and lateral posterior nuclei, the parvocellular division of the mediodorsal nucleus, and the posterior division of the ventral posterior lateral nucleus. The cluster in the right thalamus spanned the same thalamic subregions as the left. Its peak, however, was located at the border of the dorsal division of the ventral lateral posterior nucleus.
To examine the consistency of our effects, we investigated the repetition suppression contrast at the single-subject level. Seven of the nine participants showed significant clusters in the RSC (Fig. 6a). Eight of the nine subjects showed repetition suppression in the thalamus (masked with their own bilateral thalamic ROI) associated with repeating HD (Fig. 6b), with this contrast surviving correction for multiple comparisons in six subjects. The observed whole-brain effects in the RSC and thalamus, therefore, reflect consistent patterns of activity across nearly the entire sample.
Finally, consistent with the targeted ROI analyses, there were no differences in the level of activity associated with one-back repeat HD and two-back repeat HD trials anywhere in the brain.
Discussion
We used repetition suppression to examine brain regions coding HD in a VE using global reference frame cues. Unlike previous studies, orientation was learned using both visual and body-based cues; during MRI scanning only visual cues were provided for orientation. In bilateral RSC and thalamic ROIs, the BOLD signal attenuated in association with repeating HD. Consistent with rodent electrophysiology, this signal plateaued over multiple repetitions of the same facing direction. These data were supported by whole-brain analyses showing adaptation also in the precuneus. Here, we show evidence that: (1) the thalamus codes facing direction, (2) HD coding occurs in anatomically defined RSC (rather than retrosplenial complex), and (3) participants integrate HD across different locales using global reference frame cues.
In rats, both body-based and visual cues contribute to accurate orientation coding. The ADN of the thalamus is thought to integrate this information (Taube, 1995), as it receives projections from vestibular regions and shares reciprocal connections with visual processing areas (RSC/PoS; Clark et al., 2010; Yoder and Taube, 2014). Consistent with these data, previous human fMRI HD studies using VEs, in which only visual cues were provided, observed HD computations in visual processing regions (RSC/subiculum; Baumann and Mattingley, 2010; Chadwick et al., 2015; Marchette et al., 2014). Here, we provided also body-based cues, and were therefore more likely to tax brain regions integrating this information (ie, the thalamus). Given that only visual cues for HD were presented during test, we propose that these led to a recapitulation of the body-based information encoded during learning of the VE (Tulving and Thomson, 1973; Johnson and Rugg, 2007; Wimber et al., 2012), specifically the participant's physical orientation relative to the global landmarks. Because both cue types were available during learning, however, it is not possible to assess the relative contribution of RSC and thalamus to the processing of visual and body-based cues, respectively. In future neuroimaging studies it will be important to separate these different sources of HD information, and to make direct comparisons between “visual” and “visual plus vestibular” cues to help elucidate the precise roles of these brain regions in HD coding.
Our VE was designed to provide strong body-based cues for HD, with cardinal directions denoted by unique visual landmarks arranged in orthogonal bodily orientations. Moreover, participants monitored their orientation during navigation. These factors may have led to stronger body-based cues than those generated during navigation in the real world. A recent fMRI study, using stimuli comprising images of a real-world environment familiar to the participants (ie, encoded with body-based cues), found that the presubiculum, but not the thalamus, coded facing direction (Vass and Epstein, 2013). Navigation of this larger environment likely required participants to make a greater number of turns, leading to noisier body-based cues (Souman et al., 2009). Furthermore, computational models propose that under normal circumstances (ie, when not conflicting greatly with body-based cues) visual, rather than body-based cues, dominate the HD signal (Knight et al., 2014). Together, these factors may reduce the strength of the body-based cues during navigation in real-world environments, and potentially explain the discrepancy with the findings reported here. In summary, although we observed thalamic HD computations, the scale and layout of the environment, as well as attentional demands, may influence the contribution of this region to HD coding.
Using the Morel (2007) atlas, group-level bilateral thalamic cluster peaks were identified in the ventral/dorsal divisions of the ventral lateral posterior nucleus, rather than the ADN. In macaques, tracer studies show that the ventral lateral region of the thalamus projects to area V6a, a region involved in organizing visual and motor responses (Gamberini et al., 2015). Moreover, human area V6+ shares strong interconnections with V6a, which is sensitive to changes in head orientation (Arnoldussen et al., 2011). Accordingly, V6a may aid integration of visual and vestibular information to distinguish changes in head orientation relative to the environment, helping to encode facing direction. For several reasons, however, we express caution at attributing our group-level effect to specific thalamic subregions. First, the cluster peaks varied across subjects. Second, adaptation was observed in ROIs comprising the entire thalamus, meaning subthreshold effects were likely present in other regions. Finally, given the spatial resolution of our imaging data, we likely sampled over a number of subregions, and were not sensitive to smaller structures [eg, the volume of bilateral ADN in the 1 mm Morel (2007) atlas is only 16 mm3; Krauth et al., 2010; Jakab et al., 2012].
A second key finding was the demonstration of HD coding using global reference frame cues. Although Marchette et al. (2014) found that RSC was insensitive to this information, their participants were not instructed to attend to the global landmarks, and knowledge of landmark direction was not assessed. The different task demands, therefore, may explain the discrepancy in reference frame coding. Behavioral experiments suggest separate local spatial representations can be integrated into configural representations (Montello and Pick, 1993). For example, after walking around a complex VE, participants pointed more accurately to distant targets when aligned either with local or global reference frames (Meilinger et al., 2014). Similarly, separate spatial representations for individual environments are formed even when learned together (Han and Becker, 2014). Both local, and global reference frames, therefore, may operate in parallel, and it remains to be tested as to whether HD coding changes on a per-trial basis as a function of task instructions (ie, attend either to local or global reference frame cues).
An influential conceptualization of RSC function posits that this region translates between egocentric and allocentric spatial representations, contained in posterior parietal cortex and the medial temporal lobe, respectively (Byrne et al., 2007). Supporting this idea, imaging studies have shown RSC activity is modulated when translating between reference frames (Zhang et al., 2012; Dhindsa et al., 2014). It remains unclear, however, how these representations are synchronized. Alexander and Nitz (2015) found different cells in the rat RSC code for egocentric or allocentric reference frames, and conjunctions of these representations. Consequently, our findings of global reference frame coding, and those reporting HD coding in separate local reference frames (Marchette et al., 2014), are not contradictory; they may simply reflect different forms of spatial mapping supported by different cell populations in the RSC. In terms of how these representations are coordinated, a recent fMRI study found a positive correlation between RSC activity and the degree of landmark permanence (Auger et al., 2012), and these fixed landmark positions may help align different reference frames. This function is consistent with the deficits reported after RSC lesions (Maguire, 2001). Specifically, although patients recognize familiar landmarks, they cannot derive from them topographical information that supports navigation (Ino et al., 2002). We speculate, therefore, that the RSC adaptation observed here is driven by representations of allocentric HD, rather than memory for the landmark itself. Moreover, although we cannot rule out a memory processing explanation for the effects reported here, unlike previous studies (Kremers et al., 2014), we found no evidence of adaptation in the hippocampus, a key node for episodic memory (Squire et al., 2004).
The notion that subdivisions of parietal cortex contain cells coding in egocentric and allocentric reference frames is supported by evidence of adaptation in the precuneus. Although classically implicated in egocentric coding (Wolbers et al., 2008), our data implicate this region in allocentric HD coding. Although changes in allocentric HD may also result in an associated update in egocentric coordinates, our findings are in agreement with rodent data showing that posterior parietal cortex cells code allocentric HD (Wilber et al., 2014). Moreover, the primate posterior parietal cortex provides both egocentric and world-centered codes for external locations (Snyder et al., 1998). Consequently, it is also unclear at present whether the transformation between egocentric and allocentric coordinates is exclusive to RSC.
Finally, it has been suggested that fMRI may not be appropriate to investigate spatial navigation because participants lie motionless in the scanner (Taube et al., 2013). Furthermore, in rats HD cell firing does not adapt when the same orientation is adopted repeatedly, meaning that fMRI repetition suppression is potentially insensitive to this effect. However, when rats adopt a new head orientation after passive rotation, and their heads are kept stationary, HD firing rates are maximal for the first 100 ms before adapting over the next 4 s to stabilize at 42% of their initial values (Shinder and Taube, 2014). Consistent with these data, and supporting previous fMRI results (Baumann and Mattingley, 2010), we observed adaptation in key structures of the rodent HD network associated with repetition of HD information. Furthermore, the BOLD response plateaued, reducing when HD was first repeated but not over multiple repetitions. Even though there is no simple relationship between neural firing rates and the hemodynamic response (Ekstrom et al., 2009), and a shorter duration, rather than reduced frequency, of neural firing could explain the observed fMRI adaptation (Grill-Spector et al., 2006), our data accord with rodent electrophysiology. FMRI, in combination with realistic training procedures, therefore, appears to be a viable method to understand neural mechanisms supporting spatial navigation.
Footnotes
This work was supported by a Karp Discovery Award and by a Starting Grant of the European Research Council to T.W. (AGESPACE 335090). We thank Jack Loomis for helpful comments on an earlier version of this paper, and Masaki Miyanohara and Rebecca Korn for help with data collection.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr Jonathan Shine, German Center for Neurodegenerative Diseases (DZNE), Aging and Cognition Research Group, Building 64, Room 519, Leipziger Straβe 44, 39120 Magdeburg, Germany. Jonathan.shine{at}DZNE.de