Abstract
Recent functional imaging studies have proposed that the human entorhinal cortex (ERC) is subdivided into functionally distinct anterolateral (alERC) and posteromedial (pmERC) subregions. The alERC overlaps with regions that are affected earliest by Alzheimer's disease pathology, yet its cognitive function remains poorly understood. Previous human fMRI studies have focused on its role in object memory, but rodent studies on the putatively homologous lateral entorhinal cortex suggest that it also plays an important role in representing spatial properties of objects. To investigate the cognitive effects of human alERC volume differences, we developed an eye-tracking-based task to evaluate intra-item configural processing (i.e., processing the arrangement of an object's features) and used manual segmentation based on a recently developed protocol to delineate the alERC/pmERC and other medial temporal lobe (MTL) subregions. In a group of older adult men and women at varying stages of brain atrophy and cognitive decline, we found that intra-item configural processing, regardless of an object's novelty, was strongly predicted by alERC volume, but not by the volume of any other MTL subregion. These results provide the first evidence that the human alERC plays a role in supporting a distinct aspect of object processing, namely attending to the arrangement of an object's component features.
SIGNIFICANCE STATEMENT Alzheimer's disease pathology appears earliest in brain regions that overlap with the anterolateral entorhinal cortex (alERC). However, the cognitive role of the alERC is poorly understood. Previous human studies treat the alERC as an extension of the neighboring perirhinal cortex, supporting object memory. Animal studies suggest that the alERC may support the spatial properties of objects. In a group of older adult humans at the earliest stages of cognitive decline, we show here that alERC volume selectively predicted configural processing (attention to the spatial arrangement of an object's parts). This is the first study to demonstrate a cognitive role related to alERC volume in humans. This task can be adapted to serve as an early detection method for Alzheimer's disease pathology.
- anterolateral entorhinal cortex
- configural processing
- eye tracking
- medial temporal lobe
- mild cognitive impairment
- MRI volumetry
Introduction
Recent studies have highlighted the division of the human entorhinal cortex (ERC) into anterolateral (alERC) and posteromedial (pmERC) halves, which belong to functionally distinct pathways and support different cognitive roles (Maass et al., 2015; Navarro Schröder et al., 2015). Critically, the alERC overlaps significantly with the regions described as transentorhinal cortex (Braak and Braak, 1991) or lateral entorhinal cortex (Khan et al., 2014), which exhibit Alzheimer's disease pathology earlier than any other brain region. Despite the clinical importance of alERC decline, no human studies to date have attempted to link alERC volume to particular cognitive functions.
The alERC and pmERC are thought to be the human homologs of the rodent lateral (LEC) and medial (MEC) entorhinal cortices, which belong to two distinct functional pathways in the rodent brain. Neuroanatomical studies show that the LEC primarily receives projections from the perirhinal cortex (PRC), whereas the MEC receives most of its input from the postrhinal cortex, the rodent analog of the human parahippocampal cortex (PHC) (Naber et al., 1997; see also Suzuki and Amaral, 1994 for the same anatomical dissociation in monkeys). In turn, the LEC and MEC project to distinct regions of CA1 and the subiculum in the hippocampus (Witter, 1993). However, the separation between these two pathways is not absolute; for example, there are reciprocal connections between the LEC and MEC (for review, see van Strien et al., 2009). Experimental work suggests that homologous pathways may be present in humans: for example, Maass et al. (2015) defined the alERC and pmERC based on their functional connectivity to the PRC and PHC, respectively. Because this protocol derived the alERC's boundaries through functional connectivity with the PRC (paralleling LEC–PRC connections in rodents and monkeys), we argue that it is a more precise human analog to the rodent LEC than previously used definitions of the human LEC, which involved simple geometric division of the ERC.
On the same anatomical basis, the PMAT (posterior medial, anterior temporal) framework proposes that the alERC and the PRC form the hub of a network relating representations of specific entities (e.g., objects/faces) to their associated saliences and existing semantic concepts (Ranganath and Ritchey, 2012; Ritchey et al., 2015). A theoretically aligned view, the representational–hierarchical model, holds that the PRC and, by extension, the adjoining alERC, sits at the apex of the ventral visual stream, an object-processing pathway of increasingly complex representations. Relatively simple object features are represented in posterior regions and more complex conjunctions of object features (at approximately the level of an object) are represented in the PRC and likely the neighboring alERC (Cowell et al., 2010; Saksida and Bussey, 2010; Barense et al., 2012; Erez et al., 2016).
Consistent with these models, the two previous cognitive studies investigating the human LEC (using fMRI in healthy young adults) showed increased LEC activation when distinguishing faces/objects from lures with similar features (Schultz et al., 2012; Reagh and Yassa, 2014). In contrast, and possibly reflecting reciprocal connections with the spatially responsive MEC/hippocampus, rodent studies suggest that the LEC may be involved in representing the spatial properties of objects (Knierim et al., 2014). Direct recording studies report LEC place cells that fired at the location of novel objects (Deshmukh and Knierim, 2011) or at locations where objects were located previously (Tsao et al., 2013).
Connecting the theoretical models with the rodent literature, we reasoned that the alERC may be involved in processing the conjunctive arrangement of the parts of an object (i.e., intra-item configural processing, which may be considered one of many spatial properties of an object). To assess the effect of alERC volume differences on configural processing, we designed a novel eye-tracking paradigm, tested a group of older adult participants at varying stages of decline, and used a recently developed method to quantify the volume of their alERC (Maass et al., 2015) and surrounding MTL cortical regions and hippocampal subfields (Olsen et al., 2013).
Materials and Methods
Participants
Thirty-eight community-dwelling older adults were recruited from the community in Toronto. Data from three participants were excluded due to eye-tracker failure. The remaining participants had an average age of 71.7 years (SD: 5.2, range: 58–81, 27 women). Participants had previously received the Montreal Cognitive Assessment (MoCA) (Nasreddine et al., 2005) within the last 22 months (M: 10.4, SD: 6, range: 0–22) and were selected to provide a distribution of MoCA scores (M: 25.7, SD: 2.5, range: 21–30). The MoCA is sensitive to the presence of mild cognitive impairment, which is associated with MTL/hippocampus volume loss (Jack et al., 1997). Our intention was to select for a participant group who had a good distribution of cognitive abilities and MTL/hippocampal regional volumes. These participants were a subset of an original sample of 40 participants who were chosen such that 20 had scored above the MoCA cutoff score (≥26) and 20 had scored below it (<25) (Olsen, Yeung et al., 2017). Of the 35 participants whose data we report here, 16 scored above the MoCA cutoff score and 19 scored below it. These two groups were matched for age (original sample: t(38) = 1.29, p = 0.20; 35 participants in this study: t(33) = 0.51, p = 0.61) and years of education (original sample: t(38) = 0.51, p = 0.61; 35 participants in this study: t(33) = 0.20, p = 0.84). Despite our attempt to match participants above and below the MoCA cutoff score in terms of demographic characteristics, MoCA and age remained marginally correlated among the 35 participants in this study (r = −0.289, p = 0.07). For the purposes of the present study, we were primarily interested in how MTL volume differences related to cognitive performance rather than how participants who scored above/below the MoCA threshold differed; therefore, for all subsequent analyses, we treated all the participants as a single group.
Participants received a battery of neuropsychological tests to further characterize their cognitive status. The battery consisted of the Logical Memory subtest from the Wechsler Memory Scale, 4th Ed. (Wechsler, 2009), Trails A and B (Reitan and Wolfson, 1985), the Digit Span subtest from the Wechsler Adult Intelligence Scale, 4th Ed. (Wechsler, 2008), the Rey–Osterrieth Complex Figure Test (Osterreith, 1944), the Weschler Abbreviated Scale of Intelligence (Wechsler, 1999), and the Visual Object and Space Perception battery (Warrington and James, 1991). Results of the neuropsychological battery are given in Table 1. All participants had normal or corrected-to-normal vision (with glasses or bifocals) and were screened for color blindness, psychological or neurological disorders, brain damage (i.e., stroke or surgery), and metal implants or health conditions that would have precluded MRI. All participants gave informed consent. This research received ethical approval from the Research Ethics Board of the University of Toronto.
MRI scan parameters
High-resolution T2-weighted images were acquired in an oblique–coronal plane perpendicular to the long axis of the hippocampus (TE/TR = 68 ms/3000 ms, 20–28 slices depending on head size, 512 × 512 acquisition matrix, voxel size = 0.43 × 0.43 × 3 mm, no skip, FOV = 220 mm), on a 3 T Siemens Trio scanner at the Rotman Research Institute at Baycrest (Toronto, Ontario). The first slice was placed anterior to the appearance of the collateral sulcus (including the temporal pole where possible) and the last posterior to the hippocampal tail to ensure full coverage of the entire hippocampus and all of the MTL cortices included in the volumetric analyses for all participants. To confirm slice placement, a T1-weighted MP-RAGE whole-brain anatomical scan (TE/TR = 2.63 ms/2000 ms, 176 slices perpendicular to the AC-PC line, 256 × 192 acquisition matrix, voxel size = 1 × 1 × 1 mm, FOV = 256 mm) was acquired immediately before the T2-weighted scan. The T1-weighted images were also used to estimate total intracranial volume for head-size correction (see “Volume correction for head size” section below).
Manual segmentation
For each participant, L.-K.Y. manually segmented three hippocampal subfields (CA1, dentate gyrus/CA2 and 3, and subiculum) and four MTL cortices (alERC, pmERC, PRC, and PHC) on coronal slices of the T2-wighted structural scans (in-plane resolution: 0.43 × 0.43 mm, 3 mm between slices) using FSLview (version 3.1). Manual segmentation followed the Olsen–Amaral–Palombo (OAP) protocol (Olsen et al., 2013; Palombo et al., 2013; see also the appendix to Yushkevich et al., 2015a) supplemented with a modified version of the protocol provided by Maass et al. (2015) for the subdivisions of the ERC (see Figure 1 for a visualization of the segmentation protocol).
We considered these particular regions for two reasons. First, because these regions are directly connected to the alERC (Suzuki and Amaral, 1994; Burwell, 2000), we wished to explore whether any observed alERC–behavior correlations were mediated by its inputs and outputs. Second, these regions support related cognitive functions/representations that may aid performance. The hippocampus (particularly the trisynaptic loop formed by DG–CA3–CA1) is implicated in pattern separation/completion processes necessary to disambiguate similar stimuli (Norman and O'Reilly, 2003; Rolls, 2016). The PRC, PHC, and hippocampus are theorized to support object/context representations used in memory and perception (Cowell et al., 2010; Barense et al., 2012; Ranganath and Ritchey, 2012).
The OAP protocol follows the guidelines of Insausti et al. (1998) for delineating the ERC and PRC and the guidelines of Pruessner et al. (2002) for delineating the PHC. The ERC was further subdivided into the alERC and the pmERC following the protocol of Maass et al. (2015), which was based on functional connectivity with the PRC and PHC, respectively. See Olsen, Yeung et al. (2017) for a more extensive description of the alERC/pmERC boundary used in our segmentation protocol. We note that, because we followed the protocol of Insausti et al. (1998) to define the lateral edge of the ERC, the resulting lateral boundary of the alERC/pmERC regions in our protocol differs slightly from that of Maass et al. (2015). Our alERC/pmERC regions extend into the collateral sulcus when the depth of the collateral sulcus is “shallow” (depth <1 cm) or “regular” (depth between 1 and 1.5 cm). As a result, the alERC/pmERC subregions defined here overlap with the trans entorhinal region defined by Braak and Braak (1991, 1992) and also with the medial PRC regions used elsewhere in the literature (Krumm et al., 2016; Wolk et al., 2017).
Following the OAP protocol, hippocampal subfield segmentations were based on Amaral and Insausti (1990). The strata radiatum, lacunosum, and moleculare (SLRM) was used to divide the subiculum and CA1 from the CA2/3/dentate gyrus. The internal boundaries between CA1 and the other two regions are detailed in Figure 1 of Palombo et al. (2013). The OAP protocol (developed for use in structural scans of younger adults) typically includes two additional ROIs, which cover the anterior head and the posterior tail of the hippocampus, where the organization of the subfields is more complex and the SLRM is sometimes too indistinct to differentiate hippocampal subfields. There is currently little consensus as to how to subdivide these regions into subfields using in vivo 3 T MRI, which is why it has been our practice to combine them into a single ROI, as do other high-resolution protocols (e.g., see Schlicting and Preston protocol in Yushkevich et al., 2015a). Because the hippocampal subfields within these regions were not segmented into subregions, these regions were excluded from further analysis.
Average volumes for each manually segmented brain region are presented in Table 2 and correlations between brain region volumes are presented in Table 3.
Intra-rater and inter-rater segmentation reliability
Intra-rater reliability was established by comparing the segmentation of five randomly selected scans, completed by the same rater (L.-K.Y.) after a delay of 1–4 months. Inter-rater reliability was evaluated by comparing the segmentation of five randomly selected scans by a second rater (R.K.O) to those of L.-K.Y. Both authors were blinded to MoCA score, task performance, and the identities of participants until after all manual segmentation (including inter-rater and intra-rater reliability) was completed. Reliability was assessed using the intra-class correlation coefficient (ICC, which evaluates volume reliability; Shrout and Fleiss, 1979) and the Dice metric (which also takes spatial overlap into account; Dice, 1945), computed separately for each region in each hemisphere. ICC(3,k) was computed for intra-rater reliability (consistency) and ICC(2,k) was computed for inter-rater reliability (agreement). Dice was derived using the formula 2 * (area of intersecting region)/(area of original segmentation + area of repeat segmentation); a Dice overlap metric of 0 represents no overlap, whereas a metric of 1 represents perfect overlap. Intra-rater and inter-rater reliability results are shown in Table 4. These scores are comparable to reliability values reported previously for manual segmentation of hippocampal subfields and MTL cortices (Wisse et al., 2012; Yushkevich et al., 2015b) and are consistent with our previous work (Olsen et al., 2013; Palombo et al., 2013).
Volume correction for head size
All manually segmented region volumes were corrected for head size using a regression-based method to account for differences in brain size between participants. Estimated total intracranial volume (eTIV) was derived using FreeSurfer (version 5.3) (Buckner et al., 2004). By regressing the volume of each region with eTIV, a regression slope β was obtained for each region (representing the effect of eTIV change on that region's volume). Then, the volume of each region was adjusted by that participant's eTIV using the following formula: The head size correction was computed separately for each region in each hemisphere. Volumes were subsequently summed in each region across the two hemispheres, giving a single volume for each region for each participant.
Eye-tracking task
We developed a novel eye-tracking paradigm assessing intra-item configural processing of conjunctive objects at varying degrees of novelty and investigated whether performance was affected by volumetric differences in the alERC and other MTL regions (Fig. 2A). Participants incidentally viewed individual computer-generated conjunctive objects (comprised of distinct upper and lower halves presented on a gray background; Fig. 2B) for 5 s while their viewing of three equally sized ROIs on the objects (top, middle, and bottom) was recorded using EyeLink 1000/II eye trackers (SR Research). Each participant completed 16 blocks, each containing 21 trials (each block used entirely unique stimuli). Within each block, three objects were repeatedly presented six times each. On the seventh repetition, we assessed the effect of novelty for whole objects (versus parts of objects) by presenting one object from each of three novelty conditions: (1) a repeated object identical to an item previously presented in the same block; (2) a recombined object in which each of the two halves had been presented previously as parts of different objects in the block; and (3) a novel object in which both halves were new. After each trial, participants were asked to rate how well the two parts of the stimulus fit together (on a scale of 1–3) to encourage holistic viewing of the conjunctive objects. No time limit was imposed on answering this question and eye movements were not recorded during this time. Across participants, we counterbalanced for which set of objects were used in each novelty condition.
We defined two primary and three ancillary eye-tracking-based outcome variables for each novelty condition (Fig. 2C). Our first primary outcome variable was the proportion of fixations to the middle ROI (which contained the “join” between the upper and lower halves of the object). Because the middle ROI contained information that would allow participants to distinguish successfully between repeated objects (familiar features in a familiar configuration) and recombined objects (familiar features in a novel configuration), differences in viewing to the middle ROI were taken to reflect differences in configural processing for a particular object. As shown in Figure 2B, this was calculated by dividing the number of fixations made to the middle ROI by the total number of fixations made to the whole object on a per-trial basis and then averaging over all the trials in each novelty condition (repeated, recombined, novel). To avoid skewing the results with trials in which there were too few fixations (where the denominator is very small, leading to larger variability in the proportion measure), we excluded all trials with <5 fixations on the object from this analysis; this accounted for only ∼1.8% of trials overall.
Our second primary outcome variable was the normalized number of fixations to the entire object. This was calculated by taking the average number of fixations made to the objects for all the trials in a given novelty condition (repeated, recombined, or novel) and normalizing by the average number of fixations made to all the objects during their initial presentation in the first repetition. The number of fixations made to objects during the first repetition within each block (when all the objects were entirely novel) served as a baseline for how many fixations that a particular participant would make to an entirely novel object. This followed the procedure that we used in our previous work (Yeung et al., 2013) to derive a normalized eye-tracking-based measure of novelty that controlled for absolute differences in the number of fixations between participants. Because more fixations are made to novel objects compared with previously viewed objects (Althoff and Cohen, 1999), this variable serves as a measure of novelty detection for the object as a whole. A score of 1 or more here indicates that objects in a given novelty condition were being treated as novel (i.e., the same number of fixations were made compared with when the object was entirely novel during the first presentation), whereas a score of <1 indicates that the objects were being treated as though they had been seen previously (i.e., fewer fixations were made compared with when the object was entirely novel).
To further explore changes in the proportion of fixations directed to the middle ROI across novelty conditions, we derived three ancillary eye-tracking measures: the normalized number of fixations to the middle ROI, the normalized number of fixations to the peripheral ROIs, and the number of transitions between ROIs. The first two ancillary measures used the same normalization method used for the normalized number of fixations to the entire object. For both the middle and peripheral ROIs, the average number of fixations made to those respective ROIs in all of the trials of a certain novelty condition was normalized by the average number of fixations made to the entire objects shown in the first repetition (again, serving as a baseline for the number of fixations made to an entirely novel object). As a result of how they were calculated, the normalized number of fixations to the middle and peripheral ROIs necessarily sum to the normalized number of fixations to the entire object. Note that the normalized number of fixations to the middle ROI is distinct and different from the proportion of fixations to the middle ROI. The former is a count of how many fixations were made to the middle ROI, serving as a measure of novelty detection. In contrast, the latter describes the relative distribution of fixations across the object and serves as a measure of configural processing (i.e., the first primary outcome measure described above). The number of transitions between ROIs counted the number of saccades that moved between different parts of the conjunctive objects. In the context of this study, these saccades may reflect spatial processing necessary to bind together different object halves into a coherent object representation.
Stimulus and ROI properties
The conjunctive objects comprised upper and lower halves. In the recombined condition, object halves retained their relative locations from when they were first presented (i.e., an object half that had appeared previously as an upper half stayed as an upper half in the recombined object). The objects varied in width from 264 to 564 pixels (M: 346.6 pixels, SD: 50.0 pixels) subtending 25.8–55.1% of the screen horizontally (M: 33.9%, SD: 4.9%) and covering a horizontal visual angle of 9.6–20.4° (M: 12.6°, SD: 1.8°). In height, the objects varied from 287 to 602 pixels (M: 470.1 pixels, SD: 61.3 pixels), subtending 37.3–78.3% of the screen vertically (M: 61.2%, SD: 8.0%) and covering a vertical visual angle of 11.6–24.1° (M: 18.6°, SD: 2.5°).
The location of each stimulus was jittered pseudorandomly so that the center of the stimulus was not always presented at the center of the screen (Fig. 2B). In all cases, the entire stimulus appeared on the screen (i.e., the jittering did not cause any stimuli to be cut off by the edge of the screen). Despite jittering our stimuli in this fashion, the first fixation still fell on the middle ROI on 68.6% of trials while falling on the peripheral ROIs on 15.2% of trials and off the object entirely for 16.1% of trials (this results from the relatively large size of the stimuli and the requirement that the entire stimuli stay on the screen, constraining the number of possible locations it could take). To address the concern that this might have caused a bias in our results, we have removed the first fixation on every trial from all our analyses. The ROIs were defined based on the size of each individual stimulus. All three ROIs spanned the entire width of each stimulus and each ROI covered exactly one-third of the vertical extent of each stimulus (as depicted in Fig. 2B). Notably, these ROIs excluded the rest of the screen outside of each stimulus.
Eye-tracker setup
The experimental task was presented on a 21.2-inch monitor (36 × 30 cm) at a resolution of 1024 × 768 pixels using Experiment Builder (SR Research). For 25 participants, eye-tracking measures were recorded using an EyeLink II head-mounted eye tracker; for the remaining 13 participants, eye-tracking measures were recorded using an EyeLink 1000 desktop-mounted eye tracker. Repeated-measures ANOVAs showed that there was no main effect of the eye-tracker model in terms of fixations per trial (F(1,34) = 0.88, p = 0.35) or in terms of the proportion of fixations made to the middle ROI (F(1,34) = 0.16, p = 0.69). There was also no interaction of eye-tracker model and condition either in terms of fixations per trial (F(2,68) = 0.80, p = 0.45) or in the proportion of fixations made to the middle ROI (F(2,68) = 0.04, p = 0.96). Accordingly, all analyses were collapsed across the eye-tracker model used.
The EyeLink 1000 sampled at a rate of 1000 Hz with a spatial resolution of 0.01° and an accuracy of 0.25°, whereas the EyeLink II sampled at a rate of 500 Hz with a spatial resolution of 0.01° and an accuracy of 0.5°. Participants were positioned 55 cm away from the monitor; participants using the EyeLink 1000 placed their heads on a chin rest to limit head motion, which was unnecessary with the EyeLink II system because it corrects for head motion. Nine-point calibration was performed before testing, and was repeated until the average gaze error was <1°, with no point having a gaze error exceeding 1.5°. Before each trial, drift correction was performed (which causes the initial fixation on each trial to be focused at the center of the screen), with 9-point calibration being repeated if drift error exceeded 2°.
Statistical analysis
Repeated-measures ANOVAs and planned paired-samples t tests were used to identify differences in each of the primary outcome variables (proportion of fixations to the middle ROI and normalized number of fixations to the entire object) in each novelty condition (i.e., only considering novel, recombined, and repeated trials shown during the seventh repetition). To further clarify the differences in the primary outcome variables across different novelty conditions, we performed paired-samples t tests on the ancillary outcome variables (normalized number of fixations to middle ROI, normalized number of fixations to peripheral ROIs, and number of transitions between ROIs). Theses ancillary variables merely reflected aspects of the primary outcome variables and were not truly independent from them (e.g., the normalized number of fixations to the entire object is the sum of the normalized number of fixations to the middle and peripheral ROIs); therefore, they were not included in the subsequent volumetric analyses.
To assess the importance of each brain region on the primary outcome variables only, multiple regression was used with each outcome variable as the dependent variable and the volumes of the seven brain regions as predictors. For brain regions that significantly predicted task performance, Steiger's Z test was used to explore differences in their correlations across novelty conditions. Additional multiple regression analyses evaluated whether the proportion of fixations directed to the middle ROI in each novelty condition predicted variance in alERC volume unaccounted for by MoCA or age. All statistical tests were two-tailed and conducted at α = 0.05. Mauchly's test of sphericity was applied to repeated-measures ANOVAs; when the assumption of sphericity was violated, the Greenhouse–Geisser correction was applied. Multiple regressions were tested for multicollinearity; residual plots were inspected to check for nonlinearity and heteroscedasticity.
Results
Behavioral results
We ran repeated-measures ANOVAs investigating the effect of novelty condition (repeated, recombined, and novel) on the proportion of fixations directed to the middle ROI and the normalized number of fixations to the entire object (our two primary eye-tracking measures). These showed main effects of novelty condition for the proportion of fixations directed to the middle ROI (F(2,68) = 11.83, p = 1.58 × 10−4, η2 = 0.26), as well as the normalized number of fixations to the entire object (F(2,68) = 62.15, p = 4.45 × 10−16, η2 = 0.65) (Fig. 3). Mauchly's test indicated the assumption of sphericity was violated for the ANOVA on the proportion of fixations to the middle ROI (χ2(2) = 8.88, p = 0.01; Greenhouse–Geisser correction was applied, ε = 0.81). t tests showed that a greater proportion of fixations were directed to the middle ROI for repeated objects than recombined objects (t(34) = 2.99, p = 0.005). In turn, a greater proportion of fixations directed to the middle ROI for recombined objects than novel objects (t(34) = 2.20, p = 0.03; Fig. 3A). t tests comparing the normalized number of fixations with the entire object showed fewer fixations to repeated objects than recombined objects (t(34) = 7.86, p = 3.74 × 10−9) and fewer fixations for recombined objects than novel objects (t(34) = 3.03, p = 0.005) (Fig. 3B). This pattern matched our previously reported findings (Yeung et al., 2013). Figure 3C shows the unnormalized fixation counts to the entire object, the middle ROI, and the peripheral ROIs.
Using our ancillary outcome measures to investigate the differences in the proportion of fixations to the middle ROI across conditions, we found that this was not driven by differences in the normalized number of fixations directed to the middle ROI (repeated vs recombined: t(34) = 1.88, p = 0.07; recombined vs novel: t(34) = 0.38, p = 0.70; Fig. 4A). Rather, it is caused by the greater normalized number of fixations directed to the peripheral ROIs in the recombined and novel conditions (repeated vs recombined: t(34) = 6.88, p = 6.33 × 10−8; recombined vs novel: t(34) = 2.98, p = 0.005; Fig. 4B).
One explanation for this pattern of fixations is that processing of spatial information in the configurally important middle ROI is necessary for object recognition in all conditions, but recombined objects (which have a novel configuration) and novel objects (which have a novel configuration and novel features) require additional processing of the peripheral features for successful identification. In contrast, viewing to the middle ROI provides sufficient information for successful identification of repeated objects. This interpretation is supported by the data on transitions between ROIs (Fig. 4C): fewer transitions between ROIs were made to the repeated objects compared with objects in the other two novelty conditions (repeated vs recombined: t(34) = 11.26, p = 5.17 × 10−13, repeated vs novel: t(34) = 13.07, p = 8.12 × 10−15); however, the difference between recombined and novel objects only trended toward significance (t(34) = 1.97, p = 0.057).
After viewing each conjunctive object for 5 s, participants rated how well the two parts of the object fit together on a scale of 1–3 (1 = did not go together well, 3 = went together well). Paired-samples t tests showed no differences in the fit ratings between repeated (M: 1.62, SD: 0.36) and recombined (M: 1.60, SD: 0.39) objects (repeated vs recombined: t(34) = 0.61, p = 0.54). However, novel items (M: 1.75, SD: 0.49) had a slightly higher fit rating than items in the other two novelty conditions (novel vs repeated: t(34) = 2.02, p = 0.052; novel vs recombined: t(34) = 2.89, p = 0.007).
Volumetric imaging results
Multiple regression analysis was used to assess the influence of MTL/hippocampal subregion volume on the proportion of fixations made to the middle ROI (the primary eye-tracking measure of configural processing). Using brain region volumes as predictors revealed that only alERC volume was a significant predictor for the proportion of fixations made to the middle ROI for all three novelty conditions from the seventh repetition (repeated: t(30) = 3.60, p = 0.001, β = 0.60, sr = 0.53; recombined: t(30) = 4.13, p = 3.16 × 10−4, β = 0.66, sr = 0.59; novel: t(30) = 4.51, p = 1.12 × 10−4, β = 0.73, sr = 0.65; Table 5). In other words, greater alERC volume predicted a greater proportion of fixations to the middle ROI regardless of the object's novelty. Figure 5 illustrates this relationship graphically in two different ways: as correlations in Figure 5A and as a median split in Figure 5B. This effect holds true even when we consider only the trials in the first six repetitions, treating these trials as though they were an additional novelty condition (t(30) = 3.99, p = 4.58 × 10−4, β = 0.65, sr = 0.58). Steiger's Z test was used to compare the correlations of the alERC volume residuals (i.e., accounting solely for the contribution of the alERC predictor with the contribution of other regions removed) with proportion of fixations to the middle ROI across the three novelty conditions. This revealed no significant differences (repeated vs recombined: Z = 0.52, p = 0.60; recombined vs novel: Z = 0.79, p = 0.43; repeated vs novel: Z = 1.11, p = 0.26). Consistent with recent findings (Olsen et al., 2016), no region was a significant predictor for the normalized number of fixations to the entire object (Table 5), indicating that MTL volume differences did not affect eye movement measures of novelty detection.
To verify that the association between alERC volume and configural processing was not simply driven by global cognitive decline, we investigated whether the proportion of fixations to the middle ROI predicted alERC volume beyond the effect of MoCA or age. Even when MoCA and age were included as predictors in the same model (i.e., having accounted for the variance in alERC volume that they explain), the proportion of fixations to the middle ROI was a significant predictor for alERC volume in all conditions (repeated: t(34) = 3.29, p = 0.002, β = 0.49, sr = 0.48; recombined: t(34) = 4.32, p = 1.48 × 10−4, β = 0.59, sr = 0.58; novel: t(34) = 5.25, p = 1.04 × 10−5, β = 0.67, sr = 0.65; first six repetitions: t(34) = 4.03, p = 3.37 × 10−4, β = 0.56, sr = 0.56; Table 5). Indeed, looking at the semipartial correlations, it accounted for more variance in alERC volume than MoCA, age, or both combined.
These results suggest that our task may be tapping into a form of cognitive decline that is more sensitive to neurodegenerative changes in alERC volume than the cognitive processes assessed by the MoCA or predicted by chronological age. We note that, with our current sample of participants, age was not a significant predictor for alERC volume even if MoCA was excluded as a predictor. However, this is likely due to our participant selection process (in which the group of participants who scored above the MoCA threshold score were matched in age to the group who scored below the MoCA threshold score), rather than suggesting that age has no effect on alERC volume.
As a post hoc analysis, we looked at whether there were laterality effects in the alERC that affected task performance. We repeated the multiple regression models predicting the proportion of fixations to the middle ROI using brain regions as predictors (as in Table 5), splitting the alERC predictor by hemisphere. This did not show any clear laterality effects: comparing the standardized coefficients (betas) of those models, some conditions revealed a stronger association with the left alERC and other conditions with the right alERC [repeated: 0.332 (left), 0.416 (right); recombined: 0.476 (left), 0.325 (right); novel: 0.454 (left), 0.445 (right); first six repetitions: 0.331 (left), 0.483 (right)].
Discussion
In this study, we examined older adults' viewing patterns to conjunctive objects and compared them with the volumes of their MTL regions and hippocampus subfields. Crucially, we observed that the proportion of fixations directed to the critical region of a conjunctive object (a measure of configural processing) was significantly and selectively predicted by alERC volume regardless of an object's novelty. This effect was not driven by any changes in overall viewing. Moreover, we showed that individual differences in viewing, reflecting different degrees of configural processing, were related to individual differences in alERC volume, even after accounting for the effects of MoCA score and age. Together, our results demonstrate that alERC volume was strongly associated with attention to the spatial arrangement of the parts of an object. To the best of our knowledge, this is the first study to demonstrate such a role for the alERC and the first to measure the effects of alERC structural volume on cognition.
Previous neuroimaging studies of the human LEC focused on how that region supports object representations (Schultz et al., 2012; Reagh and Yassa, 2014), whereas rodent studies suggested that the putatively homologous LEC represents spatial properties of objects (Deshmukh and Knierim, 2011; Deshmukh et al., 2012; Tsao et al., 2013). This study connects these lines of evidence, suggesting that the human alERC may support intra-item configural processing (i.e., attention to the spatial arrangement of an object's features). In rodents, the LEC has recurrent connections with the MEC (van Strien et al., 2009), which plays an important role in spatial representation (Moser et al., 2014; Sasaki et al., 2015), and with the hippocampus, which supports relational binding (Olsen et al., 2012) and spatial memory (Schiller et al., 2015). There is evidence supporting a similar pattern of functional connectivity in humans (Maass et al., 2015). When combined with the alERC's putative inputs from the perirhinal cortex, which is believed to support object processing (Cowell et al., 2010; Saksida and Bussey, 2010; Ranganath and Ritchey, 2012), this suggests that the alERC is a critical region where information from both the object and spatial pathways converge.
Although item memory has been connected to the alERC using fMRI (Schultz et al., 2012; Reagh and Yassa, 2014) and to the PRC in convergent work across many different methodologies (e.g., Meunier et al., 1993; Henson et al., 2003; Bowles et al., 2007), we did not observe any memory effects in either region. In addition, unlike previous findings from our group (Barense et al., 2012; Peterson et al., 2012; Erez et al., 2016), we did not observe any conjunctive processing effects related to PRC. However, these results are not necessarily contradictory. Because we used eye-tracking/volumetry in older adults rather than fMRI in younger adults or neuropsychological investigations of patients with large PRC/alERC lesions, it is conceivable that these divergent methodologies are differentially sensitive to subtle degrees of damage. For example, it is possible that mnemonic/perceptual processes related to PRC/alERC-based object representations are more robust and require significant volume (or white matter tract) loss before clear deficits are present, whereas the volume differences here fall below the threshold needed to observe such deficits. However, our results are consistent with findings showing that, when participants with broad MTL lesions (including the alERC) misidentified impossible 2D objects, they showed less viewing to the critical region that identified the object as configurally impossible (Lee and Rudebeck, 2010).
Although we found that age was not a significant predictor for alERC volume after having accounted for the effect of intra-item configural processing, longitudinal structural imaging studies have shown that the volume of MTL regions, including the ERC, decline with age, particularly in older adults over the age of 50 (Raz et al., 2004). Despite our efforts to match participants demographically who scored above and below the MoCA threshold score (and thus minimize the effect of age on regional volumes), it is possible that variation in alERC volume in our sample was related to age. However, this was also likely to be true for all of the other MTL regions we assessed (Raz et al., 2004; La Joie et al., 2013; Wisse et al., 2014). Strikingly, we found that the alERC was the only MTL region with a volume that predicted intra-item configural processing in our experimental task. In this regard, our results are consistent with findings that volume reduction in the entire ERC is associated with poorer memory outcomes in healthy older adults (Rodrigue and Raz, 2004; Fjell et al., 2014).
A particular concern in comparing across different studies of the alERC is the variable definition of its boundaries. We defined the alERC based on histology laterally (Insausti et al., 1998) and on functional connectivity analysis medially (Maass et al., 2015). Our definition of the alERC includes part of the parahippocampal gyrus (adjacent to pmERC) and part of the medial bank of the collateral sulcus (adjacent to PRC). Both of the previous fMRI studies (Schultz et al., 2012; Reagh and Yassa, 2014) that found alERC involvement in object memory placed the alERC mostly on the medial bank of the collateral sulcus. Another group identified a largely coterminous region on the medial bank of the collateral sulcus as the “medial perirhinal cortex” and reported that it plays a role in disambiguating visually similar objects (Kivisaari et al., 2012) and experiences cortical thinning in preclinical Alzheimer's disease (Krumm et al., 2016). The greater medial extent of our alERC might explain the additional role in configural processing of objects that we report. This suggests a continuum in the ERC between more spatial representations in the pmERC and more object-like representations in the alERC. Indeed, cells responsive to spatial properties, cells that are object specific, and cells responsive to combinations of spatial and object-specific dimensions are not strictly separated in the two halves of the ERC; rather, they are widely distributed over both regions (Keene et al., 2016). Further work is necessary to pinpoint definitively the functional heterogeneity of the ERC and the corresponding spatial distribution of these functional subregions.
On a more practical level, an eye-tracking-based cognitive task tracking alERC decline might serve as a cost-effective early screening method for Alzheimer's disease pathology (Crutcher et al., 2009) years before traditional cognitive symptoms appear. Both postmortem histology (Braak and Braak, 1991) and in vivo cerebral blood flow imaging (Khan et al., 2014) show that Alzheimer's disease pathology appears earliest in the alERC. Similarly, morphometric analysis shows that the earliest preclinical Alzheimer's disease atrophy can also be found in the alERC (Miller et al., 2015), suggesting that alERC volume may be a sensitive biomarker for preclinical Alzheimer's disease. If our task taps into a form of cognitive change that is particularly sensitive to changes in alERC volume, then it may be useful for identifying asymptomatic preclinical Alzheimer's disease patients for clinical trials, as well as screening for potential Alzheimer's disease when a treatment is developed.
Footnotes
This work was supported by the Canadian Natural Sciences Engineering Research Council (Discovery and Accelerator Grants to M.D.B.; Canada Graduate Scholarship, Doctoral Program, to L.-K.Y.), the James S. McDonnell Foundation (Scholar Award to M.D.B.), and the Canada Research Chairs Program (J.D.R and M.D.B.). We thank Dr. Nicholas Rule for assistance with multiple regression analysis.
The authors declare no competing financial interests.
- Correspondence should be addressed to Lok-Kin Yeung, Taub Institute, Columbia University Medical Center, 630 West 168th Street, P.O. Box 16, New York, NY 10032. ly2143{at}cumc.columbia.edu