Abstract
The use of information from the hippocampal memory system in motivated behavior depends on its communication with the ventral striatum. When an animal encounters cues that signal subsequent reward, its reward expectancy is raised. It is unknown, however, how this process affects hippocampal dynamics and their influence on target structures, such as ventral striatum. We show that, in rats, reward-predictive cues result in enhanced hippocampal theta and beta band rhythmic activity during subsequent action, compared with uncued goal-directed navigation. The beta band component, also labeled theta's harmonic, involves selective hippocampal CA1 cell groups showing frequency doubling of firing periodicity relative to theta rhythmicity and it partitions the theta cycle into segments showing clear versus poor spike timing organization. We found that theta phase precession occurred over a wider range than previously reported. This was apparent from spikes emitted near the peak of the theta cycle exhibiting large “phase precessing jumps” relative to spikes in foregoing cycles. Neither this phenomenon nor the regular manifestation of theta phase precession was affected by reward expectancy. Ventral striatal neuronal firing phase-locked not only to hippocampal theta, but also to beta band activity. Both hippocampus and ventral striatum showed increased synchronization between neuronal firing and local field potential activity during cued compared with uncued goal approaches. These results suggest that cue-triggered reward expectancy intensifies hippocampal output to target structures, such as the ventral striatum, by which the hippocampus may gain prioritized access to systems modulating motivated behaviors.
SIGNIFICANCE STATEMENT Here we show that temporally discrete cues raising reward expectancy enhance both theta and beta band activity in the hippocampus once goal-directed navigation has been initiated. These rhythmic activities are associated with increased synchronization of neuronal firing patterns in the hippocampus and the connected ventral striatum. When transmitted to downstream target structures, this expectancy-related state of intensified processing in the hippocampus may modulate goal-directed action.
Introduction
The hippocampus is considered essential for episodic memory, which records personally experienced events (“what”) in space (“where”) and time (“when”) (Milner et al., 1998). An important basis for representing the spatiotemporal components of memory traces is the location-selective discharge of hippocampal neurons (“place cells”) as rodents navigate through an environment (O'Keefe and Dostrovsky, 1971). By analyzing the firing rates of many place cells, the rat's position can be reconstructed (Wilson and McNaughton, 1993). The firing rates of hippocampal neurons are modulated by several other factors, such as environmental cues, task context, time, attentional processes, and motivational state (Muller and Kubie, 1987; Markus et al., 1995; Frank et al., 2000; Wood et al., 2000; Leutgeb et al., 2005; Kennedy and Shapiro, 2009; Fenton et al., 2010; Kraus et al., 2013). In addition to rate coding, hippocampal firing is temporally organized by mass activity in the theta band (in rodents: 6–12 Hz). As an animal traverses the place field of a given CA1 cell, its spikes are emitted first during later phases of the theta cycle, progressing to earlier phases when the distance traveled through the field increases (O'Keefe and Recce, 1993; Skaggs et al., 1996). Not only does this theta phase precession provide a means for refined spatial coding by including the distance traveled through the place field (Huxter et al., 2008; Cei et al., 2014), it also aligns activity of sequentially activated cells in a temporally compressed way and harbors a principle for predictive coding of future locations (Lisman and Redish, 2009). For individual neurons, the phase shift on theta cycles seems to be less reliable near the relatively silent peak phase of theta (Skaggs et al., 1996; Maurer et al., 2006). This phase may separate distinct theta sequences of assembly activity and may function as a “reset” allowing a new cycle of encoding and retrieval to begin (Hasselmo, 2005; Dragoi and Buzsáki, 2006). At the same time, this discontinuity in sequential firing is potentially problematic because it limits the length and continuity of memorized or predicted path representations and may degrade net effects of sequence-dependent synaptic plasticity in the network (compare Skaggs et al., 1996; Mehta et al., 2002).
The sequential activation of cell assemblies in the theta cycle is furthermore structured by γ oscillations (40–100 Hz) nested within theta cycles (Bragin et al., 1995; Jensen and Lisman, 1996; Harris et al., 2003). In addition to theta and gamma rhythmicity, activity in the beta band (15–35 Hz) has been reported in hippocampus (Martin et al., 2007; Igarashi et al., 2014; Rangel et al., 2015). Beta band activity includes a first harmonic of theta, corresponding to twice the keynote frequency; these two terms will be used interchangeably here. Although its spectral power was found to depend on the animal's running speed (Terrazas et al., 2005), the behavioral correlates and functions of the hippocampal beta band activity during navigation remain largely unknown.
Hippocampal dynamics are mostly addressed during spatial exploration that is either spontaneous (Vanderwolf, 1969) or motivated by unpredictable, spatially scattered food reward (“pellet-chasing”) (Wilson and McNaughton, 1993; Skaggs et al., 1996; Huxter et al., 2008). Little is known, however, about the effects of temporally specific changes in reward expectancy on hippocampal dynamics. Such changes take place when reward-predictive cues are offered at discrete times during goal-directed navigation. Effects of motivationally salient cues on hippocampal place-field maps were highlighted previously (Lansink et al., 2012), but it remains to be examined how such cues affect rhythmic synchronization in the hippocampus and connected target structures. Especially the ventral striatum is considered a relevant output structure, as it integrates hippocampal information with inputs from the amygdala, prefrontal cortex, and thalamic nuclei (Pennartz et al., 1994; Voorn et al., 2004) and modulates behavioral output by generating reward-predictive firing (Apicella et al., 1991; Schultz et al., 1992; Pennartz et al., 2011).
Here, we first tested how reward-predictive cues that trigger goal-directed navigation affect hippocampal rhythmicity and phase coding compared with very similar, but uncued navigation. Second, we investigated how cue-driven motivational changes in hippocampal dynamics cohere with modulation of the ventral striatum as a key structure influencing goal approach (compare Dalley et al., 2005).
Materials and Methods
All experimental procedures were in accordance with the Dutch national guidelines on the conduct of animal experiments. The procedures and validation of the behavioral task, electrophysiological recordings, and data acquisition are described in detail previously (Lansink et al., 2012). Briefly, three male Wistar rats (300–450 g) were implanted with a dual-bundle, 12 tetrode microdrive aiming for targets in the dorsal hippocampal area CA1 (4.0, 2.5 mm relative to bregma) and the ventral striatum (−1.8, 1.4 mm relative to bregma) (Lansink et al., 2007). Additional electrodes were positioned in the hippocampal fissure for recording local field potentials (LFPs) and in the corpus callosum overlying the hippocampus for referencing tetrode signals. A ground screw was inserted in the contralateral parietal bone. Histology and tetrode track reconstruction confirmed that hippocampal recording sites were in CA1 area and in ventral striatum; 78% of the ventral striatal recording sites were in the core and 22% in the shell region. Rats were kept on a reversed day-night cycle such that experiments were conducted in their active period. During the recording period, rats were food restricted to 85%–90% of their free-feeding weight.
Behavior and data acquisition
Implanted rats were trained to collect sucrose solution rewards by approaching and nose poking into a fluid well underneath an illuminated cue light in a fully automated, Y-shaped maze consisting of three identical chambers surrounding an equilateral center platform (Ito et al., 2008; Lansink et al., 2012). Each chamber contained three combinations of a cue light and a fluid well, one on each wall (see Fig. 1A). Nose pokes into the fluid wells were registered by interruption of infrared beams, which could trigger a solenoid valve system to deliver sucrose solution (15%) into the fluid well. The rat's movement activity was tracked by 3 infrared beams per chamber, one of which was located at the chamber's entrance.
Naive rats learned to associate a discrete cue (i.e., the illumination of a cue light) and reward availability in daily sessions of 135 trials that each started with cue light presentation. In each 9 trial block, cue lights were presented once in random order. Nose poke responses (>500 ms) in the cued fluid well within 15 s following light onset were rewarded with sucrose solution (70 μl), whereas responses to other fluid ports in this period were neither rewarded nor punished. Cue lights were dimmed after a random interval 1–4 s following fluid delivery or when the 15 s response period had elapsed and an intertrial interval started (10–20 s randomly selected). Intertrial interval duration doubled when rats failed to break the infrared beam between its current chamber and the central triangle. In intertrial interval periods, rats exhibited fluid well approach behaviors very similar to the periods when cues were illuminated, allowing comparison of neuronal activity patterns associated to cued versus noncued fluid well approaches. Rats were trained on this schedule until they performed above criterion (i.e., 90% correct responses on the first 90 trials in 3 consecutive sessions). Following this phase, rats were subjected to sessions in which the probability of reward after a correct nose poke depended on the spatial position of the chamber relative to environmental cues in the laboratory space. Reward probability was 75% for one of the chambers (“75% chamber”), the location of which was different for each rat, whereas the other two chambers yielded reward in only 25% of the trials (“25% chambers”). Reward was provided according to a pseudorandom schedule in which correctly performed trials were rewarded in 3 of 4 (75% chamber) or 1 of 4 (25% chambers) cue illuminations at a given fluid well. All other parameters were identical to the first set of sessions. The approach behavior of the rats as well as parameters, such as power, coherence (weighted phase lag index [WPLI]), spike-field coherence (pairwise phase consistency [PPC]), number of phase-locked units, and LFP phase coupling were not different in sessions using the probabilistic reward schedule compared with the 100% reward availability sessions, and all sessions were therefore pooled. A total of 23 sessions, in which rats performed above criterion, were included in our analysis (9 sessions with 100% reward availability and 14 under the probabilistic reward schedule). Analyses including single unit activity comprised 17 of these sessions (9 sessions with 100% reward probability and 8 under the probabilistic schedule).
Neuronal activity, behavioral video tracking data, and photo-beam breaks were recorded using a 64 channel Cheetah data acquisition system (Neuralynx). Waveforms were saved in 1 ms windows each time the voltage signal exceeded a manually preset threshold (32 kHz; gain: 1000–5000×; filter settings: 600–6000 Hz). LFPs were sampled continuously at a rate of 1690 Hz (gain 500; filter settings 1–475 Hz). The rat's position was tracked with light-emitting diodes on the rat's head stage (60 frames/s; resolution of 0.4 cm/pixel).
Data analysis
Spike sorting.
Individual units were identified on the basis of spike waveform events sharing similar waveform properties, including peak amplitude, energy, and principal components using automated and manual clustering software (KlustaKwik and MClust). Clusters of events were discarded if (1) they did not show a characteristic spike waveform or a consistent waveform profile across tetrode leads, (2) >0.1% of the spike intervals was <2 ms, and (3) the number of events was <50. Putative interneurons were excluded from the dataset on the basis of firing rate (>8 Hz) and waveform characteristics, such as low peak-to-valley width and valley shape (hippocampus: 2 units; ventral striatum: 3 units). A total of 194 hippocampal and 195 ventral striatal units were included in the analyzed dataset. All analyses were conducted with MATLAB (The MathWorks) using in-house developed software, the Chronux toolbox and FieldTrip (Oostenveld et al., 2011).
Power and coherence of LFPs.
LFP traces were cleared of 50 Hz and its odd harmonics and rereferenced (i.e., the mean of all channels in a given structure was subtracted from the traces to minimize the influence of external oscillatory activity on power and coherence measures). A weighted average of power was computed by first taking the average metric over all channels per session. Each session mean was then multiplied by the number of behavioral events (e.g., chamber entries, cue illuminations) in the session, after which the mean power was computed over the grand sum of session means. The same procedure was followed for coherence.
Power.
Time-frequency decompositions were constructed for 10 s intervals around chamber entries using the multitaper method from the Chronux toolbox in a time window of 2 s and a step size of 0.1 s, resulting in a frequency resolution of 0.5 Hz. The bandwidth product was 3.5 with 6 tapers. Background power was computed by taking the average power levels in a 5 s period before a cue light switched on and no (cued) nose poke event was registered. Relative power was computed by dividing the absolute power by the baseline resulting in a reference value of 1.
Power spectral densities.
These were estimated using Welch's method in windows of 2 s around theta peaks. Segments of the traces with 75% overlap were transformed with a Hamming window and averaged. Normalization for comparison with theoretical harmonic values was performed by setting power in theta band to 1. Multitaper methods yielded similar results.
Coherence.
Coherence between LFPs was assessed with the WPLI (Vinck et al., 2011), which is relatively insensitive to volume conduction, noise, power variations, and sample size. For the hippocampal LFPs, similar results were obtained with conventional coherence measures and without rereferencing of the data. WPLI was estimated for a time window of 6 s surrounding chamber entry events by computing power and the cross-spectrum using a sliding window of 1 s with steps of 0.1 s.
Velocity.
The running speed of the rat was computed at each position (60 frames/s, step 0.016 s) using the distance that the rat traveled in 0.05 s time windows. The mean (± SEM) running speed across sessions for cued trials was 48.3 ± 1.2 cm/s and for noncued trials 37.0 ± 1.0 cm/s (Wicoxon's sum rank test, p < 10−5). To assess whether the differences in oscillatory activity in cued versus noncued trials could be ascribed to the running speed of the rats, power and coherence (WPLI) across sessions were recomputed as described above for velocity-matched trials between cued and noncued conditions. Using a sampling-without-replacement procedure, cued trials were paired to noncued trials from the same session showing the least mean velocity difference over an interval of 1 s surrounding chamber entry. Trial pairs with a velocity difference >5 cm/s were excluded. This procedure included a total of 1684 trial pairs (73 ± 3 trials/session; n = 23 sessions). In a subsequent analysis using velocity-matched trial pairs, the mean (± SEM) running speed across sessions was 31.1 ± 0.9 cm/s for cued trials and 31.4 ± 0.9 cm/s for noncued trials (Wilcoxon's matched-pairs signed-rank test [WMPSR], not significant).
Firing rate modulation by oscillations.
Modulation of a unit's firing rate by oscillations was determined by first filtering the LFP traces recorded from the hippocampal fissure using a Chebyshev Type 1 bandpass filter between 6 and 12 Hz for the theta band and between 15 and 20 Hz for the beta band. The analysis included spikes in periods in which the power in the theta or beta band divided by the total power (1–250 Hz) was ≥0.1. Binned spikes (15°/bin) were then plotted relative to two successive periods of theta or beta oscillations (0° corresponds to the peak of the cycle). Peaks and troughs in the firing phase histograms were determined with the locfit curve fitting method (Chronux toolbox). The phase angles for the first and, if applicable, second peak were determined by taking the mean of all phase values between the troughs flanking the peak. Spike distributions were tested for nonuniformity over the phase interval of the peak with the Rayleigh test.
Beta-theta phase relations.
Phase phase relationships between theta and beta band (or first harmonic) activity were determined by plotting each spike time in a session as a function of beta versus theta phase. The phase of beta is expected to change about twice as fast as the theta phase; therefore, it is assumed that the cloud of spike phase data points per session can be approximated by a line with a slope of arctan(2) and a certain offset. The phase lag of beta with respect to theta is ϕlag = ϕbeta − 2 × ϕtheta.
Spike-LFP coherence.
The phase between spiking activity and the LFP recorded in the hippocampal fissure was investigated using the PPC measure (Vinck et al., 2010, 2012). The LFP was filtered using a Hanning window with a length of 5 cycles per frequency step of 1 Hz. For every frequency unit, the instantaneous LFP phase was computed for all possible time points through convolution, allowing to readily extract the phases of individual spikes. The PPC was computed for all pairs of spikes in a time window of 6 s synchronized on cued or spontaneous chamber entries. Time-frequency representations were generated using a 500 ms sliding window with a step size of 10 ms and included for each window neurons that fired at least 50 spikes in that window. PPC traces were computed for a 1 s time window centered on cued or spontaneous chamber entries and included neurons firing ≥50 spikes in both conditions. Statistical differences between conditions were assessed with WMPSR (p < 0.05).
Phase precession.
Position data of the rat were linearized for the three Y-maze chambers by first rotating the (x,y) position samples in each chamber such that all chambers overlapped in a common projected space and then by collapsing the y-coordinates such that the approach to each separate reward site was represented by one linear path. The spikes that were generated in a 2 s time window surrounding chamber entries were mapped to the linearized path. The centers of the firing fields of neurons were identified by first binning the path into segments of 10 pixels/4 cm and then by identifying the bin with the maximum firing rate, labeled the place field center. The borders of the firing fields were determined by the first bin relative to the maximal firing rate bin to each site that contained less spikes than 10% of the maximal firing rate bin. Position values corresponding to the included spikes were scaled between −1 and 1.
Spike phase.
The LFP trace recorded at the hippocampal fissure was bandpass filtered using a zero phase lag forward and reverse first-order Butterworth filter (8–12 Hz for theta, 15–20 Hz for beta). The phase of individual spikes was extracted from the Hilbert transform of the filtered LFP trace for theta and beta. A phase of 0° corresponds to the peak of the LFP and 180° to the trough following the peak. Beta cycles were partitioned in 2 groups, beta-1 contained the cycles that closely aligned the rising slope of theta and beta-2 were the cycles that approximately ran along the falling slope of theta (see Figure 6A, right).
Phase precession.
A circular regression line was fit to the spike phase plots as described by Kempter et al. (2012); the angle of the fitted circular regression line was limited between 0 and −1. A neuron was classified as phase precessing when there was a significant (p < 0.05) linear-circular correlation between spike phase and linearized position, when the slope of the fitted regression line fell within −0.05 and −0.75 and with R2 > 0.2.
Peak phase spikes.
Peak phase spikes were defined as occurring in the phase range of [−40°, 40°] relative to the theta peak and emitted after the rat passed one-fourth of the firing field (i.e., the normalized position of the rat was between [−0.5,1], where [−1, 1] is defined as the full firing field). Similar results were obtained when different phase ranges was used to select peak phase spikes (e.g., [−20°, 20°]).
Results
Reward cues enhance theta band activity and trigger beta band activity
We first examined LFPs recorded near the hippocampal fissure and in the CA1 pyramidal layer from rats performing a visually cued goal-approach task in a Y-shaped maze (Ito et al., 2008; Lansink et al., 2012) (Fig. 1A). Situated around a triangular central platform, each of the three Y-maze chambers harbored three cue lights with reward ports underneath. At illumination of a cue light, conditioned rats readily initiated an approach response from their self-chosen location in the maze to the illuminated goal site, where they briefly waited before a sucrose-solution reward was delivered. When the cue light switched off, rats often initiated further approach responses, now to nonilluminated goal sites. This allowed us to contrast oscillatory patterns exhibited during virtually identical behaviors; namely, approaches that were driven by temporally discrete cues raising reward expectancy versus noncued (i.e., spontaneous) approaches.
Power in the theta (6–10 Hz) and beta (15–20 Hz) bands was significantly higher during cued than noncued approaches. In both situations, however, the changes in these frequency bands showed approximately the same temporal relation to task events (Fig. 1B,C). Power in the theta band peaked just before the rat entered the cued chamber, whereas beta band power peaked at chamber entry and declined to baseline before the rat reached the goal site. The relationship between changes in theta and beta band and power was highly nonlinear, which was apparent from the significantly larger relative power difference between cued and noncued conditions for beta than for theta band activity (Fig. 1C; ratio of average power in cued to noncued conditions for theta: 1.14 ± 0.03 and for beta 1.33 ± 0.03; WMPSR; p < 2 × 10−4). In real-time LFP traces, beta band activity was recognizable as “shoulders” on theta peaks (Fig. 1D,E). Analyses of peak-triggered averages of raw hippocampal theta oscillations confirmed this profile, expressing beta activity peaks as being, at least in part, out of phase with the main theta peaks (Fig. 1E). The enhancement of beta band activity during cued approaches was, however, not limited to the falling phase of theta, as will be indicated below, which is in agreement with the (approximately) theta-doubled frequency observed in Figure 1B, C. Theta and beta frequency oscillations maintained a stable phase relationship, which was apparent from the relatively constant phase delay within hippocampal traces across time in both cued and noncued conditions (Fig. 1F,G).
No significant power differences were detected in the γ range (30–100 Hz) in relation to chamber entries.
Cued goal approach strengthens hippocampal synchronization
Communication between neuronal populations may be regulated by rhythmic synchronization within and between multiarea networks (Engel et al., 1991; Fries, 2005; Bosman et al., 2014). We tested whether the increase in theta and beta power was paralleled by strengthened synchronization between hippocampal LFPs, recorded at different sites across the dorsal CA1 pyramidal cell layer using the WPLI (Vinck et al., 2011). This coherence measure is relatively insensitive to volume conduction, noise, power variations, and sample size. As for power, the hippocampal WPLI for the theta and beta band peaked when the rat entered a chamber and was significantly stronger for cued than for noncued chamber entries (Fig. 2A,B).
For both frequency bands, the increases in power and coherence (WPLI) were not aligned to cue onset (Fig. 1B, “1”), which was the earliest predictor of upcoming reward, but started on average about a second later, approximately upon chamber entry. Indeed, when power and WPLI were aligned to cue onset, the increases were more dispersed in time, indicating that cue onset itself is not strongly modulating oscillatory synchronization (Fig. 3A,B).
By itself, the increase in theta and beta power could result from the attribution of reward value to the cue-action sequence by associative learning, or from the physical change of cue light illumination. We addressed this question by contrasting sessions in which rats were learning the task (responding correctly to < 90% of cue light illuminations; precriterion sessions) to sessions in which rats were highly proficient in task execution (> 90% correct responses to cue lights; postcriterion sessions). Theta and especially beta band power increased more strongly for cued approaches in postcriterion compared with precriterion sessions, confirming a learning effect (Fig. 3C). In line with the learning effect on spectral power, the increase in hippocampal WPLI in the theta and beta range was stronger in postcriterion than precriterion sessions (Fig. 3D).
Theta and beta band activity has been reported to rise with locomotion velocity (see Terrazas et al. (2005), referring to changes in the first harmonic of theta. Other studies showed, however, that only a small fraction of the variance in the power and coherence of theta could be explained by running speed and that these theta features varied more strongly with other aspects of behavioral tasks (Wyble et al., 2004; Montgomery et al., 2009). To test whether differences between cued and noncued approaches can be ascribed to such an effect, we recomputed spectral power and WPLI for cued and noncued trials that were matched for velocity. Under these conditions, the peak power and peak coherence in the theta and beta range were also significantly stronger for cued than for noncued approaches, which renders a confound arising from a velocity difference unlikely (Fig. 3E,F).
Beta band activity affects local hippocampal firing
We next examined whether the cue-induced enhancement in hippocampal LFP synchrony is locally relevant and thus affects CA1 spiking. The majority of 194 CA1 neurons showed general firing modulation by the theta rhythm (i.e., they showed one preferred firing phase during one theta cycle; n = 158; 81%; Fig. 4A,B, left). Another subset of neurons showed two significant preferred firing phases in one theta cycle, indicating an additional beta band-specific firing modulation (Fig. 4A,B, middle; n = 22; 11%). When the spiking of neurons was related to LFP traces filtered in the beta band (15–20 Hz), 54% of the neurons (n = 104 of 194) showed a significant preferred firing phase (Fig. 4A,B, right). Together, these results indicate that a subset of cells preferably fire on one of two beta cycles included in a theta period and other cells fire on each beta cycle (Fig. 4A,B). Population histograms of all cells that exhibited significant phase locking to theta oscillations in the cued condition showed two peaks, in line with the “shoulders” and humps identified in the averages of raw LFP traces (Rayleigh test, p < 10−10) (Mizuseki et al., 2009), whereas in the noncued condition only one peak was present (Fig. 4C, top; Rayleigh test, p < 10−6). Correspondingly, firing probability across the CA1 population exhibited a significant peak to beta band activity in the cued but not uncued condition (Fig. 4C, bottom; Rayleigh test, p < 10−6). Spike locking to beta band activity was consistently related to theta locking as appeared from the linear relationship between the phases of single-unit spikes relative to theta and beta cycles, respectively (Fig. 4D). The median phase delays of beta band activity across all recorded spike-LFP pairs were 107° and 288° relative to the theta cycle (Fig. 4E). This result helps to better understand the manifestation of beta activity in hippocampal LFPs (Figs. 1, 2). First, the phase lags of the LFP beta shoulders in between theta peaks (at ∼100°; Fig. 1F) nicely align with the first spike peak at 107° (Fig. 4E). Second, that a second distinct beta peak is not clearly visible in the smoothed raw LFP (Fig. 1D,E) can be understood from the finding that the second (spiking) peak (∼288°; Fig. 4E) lies close to the theta peak (0 or 360°; compare Fig. 1F).
To probe the temporal evolution of hippocampal firing modulation by LFPs across the task sequence, we assessed spike-field coherence with the PPC measure, which is not biased by sample size effects (Vinck et al., 2010, 2012). Spike-field coherence in the theta and beta bands was most prominent at chamber entry and shortly thereafter, and was significantly elevated in cued versus noncued goal-site approaches (Fig. 5A,B).
Phase precession and spikes emitted at the peak phase of theta
Theta phase precession temporally organizes hippocampal firing (O'Keefe and Recce, 1993; Skaggs and McNaughton, 1996), but thus far the question of whether phase precession is modulated by cue-driven reward expectancy and accompanying beta band activity has remained open. We assessed phase precession by plotting theta- and beta-referenced phases of hippocampal spikes as a function of the animal's position during cued and noncued approaches (Fig. 6A–C). Beta cycles were considered running from peak to peak; beta-1 is the cycle that closely aligns to the rising slope of theta, and beta-2 is the cycle that approximately runs along the falling slope of theta; Figure 6A, right). While confirming significant theta phase precession (94 of 194 neurons, 48%), we also report beta phase precession for a substantial fraction of hippocampal neurons (Fig. 6D). A significantly larger fraction of neurons precessed on the beta cycle that is closely aligned to the rising theta slope (beta-1; 61 of 194 neurons, 31%) compared with the cycle that accompanies theta's falling slope (beta-2; 22 of 194 neurons; 11%). This bimodality of phase precession is consistent with existing evidence (Skaggs et al., 1996; Yamaguchi et al., 2002; Maurer et al., 2006). In combination with the stable phase relation between theta and beta band activity (Fig. 1F,G), this result indicates that beta phase precession should be considered a straightforward reflection of theta phase precession. Thus, phase precession on beta cycles is not independent from theta phase precession, but it does offer temporal delineation of theta phase precession segments by way of LFP and corresponding spike dynamics (Fig. 4).
Given the distinction in phase precession associated with the rising and falling slope, we asked whether theta phase precession is resilient against variations in reward expectancy. A similar number of neurons showed precession in the noncued (93 of 194 neurons; 48%) compared with cued conditions (94 of 194 neurons; 48%; Fig. 6D) and the proportion of rising and falling slope precessing neurons was the same in both conditions (cued: beta-1: 61 of 194 neurons; 31%; beta-2: 22 of 194 neurons; 11%; noncued: beta-1: 61 of 194 neurons; 31%; beta-2: 24 of 194 neurons; 12%; Fig. 6D). When comparing cued versus uncued condions, the mean slopes of the fitted regression lines of phase versus position, computed across all significantly phase precessing units, were not significantly different (WMPSR), indicating similar precession rates across conditions.
Many spikes that were emitted from the same spatial position emerged at two different phases of theta cycles: namely, the descending phase and the peak (Fig. 7A,B) (Maurer et al., 2006). The precise temporal organization of firing around the falling slope of theta may therefore not be fully classified with the standard phase precession analysis. We labeled the spike population occurring near the peak of the theta cycle “peak phase spikes.” Peak phase spikes may participate in theta phase precession because they often follow spikes that were emitted at later phases of theta in foregoing cycles (Fig. 7C,D; spikes 4 and 6), although with a large phase difference relative to the previous spike (spikes 3 and 5, respectively). Peak phase spikes are often followed by spikes on the descending slope, which do not demonstrably conform to theta phase precession (Fig. 7C,D; spike 5). We found that the number of peak phase spikes was equally distributed between cued and noncued conditions (Fig. 7E). To assess the significance of peak phase spikes and their suspected large phase difference to the previous spike (“phase jumps”) to phase precession, we computed the phase difference between each peak phase spike ([−40°, 40°] relative to the theta peak) and the temporally nearest spike that was emitted before the peak phase spike and compared this with the phase difference between each spike that was emitted in the troughs of theta ([140°, 220°] relative to the theta peak) and its nearest-neighbor spike with emitted before the trough spike under scrutiny. Spikes within 15 ms of a peak or trough spike and spikes with a positive phase delay (i.e., nonprecessing spikes) were not taken into account. The peak phase spikes showed on average a larger negative phase difference with their nearest neighboring spike than the trough phase spikes (Wilcoxon's rank sum test, p = 10−17; Fig. 7F) indicating that the phase jumps associated to peak phase spikes are a consistent factor in phase precession. Results were similar for cued (Fig. 7F) and noncued conditions. Including also spikes with a positive phase delay relative to the peak or trough spike yielded comparable results. Together, these results indicate that (1) phase precession is a robust coding mechanism that generally applies across both cue-motivated and spontaneous behaviors, and (2) the range of theta phase precession is greater than previously reported. Also peak phase spikes are included in this phenomenon, exhibiting large “phase jumps” relative to spikes in foregoing cycles.
Hippocampal beta band activity coheres with modulation of ventral striatal activity
Probing the impact of altered hippocampal synchrony on a target area, we tested whether neuronal activity in the ventral striatum coheres with changes in hippocampal dynamics. Particularly low-frequency LFP activity recorded from ventral striatum may be susceptible to volume conduction effects arising from distant brain areas, such as the hippocampus (Sirota et al., 2008). Indeed, we found that a seemingly strong hippocampal-ventral striatal LFP coherence in the theta and beta bands vanished when the LFP signals were rereferenced to the locally averaged LFP (see Materials and Methods; this does not imply per se that no physiological coherence exists at the LFP level, but given the lack of sufficient locally recorded LFP differences, such coherence cannot be demonstrated). Ventral striatal firing patterns, however, are not susceptible to this problem, prompting us to investigate phase locking between the spike trains of 195 ventral striatal units and hippocampal LFPs. Ventral striatal firing was modulated by both theta and beta band activity in 115 (59%) and 50 (26%) of 195 neurons, respectively (Fig. 8A,B). These numbers were lower than for hippocampal area CA1 in both frequency bands, as expected from a limited connection strength and partially indirect projection (Groenewegen et al., 1987). Consistent with hippocampal results, spike-field coherence in the theta and beta band was enhanced in close temporal proximity to chamber entries and was significantly stronger during cued compared with noncued conditions (Fig. 8C,D).
Discussion
In this Y-maze task, which contrasted goal-directed navigation triggered by reward-predictive cues with uncued but similar behavior, hippocampal rhythmicity was strongly modulated by the concurrent motivational state of the rat. When reward expectancy was raised by cues, approach to goal sites was accompanied by enhanced theta and especially beta band activity in hippocampal LFPs, compared with uncued approaches (Fig. 1). Periods of enhanced theta and beta band activity were associated with increased synchronization between CA1 LFPs recorded at different locations (Fig. 2) and between LFPs and firing of single pyramidal neurons (Figs. 4, 5). In general, phase precession involved spikes emitted around the peaks of theta, which exhibited large “phase precessing jumps” relative to spikes in foregoing cycles (Figs. 6, 7). However, no modulation of theta phase precession by reward expectancy was found. The strengthened hippocampal synchronization was coexpressed in the connected ventral striatum as a stronger phase-locked firing of single units to hippocampal theta and beta band activity (Fig. 8).
Enhanced rhythmic activity during goal-directed action
Theta and beta band activity were most powerfully expressed, not when reward expectancy was directly raised by cue appearance (Fig. 2A,B), but when the animal had already initiated its approach action, entering the chamber of interest (Figs. 12–3, 5). This coupling of strengthened theta and emerging beta band activity with cue-induced action, jointly with beta-synchronized firing (Figs. 4A–C, 5), suggests that beta band activity intensifies hippocampal processing when the agent needs to combine motivation derived from predictive stimuli with navigation toward goals, potentially to map its direct action path and currently encountered sensory features in great detail (compare Hollup et al., 2001; Johnson et al., 2007; Lansink et al., 2012; Pfeiffer and Foster, 2013; Pezzulo et al., 2014). This view is partially in line with the proposal by Engel and Fries (2010) that beta activity may serve to maintain cognitive set in a top-down manner, as reward expectancy may well contribute to cognitive set. However, like beta band activity in the striatum (Leventhal et al., 2012), our hippocampal data deviate from this framework because the motivational cues in our task did not induce beta activity by themselves but only when coupled to locomotor action.
Diversity of beta band activity in hippocampal-striatal systems
We found beta band activity in the 15–20 Hz range, which was tightly coupled to the theta rhythm (Figs. 1F,G, 4D,E). These aspects of beta activity appear to be different from hippocampal, cortical, or striatal beta oscillations described earlier. For example, Igarashi et al. (2014) found a 20–40 Hz beta coherence between hippocampus and entorhinal cortex. Rangel et al. (2015) reported that a beta band power (15–20 Hz) increase in the hippocampal dentate gyrus was accompanied by a concurrent theta band power decrease. An ∼20 Hz striatal beta activity was characterized as an independent state, distinct from theta oscillations (Berke, 2009). Functionally, these and the current reported types of beta band activity seem to be in some way associated with the processing of behaviorally meaningful cues. However, Berke et al. (2008) showed a 23–30 Hz beta activity in mouse hippocampus that was correlated with novelty.
CA1 [15–20] Hz beta band has received relatively limited attention (Terrazas et al., 2005), possibly because this type of beta band activity is only weakly present under conditions in which hippocampal dynamics were often investigated: that is, during spatial exploration that is either spontaneous (Vanderwolf, 1969) or motivated by unpredictable, spatially scattered food reward (“pellet-chasing”; Wilson and McNaughton, 1993; Skaggs et al., 1996; Huxter et al., 2008) and do not include temporally discrete cues. In rat CA1 LFPs obtained during a pellet-chasing paradigm, we observed beta band power levels that were comparable to the noncued condition in the current study (J.J. Bos, M.A.V., and C.M.A.P.; unpublished data).
Origins of reward expectancy effects in the hippocampus
It is unclear which sources supply the hippocampus with motivational information during ongoing actions. This information may be provided by a hippocampal dopamine signal generated through ventral tegmental afferents to CA1 (Groenewegen et al., 1993; Zahm, 2000). Dopamine signals, however, are expected to arrive in the hippocampus at cue onset, as earliest predictor of reward (Mirenowicz and Schultz, 1994) and not during cue-triggered action. Another source may be the medial prefrontal cortex, in which the single-unit firing patterns correlate with reward-associated action sequences (Mulder et al., 2003) and which can exert population effects on several brain areas, including ventral striatum and VTA (Ferenczi et al., 2016). Moreover, reward expectancy signals coded by striatal neurons (Schultz et al., 1992; Roitman et al., 2005; Lansink et al., 2008) may reenter the corticolimbic system via the ventral pallidal-mediodorsal thalamic route (Zahm and Brog, 1992; Groenewegen et al., 1997). Furthermore, the indirect projection from the medial prefrontal cortex to hippocampus via the thalamic nucleus reuniens contributes to future-path coding during goal-directed navigation (Ito et al., 2015). Alternatively, motivational information may reach the hippocampus through a circuit of emotion-related areas, including the anterior cingulate cortex and amygdala (Cardinal et al., 2002; Rajasethupathy et al., 2015). A complex pattern of projections from amygdalar nuclei to hippocampal CA subfields, subiculum, and lateral entorhinal cortex may convey the emotional salience of sensory stimuli (Robbins et al., 1989; Cardinal et al., 2002). Future research must shed light on the precise composition of the brain network responsible for expectancy-driven navigation.
Theta phase precession includes “peak phase” spikes
We found theta phase precession to be a robust coding mechanism that is stable across cued and noncued approach behaviors. Beta band activity allowed us to partition the theta cycle into two halves, of which precession was clear on the rising slope (beta-1) but not on the falling slope (beta-2; Figs. 6, 7). This is consistent with earlier evidence indicating that the phase shift seems to be less reliable near the theta peak, which thus may be thought of as giving rise to a gap in phase precession (Skaggs et al., 1996; Yamaguchi et al., 2002; Maurer et al., 2006). Temporal organization of firing patterns may be continued in this phase, however, by spikes that occurred during the peak and late ascending slope of theta but were emitted from the same spatial positions as spikes occurring on the falling slope of theta (Fig. 7). We propose that these peak-related spikes may have an additional bridging or look-ahead function relative to spikes on the early ascending slope, thus linking cell assemblies that code longer sequences of positions (compare Dragoi and Buzsáki, 2006; Lisman and Redish, 2009). Here, a recorded CA1 cell is taken to represent a much larger group (assembly) of neurons coding the same position, and the specific timing of peak phase spikes is well suited to link to, and initiate, a new sequence of assemblies, covering the rising theta slope at progressively earlier phases. Maurer et al. (2006) also observed heterogeneous phase distributions of single-cell spikes within the theta cycle, but these were ascribed to cells expressing multiple place fields, thus participating in more than one assembly. In contrast, our findings show that neurons with unitary place fields generate spikes that “jump ahead” to continue precession in the transitional peak zone of theta (Fig. 7), thereby opening a time window to fit in more sequential assemblies than in the conventional scheme, and thus potentially coding more remote places or important events, such as upcoming rewards (Wikenheiser and Redish, 2015; but see Grieves et al., 2016).
Impact of hippocampal theta and beta activity on target areas
Our results suggest that the continuation of structured CA1 firing across the theta cycle, as manifested by peak phase spikes, reduces the portion of temporally unstructured activity that is transmitted to targets, such as the ventral striatum. Enhanced theta and beta band processing was coexpressed in the hippocampus and ventral striatum, as shown by phase locking and phase modulation of the firing rate of individual ventral striatal neurons (Fig. 8). Combined with previous findings on theta-modulated ventral striatal firing (Lansink et al., 2009; van der Meer and Redish, 2011), our findings suggest that navigation in theta-mode provides a baseline process in which ventral striatum (and potentially other hippocampal targets) are regularly supplied with hippocampal updates on spatial-contextual positions that are integrated to guide spontaneous or habitual locomotion. We propose that the induction of a combined hippocampal theta-beta rhythm upon reward-predictive cues and response initiation constitutes a second process, manifest in LFP coherence and beta band firing (Figs. 2, 4, 5) by which the hippocampus may gain prioritized access to systems regulating behavioral output. The neural basis of such a gain effect may lie both in the overall enhanced excitability of CA1 neuronal subsets (Fig. 4) and in the temporally structured dynamics by which spike trains of CA1 are propagated to output areas, such as subiculum and ventral striatum (Groenewegen et al., 1987; Pennartz et al., 1994). This proposal awaits further testing in experiments examining the causal role of hippocampal theta and beta band activity in modulating goal-directed actions. Given the correlative nature of the current findings, such experiments are also needed to reveal whether beta-band modulated firing in the ventral striatum exclusively depends on hippocampal output or on a larger network of afferent structures (Pennartz et al., 1994; Voorn et al., 2004).
Together, these results provide a more versatile and flexible picture of hippocampal dynamics and its impact on motivated behavior than previously suggested by findings on theta and gamma rhythmicity. Cues that raise reward expectancy induce a combined theta-beta band mode in both hippocampus and ventral striatum once goal-directed navigation has been initiated. The coupling between hippocampal beta band activity and phase modulated ventral striatal firing underscores its relevance for the guidance of motivated behaviors (Ito et al., 2008). These results raise further questions on mechanisms of theta phase sequencing: computational studies will be needed to account for the emergence of combined theta-beta band activity, peak phase spikes, and phase jumps in dual-oscillator as well as attractor type of models (Tsodyks et al., 1996; Wallenstein and Hasselmo, 1997; Mehta et al., 2002; Geisler et al., 2007; Burgess, 2008).
Footnotes
This work was supported by Human Frontier Science Program Grant RGP0127/2001, Netherlands Organization for Scientific Research VICI Grant 918.46.609, European Union Grant 270108, and Human Brain Project HBP SGA1 to C.M.A.P. and Netherlands Organization for Scientific Research VENI Grant 863.11.010 to C.S.L. We thank Jeroen J. Bos for the use of rat hippocampal LFP data recorded during a pellet chasing task; Fernando Lopes da Silva for comments on the manuscript; Ole Jensen for general feedback; Kenneth D. Harris and A. David Redish for the availability of unit isolation software KlustaKwik and MClust, respectively is highly appreciated. We are grateful to Trevor W. Robbins, Barry J. Everitt, Rutsuko Ito, Carol A. Barnes and Bruce L. McNaughton for their help in designing the behavioral paradigm.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Carien S. Lansink, Science Park 904, 1098 XH Amsterdam, The Netherlands. c.s.lansink{at}uva.nl. Martin A. Vinck's present address: Ernst Strüngmann Institute (ESI) for Neuroscience in Cooperation with Max Planck Society, Frankfurt, Germany. Jadin C. Jackson's present address: Medtronic, Inc. Minneapolis, MN 55432.