Abstract
Cross-frequency interactions between oscillations in local field potentials (LFPs) are thought to support communication between brain structures by temporally coordinating neural activity. It is unknown, however, whether such interactions differentiate between different levels of performance in decision-making tasks. Here, we investigated theta (4–12 Hz) to gamma (30–100 Hz) phase-amplitude coupling in LFP recordings from rat orbitofrontal cortex. Across subsequent periods of a task in which rats learned to discriminate two odors associated with positive and negative outcomes, theta-to-gamma phase-amplitude coupling (PAC) was highest during the odor-sampling task period that preceded a Go/NoGo decision. This task-dependent modulation could not be explained by changes in oscillatory power and appeared to be time-locked to odor onset, not to the timing of the behavioral response. We found that PAC strength during odor sampling correlated with learning, as indexed by improved performance across trials. Moreover, this increase in PAC magnitude was apparent only on trials with correct Go and NoGo decisions, but not incorrect Go decisions. In addition, we found that PAC preferred coupling phase showed consistency over sessions only for correct, but not incorrect trials.
In conclusion, orbitofrontal cortex theta-gamma PAC strength differentiates between different levels of performance in an olfactory decision-making task and may play a role in the generation and utilization of stimulus-based outcome predictions, necessary for adaptive decision-making.
Introduction
The orbitofrontal cortex (OFC) plays a key role in the adjustment of ongoing behavior. In particular, OFC is suggested to update expected outcome representations when rules linking stimuli to outcomes are changed, as both humans and animals with orbitofrontal damage show impairments adjusting their choice behavior after such changes (Rolls et al., 1994; Baxter et al., 2000; Schoenbaum et al., 2002; Fellows and Farah, 2003). The OFC seems especially implicated when model-based, as opposed to model-free, representation of stimuli or states are required for successful behavioral performance (Takahashi et al., 2009; McDannald et al., 2011, 2012).
OFC spike patterns reflect the relative value of cues in a context-dependent manner, both at the single unit and population level (Schoenbaum et al., 1998; Tremblay and Schultz, 1999; Hikosaka and Watanabe, 2000; Wallis and Miller, 2003; Padoa-Schioppa and Assad, 2006; van Duuren et al., 2008). Lacking direct output to primary motor structures (Cavada et al., 2000; Ongür and Price, 2000; Groenewegen and Uylings, 2010; Hoover and Vertes, 2011), it is hypothesized that OFC transmits information about expected outcome to connected structures such as the basolateral amygdala, striatum, and autonomous brainstem centers, to contribute to stimulus-specific reward predictions and prediction errors to influence behavior (Hare et al., 2008; Takahashi et al., 2009; Pennartz et al., 2011a; Schoenbaum et al., 2011; McDannald et al., 2012). Efficient cross-structural communication is thought to require a mechanism by which changes in the excitability of connected structures are synchronized. Synchronized synaptic activity, as observed in local field potentials (LFPs), exhibits oscillatory patterns in different frequency bands that covary with neuronal excitability and can become coherent between structures (Buzsáki and Draguhn, 2004; Fries, 2005; Canolty et al., 2006, 2010; Womelsdorf et al., 2007; Schroeder and Lakatos, 2009). Interactions between oscillations with different frequencies have been found as phase-amplitude coupling (PAC) where the phase of a slow oscillation modulates the amplitude of a faster oscillation.
In the rodent brain, PAC has been examined especially in the entorhinal cortex (Chrobak and Buzsáki, 1998) and hippocampus (Bragin et al., 1995; Chrobak and Buzsáki, 1998; Lisman, 2005; Colgin et al., 2009). PAC may also occur across structures (Sirota et al., 2008; Tort et al., 2008; Maris et al., 2011; van der Meij et al., 2012). Recent reports have related PAC to context-based rule retrieval (Tort et al., 2009) and to working memory (Fujisawa and Buzsáki, 2011). However, it remains unclear whether and how PAC is involved in associative stimulus–reward learning and decision-making. If PAC plays a functional role in this type of learning through temporal patterning of mass activity, it can be hypothesized that changes in PAC strength should correlate with task performance.
We quantified PAC using weighted phase-locking factors (wPLFs; Maris et al., 2011; van der Meij et al., 2012) and found event-related PAC between theta and gamma frequencies in OFC recordings of rats engaged in a two-odor associative learning task. During odor sampling, PAC increased in a learning-dependent fashion and was significantly stronger when it preceded correct versus incorrect decisions.
Materials and Methods
Behavioral task.
The general behavioral paradigm and recording methods have been reported previously (van Wingerden et al., 2010a,b, 2012). Briefly, we recorded LFPs and single unit data from three male Wistar rats engaged in an olfactory decision making task (10 sessions total; Fig. 1). Rats sampled odorants by making a nose poke into an odor port equipped with infrared photobeams, which precisely registered the onset and offset of the poke. Odor pokes were required to last at least 750 ms. Contingent on rules linking stimuli to outcomes, animals decided to make a “Go” response (locomotion to fluid well, waiting and subsequent outcome delivery) or a “NoGo” response (refraining from entering the fluid well for 5 s. poststimulus). Upon entering the fluid well, the delivery of outcome was delayed by 1000 ms. Novel odorants were used on each session. Go/NoGo trials were pseudorandomly ordered in blocks of five plus five trials. Four trial types were isolated, according to signal detection conventions: Hits (Go responses resulting in positive outcome; HIT), False Alarms (Go responses resulting in negative outcome; FA), Correct Rejections (NoGo responses, correctly avoiding negative outcome; CREJ), and Misses (incorrect NoGo responses; these rarely occurred and were not further analyzed; Fig. 1). HIT and CREJ trials were scored as “correct”, MISS and FA trials as “incorrect”. Acquisition of task rules was indexed by generating instantaneous performance scores: the proportion of correct decisions (HITs and CREJ trials) in a 20-trial sliding window. This approach yielded a time-resolved performance index, with values ranging from 55 to 95% correct performance in steps of 5%. When rats performed at or above a behavioral criterion of 85% correct (see Fig. 4A), the session was terminated or, in some cases, a reversal of task contingencies followed (not further analyzed here).
Surgical procedures.
Animals were anesthetized by injecting 0.08 ml/100 g Hypnorm (0.2 mg/ml fentanyl, 10 mg/ml fluanison; VetaPharma) intramuscularly followed by 0.04 ml/100 g Dormicum (5 mg/ml midazolam; Roche) subcutaneously. A microdrive, holding 14 individually moveable tetrode drivers, was implanted onto the left hemisphere dorsal to the OFC with tetrode placements from +2.6 to +4.0 mm AP and −2.0 to −3.6 mm ML of bregma and recordings sampled from −3.0 to −5.0 mm DV). The drivers were loaded with 11 or 12 tetrodes and two reference electrodes. Immediately after surgery, all tetrodes and reference electrodes were advanced 0.8 mm into the brain and subsequently advanced in daily steps to the upper border of the OFC according to a standardized rat brain atlas (± −3.0 mm DV; Paxinos and Watson, 2007). The reference electrodes were lowered to a depth of 1.2–2.0 mm ending up initially ∼1.0 mm dorsal to the set of recording electrodes and adjusted to minimize spiking activity on the reference channel. A stainless steel screw, positioned in the left parietal bone, served as ground.
Electrophysiology
Using a 64-channel Neuralynx system, we recorded from an array of 11–12 tetrodes (Gray et al., 1995), with a reference electrode placed in the supplemental motor cortex dorsal to OFC. Signals were referenced online and LFPs were extracted from each tetrode by low-pass filtering the continuously sampled (at 1874 Hz) signal between 1 and 475 Hz. Events in the behavioral task were coregistered and time-stamped by the Cheetah system. Histological verification indicated that most recordings were made from ventral and lateral orbitofrontal (VO/LO) and agranular insular (AI) cortex with some spread into dorsolateral orbitofrontal cortex (Paxinos and Watson, 2007).
PAC.
To quantify PAC between different frequencies recorded from OFC LFPs, we computed wPLFs (van der Meij et al., 2012). This PAC measure captures the consistency of phase-amplitude locking between two oscillatory processes, yielding a magnitude and preferred phase. These coefficients were calculated using output from a time-resolved spectral analysis. Spectral analysis consisted of convolving LFP signals in a 600 ms window with complex-valued wavelets, one for every frequency of interest. All wavelets were constructed by element-wise multiplication of three-cycle complex exponentials at the frequency of interest with a Hanning taper of equal length. Only those frequencies for which the wavelet had a unique integer number of samples per cycle were obtained, yielding a frequency resolution of ∼1 Hz for the frequencies up to 20 Hz and a resolution of ∼4 Hz for the frequencies up to 100 Hz. Next, we entered the obtained complex-valued signal, incorporating phase and amplitude from one electrode (ch1) and the amplitude-envelope from another electrode (ch2) into the wPLF function. The wPLF is itself complex-valued, and represents on the one hand a normalized index of the consistency of the relation of amplitude from ch2 with the phase of ch1 over trials, and on the other hand the preferred coupling phase, separately for all frequency combinations. This can be visualized in a magnitude or phase bispectrum (see Fig. 1 for magnitude). Further analyses were performed in Matlab, using custom-made scripts. Circular statistics were computed using CircStat, an open-source Matlab toolbox (Berens, 2009). Because all channels were online referenced against one of two reference tetrodes outside the OFC, our results could in principle be confounded by PAC occurring on the reference channels. To control for the possibility of an external source, we re-referenced all recording channels in a separate analysis by subtracting the mean of all channels, thereby minimizing the impact of a global (external) oscillatory structure shared by all channels on wPLF measures, and report the results along with the main findings.
Selection of physiologically relevant PAC.
We were interested in the possible interaction of theta-band frequencies (4–12 Hz) and gamma-range rhythms (30–100 Hz). Therefore, we sorted the channels per session according to theta power, averaged over trials in the time window of interest, and separately, also according to averaged gamma power. We selected the four channels with highest theta and gamma power as phase or amplitude-providing channels per session, respectively. Because wavelet coefficients obtained from the same channel can show spurious coupling artifacts, we only report on wPLFs obtained from between-channel pairs (yielding in total max; N = 16 selected channel combinations per session). Selection of the channels with high-power in the theta or gamma range (i.e., based on a univariate characteristic of the channels) ensures reliable theta phase estimation and sufficient physiological gamma band activity, thereby improving the precision of the wPLF measure (a bivariate characteristic of the signals). High theta or gamma power by itself, however, does not imply larger wPLF magnitude (see Figs. 6, 7). For comparison, corresponding findings using all channel pairs are also reported along with the main findings. Exploratory analyses indicated that, for both S+ and S− type trials, a cluster with high wPLF magnitude could be found in the frequency pair combinations resulting from combining the 7–12 Hz interval for phase-providing channels and the 50–70 Hz interval for amplitude-providing channels (black boxes in Fig. 2B). Further results are reported from this frequencies of interest (FOI) window. In the main analysis, wPLFs were averaged per session (13–15 wPLFs from channel combinations averaged into a mean wPLF per session per trial type). As a control analysis, the main findings were replicated using session-level averages created from the full set of all 110 channel-combination pairs in a session. The results of these control analyses are presented along the main analysis.
Selection of optimal PAC window.
Based on previous research, we hypothesized that theta-gamma PAC would be most prominent in the early odor sampling period, where increases in oscillatory power and spike-field synchronization in these frequency bands coexist (van Wingerden et al., 2010b; Pennartz et al., 2011b). We used an event-related method for calculating PAC, by concatenating complex-valued wPLFs, obtained from 600 ms windows, over trials, thereby retaining high temporal resolution within trials with respect to behavioral events. To determine an optimal window for detecting wPLF magnitude, we used a sliding window approach, calculating magnitudes (from the previously established FOI window) per session in blocks of 600 ms. We next moved this analysis window from −1.4 to +1.4 s relative to odor onset, yielding 12 windows of 600 ms centered on time points from −1.1 to +1.1 s stepped in 200 ms increments (Figs. 2A, 4B–D). We narrowed this period down to the time window that resulted in the highest wPLF magnitudes, averaged across sessions and for both odor exemplars (selected window: centered at +0.5 s after odor onset, spanning from +0.2 to +0.8 s after odor onset).
Comparisons between trial types: magnitude, preferred phase, and phase concentration.
As a first step, whole session averages were constructed for wPLF magnitudes taken from this selected time window, per trial type. To facilitate comparison across sessions, wPLF magnitudes were mean-normalized between trial types under comparison (i.e., once for the S+/S− trial comparison and once for HIT, CREJ, and FA trials). Normalized wPLF magnitude was compared between conditions at the session-average level (N = 10; Fig. 2C,D) using paired tests. The bispectrum across the full range of analyzed frequencies is shown in Figure 2B for HIT and FA trial types. The circular distribution of preferred coupling phase relations was assessed by performing Rayleigh's test for circular uniformity on the preferred coupling phase distribution per trial type. To evaluate differences in mean coupling phase between conditions, we calculated pairwise coupling phase differences between conditions for each channel combination and tested whether the mean phase of the resulting distribution of coupling phase differences was significantly different from zero. To assess differences in the concentration of preferred coupling phases across sessions, we fitted a Von Mises circular normal distribution to the circular distribution of the coupling phases within sessions, yielding the concentration variable κ per trial type. To compare κ values between trial types across sessions, κ values were Z-scored across trial types within sessions. Subsequently, paired t tests on Z-scored κ values were carried out.
Correlations between PAC and behavioral parameters.
To assess whether changes in PAC scores are related to behavioral learning, we performed a correlation analysis between wPLF magnitudes and behavioral performance. First, we calculated time-resolved wPLF magnitudes by averaging wPLF magnitudes using a sliding window block of four trials, per trial type. Second, we extracted the time-resolved behavioral performance scores (ranging from 55 to 95% in steps of 5%; chance level at 50%, see above) for each trial. Third, per trial type within a session, we normalized wPLF magnitudes by subtracting their mean and dividing by their SD across trials (Z-scoring, per trial type). Finally, we calculated R2 values for regressions of Z-scored wPLF magnitude, averaged per performance bin across sessions, on behavioral performance scores (Fig. 3A,B). This analysis was performed both using pooled wPLF magnitudes from all three trial types (Fig. 3A), and also separately per trial type (Fig. 3B). We also computed Pearson correlation coefficients between these time-resolved performance and neural measures using averages per performance bin per session, pooled over sessions (one data point per session per performance bin; Fig. 3C). To corroborate this correlation-based analysis, the change in wPLF magnitude from the block of trials with behavioral performance lower than 70% to the block of trials with performance >85% (ΔwPLF) was calculated per trial type. ΔwPLF values were compared against zero and between trial types.
Additional behavioral parameters (i.e., odor sample time, reaction time from odor port to fluid well) were similarly converted to four trial time-resolved parameters and correlated to time-resolved wPLF magnitudes.
Analysis of changes in wPLF magnitude within sessions.
To compare the evolution of PAC strength directly to the learning curve within sessions, controlling for different session durations, we used linear interpolation to generate equal-length time series of 50 estimates of wPLF magnitudes per trial type. Next, we pooled the interpolated data points from all sessions and computed Pearson's correlation coefficients with 95% confidence intervals between trial number and the pooled interpolated wPLF magnitude, separately for each trial type. This analysis could potentially reveal dynamics in PAC strength nonspecific to trial type but instead related to the passage of time. Alternatively, divergent patterns of PAC strength evolution between trial types would favor an explanation that links PAC to learning stimulus-outcome contingencies.
Controlling for fluctuations in theta power in phase-providing channels. Because wPLF magnitude, and indeed any measure of PAC strength, depends on accurate estimation of the phase of the phase-providing frequency, power differences in the phase-providing frequency between trials or conditions being compared will affect the signal-to-noise ratio of the estimated phase. To examine this potential confound, we computed the correlation between behavioral performance and mean normalized theta power, averaged at the session level, analogous to the correlation analysis between wPLF and performance. We also computed Pearson's correlation coefficients with 95% confidence intervals between trial number and theta power, separately for each trial type, averaging the theta power over the four selected channels and across the selected frequencies (4–12 Hz; see Fig. 6).
Anatomical distribution of PAC scores.
To assess whether PAC strength increases as a function of recording depth, whole-session wPLF magnitudes were extracted per channel pair. As each rat contributed at least three sessions, these sessions were categorized as “superficial,” “intermediate,” or “deep,” based on position in the recording order for that rat. wPLF magnitudes per channel pair were averaged across depth level and Z-scored within a channel pair to facilitate aggregation across sessions. Subsequently, Z-scored wPLF magnitudes were averaged into session means and pairwise comparisons were made between wPLF means at the session level for the different depth levels.
In 2 of 3 rats (N = 7 of 10 sessions), recordings were made with a circular tetrode arrangement. This tetrode placement layout allows at least a crude analysis of the distribution of PAC strength based on anatomical location. wPLF magnitudes were sorted according to phase-providing channel and according to amplitude-providing channel. To combine data from all seven sessions, wPLF magnitudes across channels were normalized between 0 and 1 within a session and compared between anatomical quadrants (posteromedial, posterolateral, anterolateral, and anteromedial).
Results
We trained rats (N = 3) to make Go–NoGo decisions in a two-odor discrimination associative learning task (Fig. 1A). In each session (N = 10 sessions in total) of this task, animals must learn to discriminate between a pair of novel odors. One odor (S+) is predictive of reward (sucrose solution) when followed by a Go response, whereas a second odor (S−) is predictive of an aversive outcome (quinine solution) if followed by a Go response.
Behavioral results
A typical trial begins with the onset of a light stimulus, followed by a nose poke into an odor chamber. Here, the animal samples one of two odors and decides whether or not to collect an associated outcome in a fluid well, located at the other side of the operant cage (Go response). In the course of learning, animals make Go responses to the odor predicting positive outcome (HIT trials; 60 ± 7 or 49.9 ± 0.56% per session; mean ± SEM), as well as to the odor predicting negative outcome (FA trials; 22 ± 5 or 17.4 ± 0.68%). When rats correctly decide to avoid the negative outcome (NoGo), the trial times out after 10 s (CREJ trials; 32 ± 4 or 27.1 ± 0.74%). Only rarely did animals fail to retrieve the positive outcome (MISS trials; 7 ± 1 or 5.5 ± 0.59%) and therefore this trial type was not analyzed further. On average, animals took 120 ± 16 trials to reach a behavioral criterion, defined as 85% of trials correct in a block of 20 trials. Upon reaching the criterion, either the session was terminated or the odor-outcome contingencies were reversed. This reversal phase was not further analyzed here. Over the course of learning, the reaction time (defined as the time from odor offset to entry into the fluid well) decreased significantly for HIT trials (mean ± SEM: 2.94 ± 0.11 s; Pearson correlation r = −0.14, p < 0.01), but not FA trials (2.93 ± 0.14 s, difference not significant; r = −0.09, n.s.).
In previous research using this task, we have found increased spectral power and spike-field rhythmic synchronization in the theta (4–12 Hz) and gamma (30–100 Hz) frequency bands, compared with baseline, specifically during the sampling of outcome-predictive odors in this task (van Wingerden et al., 2010a,b, 2012). Recent reports have indicated that systematic relationships between oscillations in different frequency bands can exist (PAC; Tort et al., 2009; Axmacher et al., 2010; Fujisawa and Buzsaki, 2011; van der Meij et al., 2012). Therefore, we asked whether a systematic relation between theta phase and gamma amplitude exists during odor sampling and whether this relationship is modulated in conjunction with associative learning. If cross-frequency interactions between local field potential oscillations contribute to information processing (Lisman and Buzsáki, 2008; Canolty and Knight, 2010; van der Meij et al., 2012; Lisman and Jensen, 2013) then it can be hypothesized that measures of PAC correlate with decision-making performance. We recorded LFP signals from the OFC of three rats engaged in this behavioral task to evaluate this hypothesis.
Theta-gamma PAC occurs time-locked to odor onset
To quantify the dynamics of phase-amplitude coupling, we calculated between-channel PAC using wPLFs (Maris et al., 2011; van der Meij et al., 2012) per session, in time windows of 600 ms time-locked to odor onset, separately for odor sampling periods using stimuli predicting an appetitive or an aversive outcome. Figure 1A shows the color-coded magnitude of the wPLF bispectrum for an example session, averaged over all trials and selected between-channel combinations (Fig. 1B shows an example of raw and filtered LFP traces). The top row shows the bispectra for the periods of sampling odor predicting sucrose (S+, “positive odor”), whereas the bottom row depicts the sampling periods related to the odor predicting negative outcome (S−, “negative odor”). Figure 2A shows the time course of wPLF magnitude, averaged across sessions and relative to odor onset, for S+ trials (purple) and S− trials (orange). Both time courses peak ∼500 ms into the odor-sampling period and decay to levels not significantly above preodor sampling baseline periods after odor offset (average odor sampling duration, 1.24 ± 0.07 s). Importantly, odor sample duration did not shorten as sessions progressed, which could otherwise have affected wPLF calculation in the chosen time window. Odor sample durations, pooled across sessions, increased within sessions only for HIT trials (median ± SEM: 1.43 ± 0.06 s; Pearson correlation with trial number: r = 0.89, p < 0.001), but not for CREJ (1.03 ± 0.02 s; r = 0.25, p = 0.08) and FA trials (1.06 ± 0.04 s; r = 0.04, p = 0.80). The mean odor sample duration for HITs was significantly larger than for both FA and CREJ trials (both p < 0.05, t test); the difference between FA and CREJ was not significant. Hereafter, we will restrict analysis to the 600 ms time window centered at 0.5 s after odor onset (ranging from 200 to 800 ms after odor onset).
Theta-gamma PAC differentiates between trial types
We next asked whether differences in wPLF magnitude could be observed between trial types. When we compared S+ versus S− odor sampling periods within sessions, controlling for absolute magnitude differences between sessions, we found significantly higher wPLF differences, averaged across the FOI window, for S+ trials compared with all S− trials (difference in wPLF magnitude, mean ± SE; 0.017 ± 0.010, p < 0.05 Wilcoxon test; Figure 2C). This result could be unpacked in a significant difference between HIT trials (S+ trials with a Go response, HIT) and FA trials (S− trials with Go response; FA, difference in wPLF magnitude = 0.022 ± 0.013, p < 0.05 Wilcoxon test). Figure 2B shows the complete bispectrum of wPLF magnitude for HITS (left) and FA trials (right), averaged across all sessions.
To investigate whether these differences between trial types exist only between those channel pairs with large spectral power (selected channels), or whether they generalize to the whole dataset, we reran this analysis using averages of whole-session wPLF magnitudes (N = 10, as in the main analysis) constructed from all available channel pairs (N = 110 per session). This approach introduced additional noise, as signals with low relative spectral power will render less precise theta phase estimations and smaller fluctuations in gamma amplitude compared with the channels selected on univariate characteristics in the main analysis [though higher spectral power (univariate parameter) does not imply larger wPLF magnitude (bivariate parameter) per se, see below]. In addition, we controlled for a potential influence of shared frequency components (originating, for example, from the reference electrodes) on wPLF magnitude by re-referencing the raw LFP data to the mean of the recorded traces. The results of these two control analyses for whole-session PAC are depicted in Figure 2D (top, all channels; bottom, re-referenced channels). These degraded signals still showed the same overall pattern for comparisons between HIT, CREJ, and FA trials. However, comparisons between trial types failed to reach statistical significance using these datasets.
PAC preferred coupling phase differentiates between trial types
The reported wPLF magnitudes indicate that, within sessions, the preferred coupling phase between phase-providing and amplitude-providing channels shows consistency across trials, and more so for correct versus incorrect trial types. If this coupling is a general temporal patterning phenomenon related to evaluative sensory processing in OFC, one could expect that the preferred coupling phase remains relatively constant across sessions and animals as well (consistency in preferred coupling phase over sessions, which could differ between trial types as well). Moreover, the preferred coupling phase could differ between successful and unsuccessful trials.
To address these issues, we first calculated the circular mean of preferred phases across sessions and tested the distribution of these phases for uniformity over the circle (using Rayleigh's test). Second, to measure the concentration of the phase distribution and compare this concentration between conditions, we fitted the dataset of phase angles with a von Mises distribution, from which we obtained the concentration parameter κ. Finally, to evaluate differences in mean phase between conditions, we calculated pairwise phase differences between conditions at the channel combination level and tested whether the mean phase of the resulting distribution of phase differences was significantly different from zero (Table 1). When analyzed over sessions, the distribution of mean preferred coupling phase was significantly different from uniformity for HIT (Rayleigh's Z = 5.65, p < 0.01) and CREJ trials (Z = 3.93, p < 0.05) but not FA trials (Z = 2.43, n.s.). Furthermore, direct comparisons between Rayleigh's Z values, normalized across trial types within sessions, aggregated over sessions, indicated significant differences in phase concentration between HIT and FA trials (dZ: 1.06 ± 0.44, p < 0.05), CREJ and FA trials (dZ: 0.84 ± 0.36, p < 0.05) but not HIT and CREJ trials (dZ: 0.22 ± 0.36, n.s.). Pairwise differences in z-transformed κ values obtained from von Mises distribution fits confirmed this analysis: both κHIT (0.45 ± 0.23) and κCREJ (0.17 ± 0.23) were larger than κFA (−0.62 ± 0.23, p = 0.009 and p = 0.035, respectively). Again, the difference between κHIT and κCREJ was not significant. When we examined the pairwise phase differences between trial types, averaged across the FOI window and across channel-combinations into session-level mean coupling phases, we found that the pairwise phase difference between HIT and FA coupling phases was significantly different from 0 (Δphase = −31.5°, 95% CI: −57.9° to −5.22°, p < 0.05 one-sample circular test for mean direction). Contrasts between HIT and CREJ (Δphase = −15.0°, CI: −39.2° to 9.26°), and between CREJ and FA trial phases (Δphase = −8.51°, CI: −37.4° to 20.4°) did not reach statistical significance. These results suggest that the consistency of preferred PAC coupling phases across sessions is greater for correct compared with incorrect trials and is associated, in HIT trials, with somewhat earlier coupling phases.
Correlations between PAC and behavioral parameters
Next, we analyzed whether wPLF magnitudes are sensitive to learning within sessions. To compare within-session changes in session-level averages of wPLF magnitude with behavioral performance data, we first created a time-series of wPLF magnitudes by computing wPLFs across a block of four trials, and sliding this block across the session. To control for differences in wPLF magnitude between sessions, the magnitude time-series were z-transformed per session. Next, these wPLF magnitudes were binned according to behavioral performance (see Materials and Methods). The regression of mean binned wPLF magnitude on performance score was significant (R2 = 0.97; Pearson's r = 0.98, p < 0.001; Fig. 3A). As a control analysis, we performed a similar regression of wPLF magnitudes, averaged across sessions per performance bin, on behavioral performance for wPLF magnitudes taken from the dataset with session-averages constructed from channel combinations within a session (PACALL) and for wPLF magnitudes from the re-referenced dataset (PACREREF). Both control analyses showed significant correlations as well (RALL2 = 0.93; r = 0.97, p < 0.001; RREREF2 = 0.87; r = 0.93, p < 0.01). When trial types were examined separately, we found significant Pearson correlations between z-transformed wPLF magnitudes and performance scores for HIT (R2 = 0.64 for a regression of wPLF magnitudes for HIT trials, averaged per performance bin across sessions, onto performance; r = 0.80, p < 0.05), CREJ (R2 = 0.54, r = 0.73, p < 0.05), but not FA trials (R2 = 0.14; r = −0.38, p = 0.40; Fig. 3B). A correlation between z-scored wPLF magnitude, averaged per performance bin per session, pooled across all sessions (yielding 1 value per performance bin per session), and performance was also significant for HIT (r = 0.23; 95% CI: 0.12–0.34; p < 0.001) and CREJ trials (r = 0.13; CI: 0.01–0.26; p < 0.05), but not FA trials (r = −0.09; CI: −0.30 to 0.14, n.s.;Fig. 3C).
To corroborate this correlation-based approach, wPLF differences (ΔwPLF, calculated as session averages of wPLF differences, per trial type) between the group of trials with performance >85% correct (high-performance bin) and the group of trials when it was lower than 70% correct (low-performance bin) were calculated. In line with these correlations, we observed ΔwPLF values significantly different from 0 for HIT (0.012 ± 0.003; p < 0.0001 one-sample t test; Fig. 3D) and CREJ trials (0.013 ± 0.003: p < 0.0001). For FA trials, mean wPLF became smaller for high performance trials compared with low-performance trials (−0.010 ± 0.004; p < 0.05). In the high performance bin, ΔwPLFHIT and ΔwPLFCREJ differed significantly from ΔwPLFFA (both p < 0.001, t test), but not from each other. A similar analysis performed on the PACALL dataset yielded comparable results: ΔwPLFHIT-ALL (0.012 ± 0.002) and ΔwPLFCREJ-ALL (0.006 ± 0.002) both differed from 0 (p < 0.001 and p < 0.01, respectively) and from ΔwPLFFA-ALL (0.000 ± 0.003, n.s.; pairwise comparisons to HITALL and CREJALL both p <0.05), but the difference in ΔwPLF between HITALL and CREJALL was not significant. This suggests that the learning-related changes in wPLF magnitude selectively occur in trials with correct decisions.
To compare the evolution of PAC strength directly to the learning curve, we correlated wPLF magnitudes to the percentage of session completion. We controlled for sessions with unequal trial numbers by linear interpolating wPLF magnitudes and behavioral performance per session to equal-length time series of 50 estimates per trial type (Fig. 4). To illustrate the dynamics of PAC in relation to odor sampling, we extended this analysis over 15 partially overlapping windows, stepped in 100 ms increments relative to odor onset (−0.3 to +1.1 s). Using this interpolated dataset, the pattern of results obtained for changes in wPLF magnitude within sessions was comparable to the analysis of wPLF magnitude binned on performance data, i.e., increments in PAC strength with increasing session completion for HIT and CREJ, but not FA trials. As illustrated in Figure 5, Pearson coefficients with 95% confidence intervals for correlations between the percentage of session completion (50 steps of 2%) and all interpolated wPLF magnitude data points, averaged into session-based means for selected channels, were significantly >0 for HIT (r = 0.19; 95% CI: 0.11–0.28, p < 0.001; Fig. 5) and CREJ (r = 0.16; 95% CI: 0.08–0.25, p < 0.001) but not FA trials (r = 0.04; 95% CI: −0.06 to 0.12, n.s). In addition, the correlation coefficients for HIT and CREJ trials were significantly different from the correlation coefficient for FA trials (HIT vs FA: p < 0.05; CREJ vs FA: p < 0.05; Fisher's z test for comparing correlation coefficients). We repeated this analysis for session-based wPLFs, averaged across all channel pairs, with similar results (HITall: r: 0.22, p < 0.001; CREJall: r: 0.13, p < 0.01; FAall: r: 0.09, n.s.). This analysis suggests that the possibility of a nonselective drift in wPLF magnitudes varying with progressive session completion influencing the previous performance-based analysis is unlikely, as the variation with time in the task was different between trials with correct versus incorrect decisions.
Correlations between wPLF magnitude and reaction times (defined as the time between odor offset and fluid well entry for HIT and FA trials) were not significant (both p >0.1); correlations between wPLF magnitude and odor sampling time were significant only for HIT trials (r = 0.68, p < 0.001). Even though odor sample times increased for HIT trials, the onset of elevated PAC remained locked to odor onset: correlations between the time bin exhibiting maximum PAC strength (peak wPLF magnitude), relative to odor onset, and odor sample durations on a given trial were not significant for any trial type. Rather, the peak wPLF magnitude remained time-locked to odor onset, occurring in the time window centered 500 ms after odor onset instead of following the dynamic in median odor sampling time for HIT trials (Fig. 4B, purple line). The bin with peak wPLF magnitude did not differ significantly between the first and last quintile of session length for any trial type. The coefficient for a Spearman rank correlation between median odor sample duration and peak magnitude bin relative to odor offset on HIT trials was significant however (r = −0.15, p = 0.001), suggesting that, as odor sample durations get longer for HITS, peak magnitude shifts away from the termination of odor sampling and the initiation of the behavioral response. Therefore, we interpret the correlation between odor sampling duration and wPLF magnitude as a spurious byproduct of the pre-existing correlations between session completion and wPLF magnitude on the one hand, and between session completion and odor sampling duration on the other, and not as evidence that wPLF magnitudes increase because there would be a prolonged period of elevated PAC.
In conclusion, using performance-based and session length-based metrics we found that PAC became stronger across learning trials, independent of other behavioral parameters, but only for correct (HITs and CREJs) and not incorrect decisions.
PAC magnitude is not positively correlated with theta power
Because the wPLF magnitude calculation depends, in part, on an accurate estimation of the phase of oscillations in the theta band, fluctuations in theta power could bias PAC measures. However, in contrast to wPLF magnitude, we found that z-scored theta power, averaged over all channels within a session and aggregated over sessions, correlated negatively with behavioral performance (R2 = 0.51; Pearson' r = −0.71, p < 0.05; Fig. 6A). Broken down by trial type, we found significant negative correlations for HIT (r = −0.20; p < 0.001; Fig. 6B) and CREJ (r = −0.16, p < 0.05), but not FA trials (r = 0.00, n.s.). Direct comparisons of normalized theta power between the high- and low-performance bin (Δtheta, a.u.) revealed a significant decrease of normalized theta power for HIT (−0.14 ± 0.06; p < 0.05 one-sample t test vs 0; Fig. 6C) and CREJ (−0.15 ± 0.06; p < 0.05), but not FA trials (−0.05 ± 0.12; n.s.). These Δtheta values did not differ significantly between trial types. This analysis suggests that the increase in wPLF magnitude with performance is not due to better phase estimation in LFP epochs with higher theta power, and that differences in wPLF magnitude between trial types can exist in the absence of significant differences in theta power.
Variation in wPLF magnitude according to electrode placement
We obtained multiple recordings from each rat, spread out over recording days. As tetrodes were advanced along the dorsoventral axis daily, early and late sessions predominantly sample dorsal and ventral OFC, respectively. To compare the evolution of wPLF magnitude over different recording depths, sessions were categorized as dorsal, intermediate, or ventral, based on position in the recording order per rat. wPLF magnitudes per channel pair were mean-normalized across depths within a rat. Subsequently, wPLF magnitudes were averaged across channel combinations within sessions (Fig. 7A). Pairwise comparisons between mean-normalized wPLF magnitudes across depth levels revealed significant differences between dorsal (mean ± SEM, −0.030 ± 0.009) and intermediate (0.007 ± 0.008, p < 0.05, t test) levels, as well as between dorsal and ventral (0.023 ± 0.007, p < 0.01; intermediate vs ventral, n.s.) depth levels. These findings suggest that high PAC can be found throughout the intermediate and ventral layers of the OFC and do not support alternative explanations on the origin of OFC PAC that would place the source of the rhythmic activity underlying PAC outside the OFC.
In 2 of 3 rats (N = 7 of 10 sessions), recordings were made with a circular tetrode arrangement (Fig. 7B,C), which allowed us to examine the coarse distribution of PAC strength based on anatomical location in the horizontal plane. To investigate whether wPLF magnitudes were asymmetrically distributed, we sorted session-averaged wPLF magnitudes from selected channel pairs once according to phase-providing channel (Fig. 7B) and once according to amplitude-providing channel (Fig. 7C). To combine data from different sessions, wPLF magnitudes were normalized within a session, averaged into session-level means, and subsequently pooled. We split the channel data in four anatomical quadrants along the anterior–posterior and mediolateral axes and tested the pooled data of all other quadrants against the mean wPLF magnitude per quadrant. In the dataset sorted according to phase-providing channel, only the posteromedial quadrant exhibited significantly different wPLF magnitudes compared with the mean of the other quadrants (mean ± SEM: Zpm = 0.32; Zother − 0.06 ± 0.15; one-sample t test, p < 0.05; Fig. 7D). For the amplitude-providing channel data, both the posterolateral (mean ± SEM: Zpm = 0.95; Zother −0.30 ± 0.12; p < 0.001) and the posteromedial (mean ± SEM: Zpm = −0.52; Zother 0.33 ± 0.20; p < 0.001) quadrant differed from the other quadrants (Fig. 7E). Note that these quadrants exhibiting the largest wPLF deviations do not necessarily correspond to the quadrants with the largest N (visualized per electrode location as circle diameter) of selected channels, i.e., channels with either high univariate power in the theta (phase-providing electrode; Fig. 7B) or gamma band (amplitude-providing electrode; Fig. 7C).
Discussion
Here we demonstrate PAC between theta (4–12 Hz) and gamma (30–100 Hz) oscillatory activity, two frequency bands previously identified in local field potential recordings from rat OFC in the context of the current behavioral task (van Wingerden et al., 2010a,b, 2012; Pennartz et al., 2011b). In our olfactory discrimination task, where rats learn the association between a stimulus predicting positive outcome and a stimulus predicting negative outcome, PAC appeared time-locked to odor sampling (Fig. 1).
When sessions were split into samples of HIT, FA, and CREJ, session averaged PAC was found to be higher on HIT compared with FA trials (Fig. 2).
The concentration of preferred coupling phases, pooled across sessions, also differed between trial types, with preferred coupling phases in HIT and CREJ trials showing more consistency than FA trials. Pairwise comparisons on the preferred coupling phase showed a phase lag for FA error trials compared with HIT trials. This suggests that evaluative processing of two distinct odor stimuli, leading to correct Go and NoGo decisions respectively, is associated with a relatively fixed coupling phase range and, conversely, that inconsistency in preferred coupling phase is associated with errors.
Within sessions, PAC strength correlated positively with behavioral performance, but only for correct trials (HITS and CREJ trials) and resulted in significant tandem increments in PAC for these trials (Fig. 3). These results suggest that higher PAC is associated with good decisions, regardless of the associated behavioral response (Go vs NoGo), and does not depend on nonspecific drift with time across the learning session (Figs. 4, 5). Importantly, these within-session changes did not correlate positively with theta power; rather, in periods of high performance, when wPLF magnitudes differed between trial types, theta power did not (Fig. 6). Finally, we presented initial evidence that PAC strength within the OFC complex is heterogeneous (Fig. 7), with strong PAC occurring between posteromedial phase-providing channels (VO/LO) and posterolateral amplitude-providing channels (LO/AI). With respect to the dorsal-ventral axis, wPLF magnitudes were found to rise quickly with increasing depth to remain at similar levels once the OFC core (intermediate-ventral) was reached. These results are consistent with our previous reports of a source-sink couple contributing to theta-band oscillations located in central OFC (van Wingerden et al., 2010a), and phase heterogeneity of these theta-band oscillations across the OFC recording sites (Vinck et al., 2011). Combined with the results from the locally re-referenced dataset, this renders an explanation of observed PAC in terms oscillations exogenous to OFC (e.g., from piriform cortex) unlikely. In contrast to other reports from human recordings (Axmacher et al., 2010; Voytek et al., 2010), we did not find systematic variation in the exact frequency of the phase-providing channels associated with peak wPLF magnitudes.
Our report of rodent PAC during evaluative stimulus sampling contrasts with a body of studies demonstrating PAC in human EEG/MEG and intracortical subdural electro-corticogram recordings that focused primarily on passive rest, reward processing and motor/verbal or working memory tasks (Canolty et al., 2006; Osipova et al., 2008; Cohen et al., 2009a b; Maris et al., 2011; van der Meij et al., 2012). Recent reports have shown PAC in rodents in relation to context retrieval in the hippocampus (Tort et al., 2009) and working memory processes in the medial prefrontal cortex (Fujisawa and Buzsáki, 2011). Tort et al. (2008, 2009) described PAC within the striatum and hippocampus, also between striato-hippocampal channel pairs. In their data, within-area CA3 theta-gamma PAC was modulated by learning, and persisted with overtraining. Whereas PAC in our data was strongest in the absence of whole-body movement, Tort et al., using an item-in-context learning paradigm, found that PAC was maximal during active spatial exploration in an open-field maze, which they related to context-based retrieval of a task rule. This hippocampal PAC, however, differs from the OFC PAC described here in that it declined sharply upon stimulus sampling (odorant sniffing). Fujisawa and Buzsáki (2011) showed that the phase of a 4 Hz slow oscillation recorded from mPFC modulated the amplitude of gamma-band frequencies during the working memory period of a T-maze task. Our data differ from this study in showing theta-gamma PAC in the absence of locomotion.
Moreover, our measure of PAC strength, wPLF magnitude (van der Meij et al., 2012), not only increases with task acquisition (cf. Tort et al., 2009), but, importantly, also discriminates between correct and incorrect decisions. Incorrect Go trials (FA) showed significantly lower wPLF magnitudes and a significantly lagging coupling phase across sessions compared with correct Go trials (HIT). In addition, these error trials lacked the within-session increase observed in correct trials, both in relation to performance and session completion. Conversely, no differences between PAC for good decisions (HIT vs CREJ trials) were detected either in whole-session PAC (Fig. 2) or in within-session PAC changes (Fig. 3D). A possible explanation for the lack of a significant difference between CREJ and FA trials in whole-session wPLF magnitude, in contrast to the observed difference between HIT and FA trials, could lie in the wPLF magnitude offset (i.e., at session onset; as apparent in Fig. 5) between HIT and CREJ trial types, as the correlation coefficients for wPLF magnitude increases for these two trial types did not differ significantly. Moreover, the relative increase of wPLF magnitude with learning also did not differ significantly between the correct trial types (Fig. 3D). Finally, the interpolated data (Fig. 4) suggested an earlier onset of the rise in wPLF magnitude for HITS compared with the other trial types. As the acquisition phase was terminated immediately after the rats reached a behavioral criterion, our data do not speak in favor or against the possibility that PAC saturation effects could have occurred with prolonged supracriterion performance or overtraining (cf. Tort et al., 2009).
We hypothesize that orbitofrontal theta-gamma PAC in this task is instrumental in supporting stimulus evaluation, i.e., the attribution of outcome expectancies to sensory input, and subsequent decision-making. In support of this hypothesis, we previously showed that particular subsets of OFC single units, conveying odor or expected outcome information, phase-lock preferentially to theta and gamma band frequencies, respectively, during odor sampling (van Wingerden et al., 2010b). The modulation of gamma power by theta phase in this behavioral period may be instrumental in forging links through Hebbian, spike-timing-dependent plasticity (Bi and Poo, 1998; Cassenaer and Laurent, 2007; Canolty and Knight, 2010) between units that are part of assemblies representing stimulus-outcome and action-outcome associations within areas belonging to the orbital and medial prefrontal networks identified anatomically (Ongür and Price, 2000, 2003; Hoover and Vertes, 2011) and functionally (Mar et al., 2011).
Furthermore, theta-band modulated synchrony of firing patterns conveying reward-related information within OFC may be essential for efficient modification of stimulus-outcome and action-outcome associations in distal target areas (Schoenbaum et al., 2009, 2011; Takahashi et al., 2009; Pennartz et al., 2011b; Wallis and Kennerley, 2011) through cross-structural coherence in low frequencies such as theta-band (Schroeder and Lakatos, 2009; Benchenane et al., 2010; Womelsdorf et al., 2010; Fujisawa and Buzsáki, 2011). The superimposed OFC gamma rhythm, probably reflecting local processing (van Wingerden et al., 2010b; Buzsáki and Wang, 2012) may promote precise spike timing within assemblies, possibly enhancing cross-structural spike-timing-dependent associative processes and retrieval of stimulus-outcome mappings. We previously reported that gamma-band synchronization did not appear to discriminate between trial types, under control conditions (van Wingerden et al., 2010b) or under local antagonism of the NMDA-receptor (van Wingerden et al., 2012). It is therefore striking that in the current dataset, clear PAC differences emerge between correct and incorrect trial types, suggesting it is the precise phase-timing, and not the spectral power of the gamma synchronization, that is important for correct performance. As wPLF magnitudes were correlated with correct behavioral performance, these findings are consistent with the hypothesis that PAC may be associated with a high quality of retrieval of expected outcome information associated with stimuli, and may contribute to synaptic plasticity in local circuits (Canolty and Knight, 2010), associated with changes in neural discrimination of Go/NoGo stimuli in OFC (van Wingerden et al., 2012). They are also consistent with the idea, previously coined in relation to OFC gamma rhythm (van Wingerden et al., 2010b), that high theta-gamma PAC may function to suppress (inhibit) premature or incorrect behavioral (Go) responses in Hit and CREJ trials, respectively. Consistent with this, PAC was low in FA trials, quite probably along with an insufficient inhibition of the incorrect Go response. These two accounts both explain why PAC remains high in correct trial types when most learning has been completed. In this light, it would be interesting to examine PAC dynamics under a regime of plasticity blockade. A critical test for the hypothesis that PAC is causally important for the functions in outcome-related learning attributed to the OFC, however, would be the selective disruption of PAC or its components, for example using optogenetic control of spike timing or gamma synchronization (cf. Cardin et al., 2009) in the context of a learning task known to depend on OFC integrity.
Footnotes
- Correspondence should be addressed to Cyriel Pennartz, Center for Neuroscience, Faculty of Science, University of Amsterdam, PO Box 94216, 1090 GE Amsterdam, The Netherlands. C.M.A.Pennartz{at}uva.nl