WWW.JNEUROSCI.ORG
-
The Journal of Neuroscience MBF Stereo Investigator
 QUICK SEARCH:   [advanced]


     
-


HOME
  |  
SEARCH  |   ARCHIVE  |   SUBSCRIBE  |   CONTACT  |   HELP

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow A correction has been published
Right arrow Submit an eLetter
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (90)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Brown, E. N.
Right arrow Articles by Wilson, M. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Brown, E. N.
Right arrow Articles by Wilson, M. A.

 Previous Article  |  Next Article 

The Journal of Neuroscience, September 15, 1998, 18(18):7411-7425

A Statistical Paradigm for Neural Spike Train Decoding Applied to Position Prediction from Ensemble Firing Patterns of Rat Hippocampal Place Cells

Emery N. Brown1, Loren M. Frank2, Dengda Tang1, Michael C. Quirk2, and Matthew A. Wilson2

1 Statistics Research Laboratory, Department of Anesthesia and Critical Care, Harvard Medical School, Massachusetts General Hospital, Boston, Massachusetts 02114-2698, and 2 Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139

    ABSTRACT
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

The problem of predicting the position of a freely foraging rat based on the ensemble firing patterns of place cells recorded from the CA1 region of its hippocampus is used to develop a two-stage statistical paradigm for neural spike train decoding. In the first, or encoding stage, place cell spiking activity is modeled as an inhomogeneous Poisson process whose instantaneous rate is a function of the animal's position in space and phase of its theta rhythm. The animal's path is modeled as a Gaussian random walk. In the second, or decoding stage, a Bayesian statistical paradigm is used to derive a nonlinear recursive causal filter algorithm for predicting the position of the animal from the place cell ensemble firing patterns. The algebra of the decoding algorithm defines an explicit map of the discrete spike trains into the position prediction. The confidence regions for the position predictions quantify spike train information in terms of the most probable locations of the animal given the ensemble firing pattern. Under our inhomogeneous Poisson model position was a three to five times stronger modulator of the place cell spiking activity than theta phase in an open circular environment. For animal 1 (2) the median decoding error based on 34 (33) place cells recorded during 10 min of foraging was 8.0 (7.7) cm. Our statistical paradigm provides a reliable approach for quantifying the spatial information in the ensemble place cell firing patterns and defines a generally applicable framework for studying information encoding in neural systems.

Key words: hippocampal place cells; Bayesian statistics; information encoding; decoding algorithm; nonlinear recursive filter; random walk; inhomogeneous Poisson process; point process.

    INTRODUCTION
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

Neural systems encode their representations of biological signals in the firing patterns of neuron populations. Mathematical algorithms designed to decode these firing patterns offer one approach to deciphering how neural systems represent and transmit information. To illustrate, the spiking activity of CA1 place cells in the rat hippocampus correlates with both the rat's position in its environment and the phase of the theta rhythm as the animal performs spatial behavioral tasks (O'Keefe and Dostrovsky, 1971; O'Keefe and Reece, 1993; Skaggs et al., 1996). Wilson and McNaughton (1993) used occupancy-normalized histograms to represent place cell firing propensity as a function of a rat's position in its environment and a maximum correlation algorithm to decode the animal's position from the firing patterns of the place cell ensemble. Related work on population-averaging and tuning curve methods has been reported by Georgopoulos et al. (1986), Seung and Sompolinsky (1993), Abbott (1994), Salinas and Abbott (1994), and Snippe (1996).

Spike train decoding has also been studied in a two-stage approach using Bayesian statistical methods (Bialek and Zee, 1990; Bialek et al., 1991; Warland et al., 1992; Sanger 1996; Rieke et al., 1997; Zhang et al., 1998). The first, or encoding stage, characterizes the probability of neural spiking activity given the biological signal, whereas the second, or decoding stage, uses Bayes' rule to determine the most probable value of the signal given the spiking activity. The Bayesian approach is a general analytic framework that, unlike either the maximum correlation or population-averaging methods, has an associated paradigm for statistical inference (Mendel, 1995). To date four practices common to the application of the Bayesian paradigm in statistical signal processing have yet to be fully applied in decoding analyses. These are (1) using a parametric statistical model to represent the dependence of the spiking activity on the biological signal and to test specific biological hypotheses; (2) deriving formulae that define the explicit map of the discrete spike trains into the continuous signal predictions; (3) specifying confidence regions for the signal predictions derived from ensemble spike train activity; and (4) implementing the decoding algorithm recursively. Application of these practices should yield better quantitative descriptions of how neuron populations encode information.

For example, the estimated parameters from a statistical model would provide succinct, interpretable representations of salient spike train properties. As a consequence, statistical hypothesis tests can be used to quantify the relative biological importance of model components and to identify through goodness-of-fit analyses spike train properties the model failed to describe. A formula describing the mapping of spike trains into the signal would demonstrate exactly how the decoding algorithm interprets and converts spike train information into signal predictions. Confidence statements provide a statistical measure of spike train information in terms of the uncertainty in the algorithm's prediction of the signal. Under a recursive formulation, decoding would be conducted in a causal manner consistent with the sequential way neural systems update; the current signal prediction is computed from the previous signal prediction plus the new information in the spike train about the change in the signal since the previous prediction.

We use the problem of position prediction from the ensemble firing patterns of hippocampal CA1 place cells recorded from freely foraging rats to develop a comprehensive, two-stage statistical paradigm for neural spike train decoding that applies the four signal processing practices stated above. In the encoding stage we model place cell spiking activity as an inhomogeneous Poisson process whose instantaneous firing rate is a function of the animal's position in the environment and phase of the theta rhythm. We model the animal's path during foraging as a Gaussian random walk. In the decoding stage we use Bayesian statistical theory to derive a nonlinear, recursive causal filter algorithm for predicting the animal's position from place cell ensemble firing patterns. We apply the paradigm to place cell, theta phase, and position data from two rats freely foraging in an open environment.

    MATERIALS AND METHODS
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

Experimental methods

Two approximately 8-month-old Long-Evans rats (Charles River Laboratories, Wilmington, MA) were implanted with microdrive arrays housing 12 tetrodes (four wire electrodes) (Wilson and McNaughton, 1993) using surgical procedures in accordance with National Institutes of Health and Massachusetts Institute of Technology guidelines. Anesthesia was induced with ketamine 50 mg/kg, xylazine 6 mg/kg, and ethanol 0.35 cc/kg in 0.6 cc/kg normal saline and maintained with 1-2% isoflurane delivered by mask. The skin was incised, the skull was exposed, and six screw holes were drilled. The skull screws were inserted to provide an anchor for the microdrive assembly. An additional hole was drilled over the right CA1 region of the hippocampus (coordinates, -3.5 anteroposterior, 2.75 lateral). The dura was removed, the drive was positioned immediately above the brain surface, the remaining space in the hole was filled with bone wax, and dental acrylic was applied to secure the microdrive assembly holding the tetrodes to the skull. Approximately 2 hr after recovery from anesthesia and surgery, the tetrodes were advanced into the brain. Each tetrode had a total diameter of ~45 µm, and the spacing between tetrodes was 250-300 µm. The tips of the tetrodes were cut to a blunt end and plated with gold to a final impedance of 200-300 KOmega .

Over 7 d, the electrodes were slowly advanced to the pyramidal cell layer of the hippocampal CA1 region. During this period the animals were food-deprived to 85% of their free-feeding weight and trained to forage for randomly scattered chocolate pellets in a black cylindrical environment 70 cm in diameter with 30-cm-high walls (Muller et al., 1987). Two cue cards, each with different black-and-white patterns, were placed on opposite sides of the apparatus to give the animals stable visual cues. Training involved exposing the animal to the apparatus and allowing it to become comfortable and explore freely. After a few days, the animals began to forage for chocolate and soon moved continuously through the environment.

Once the electrodes were within the cell layer, recordings of the animal's position, spike activity, and EEG were made during a 25 min foraging period for animal 1 and a 23 min period for animal 2. Position data were recorded by a tracking system that sampled the position of a pair of infrared diode arrays on each animal's head. The arrays were mounted on a boom attached to the animal's head stage so that from the camera's point of view, the front diode array was slightly in front of the animal's nose and the rear array was above the animal's neck. Position data were sampled at 60 Hz with each diode array powered on alternate camera frames; i.e., each diode was on for 30 frames/sec, and only one diode was illuminated per frame. The camera sampled a 256 × 364 pixel grid, which corresponded to a rectangular view of 153.6 × 218.4 cm. The animal's position was computed as the mean location of the two diode arrays in two adjacent camera frames. To remove obvious motion artifact, the raw position data were smoothed off-line with a span 30 point (1 sec) running average filter. Missing position samples that occurred when one of the diode arrays was blocked were filled in by linear interpolation from neighboring data in the off-line analysis.

Signals from each electrode were bandpass-filtered between 600 Hz and 6 kHz. Spike waveforms were amplified 10,000 times and sampled at 31.25 kHz/channel and saved to disk. A recording session consisted of the foraging period bracketed by 30-40 min during which baseline spike activity was recorded while the animal rested quietly. At the completion of the recording session, the data were transferred to a workstation where information about peak amplitudes and widths of the spike waveforms on each of the four channels of the tetrode was used to cluster the data into individual units, and assign each spike to a single cell. For animal 1 (2), 33 (34) place cells were recorded during its 25 (23) min foraging period and used in the place field encoding and decoding analysis.

Continuous EEG data were taken from the same electrodes used for unit recording. One wire from each tetrode was selected for EEG recordings, and the signal was filtered between 1 Hz and 3 kHz, sampled at 2 kHz/channel and saved to disk. The single EEG channel showing the most robust theta rhythm was identified and resampled at 250 Hz, and the theta rhythm was extracted by applying a Fourier filter with a pass band of 6-14 Hz. The phase of the theta rhythm was determined by identifying successive peaks in the theta rhythm and assuming that successive peaks represented a complete theta cycle from 0 and 2pi . Each point between the peaks was assigned a phase between 0 and 2pi proportional to the fraction of the distance the point lay between the two peaks (Skaggs et al., 1996). The theta rhythm does not have the same phase at different sites of the hippocampus; however, the phase difference between sites is constant. Hence, it is sufficient to model theta phase modulation of place cell spiking activity with the EEG signal recorded from a single site (Skaggs et al., 1996).

Statistical methods

The hippocampus encodes information about the position of the animal in its environment in the firing patterns of its place cells. We develop a statistical model to estimate the encoding process and a statistical algorithm to decode the position of the animal in its environment using our model estimate of the encoding process. We divide the experiment in two parts and conduct the statistical paradigm in two stages: the encoding and decoding stages. We define the encoding stage as the first 15 and 13 min of spike train, path, and theta rhythm data for animals 1 and 2, respectively, and estimate the parameters of the inhomogeneous Poisson process model for each place cell and the random walk model for each animal. We define the decoding stage as the last 10 min of the experiment for each animal and use the ensemble spike train firing patterns of the place cells and random walk parameters determined in the encoding stage to predict position.

To begin we define our notation. Let (0, T] denote the foraging interval for a given animal and assume that within this interval the spike times of C place cells are simultaneously recorded. For animals 1 and 2, T = 25 and 23 min respectively. Let tic denote the spike recorded from cell c at time ti in (0, T], where c = 1, ... , C, and C is the total number of place cells. Let x(t) = [x1(t), x2(t)]' be the 2 × 1 vector denoting the animal's position at time t, and let phi (t) be the phase of the theta rhythm at time t. The notation x(t)' denotes the transpose of the vector x(t).

Encoding stage: the place cell model. Our statistical model for the place field is defined by representing the spatial and theta phase dependence of the place cell firing propensity as an inhomogeneous Poisson process. An inhomogeneous Poisson process is a Poisson process in which the rate parameter is not constant (homogeneous) but varies as a function of time and/or some other physical quantity such as space (Cressie, 1993). Here, the rate parameter of the inhomogeneous Poisson process is modeled as a function of the animal's position in the environment and phase of the theta rhythm. The position component for cell c is modeled as a Gaussian function defined as:
&lgr;<SUP>c</SUP><SUB>x</SUB>(t‖x(t), &xgr;<SUP>c</SUP><SUB>x</SUB>)=<UP>exp</UP><FENCE>&agr;<SUB>c</SUB>−<FR><NU>1</NU><DE>2</DE></FR>(x(t)−&mgr;<SUB>c</SUB>)′W<SUP><UP>−</UP>1</SUP><SUB>c</SUB>(x(t)−&mgr;<SUB>c</SUB>)</FENCE>, (1)
where µc = [µc,1, µc,2]' is the 2 × 1 vector whose components are the x1 and x2 coordinates of the place field center, alpha c is the location intensity parameter,
W<SUB>c</SUB>=<FENCE><AR><R><C>&sfgr;<SUP>2</SUP><SUB>c,1</SUB></C><C>0</C></R><R><C>0</C><C>&sfgr;<SUP>2</SUP><SUB>c2</SUB></C></R></AR></FENCE> (2)
is a scale matrix whose scale parameters in the x1 and x2 directions are sigma c,12 and sigma c,22, respectively, and xi xc = [alpha c, µc, Wc]. Our original formulation of the place cell model included non-zero off-diagonal terms of the scale matrix to allow varying spatial orientations of the estimated place fields (Brown et al., 1996, 1997a). Because we found these parameters to be statistically indistinguishable from zero in our previous analyses, we omit them from the current model. The theta phase component of cell c is modeled as a cosine function defined as:
&lgr;<SUP>c</SUP><SUB>&thgr;</SUB>(t‖&phgr;(t), &xgr;<SUP>c</SUP><SUB>&thgr;</SUB>)=<UP>exp</UP>{&bgr;<SUB>c</SUB><UP>cos</UP>(&phgr;(t)−&phgr;<SUB>c</SUB>)}, (3)
where beta c is a modulation factor, phi c is the theta phase of maximum instantaneous firing rate for cell c, and xi theta c = [beta c, phi c]. The instantaneous firing rate function for cell c is the product of the position component in Equation 1 and the theta rhythm component in Equation 3 and is given as:
&lgr;<SUP>c</SUP>(t‖x(t), &phgr;(t), &xgr;<SUP>c</SUP><SUB>x</SUB>)=&lgr;<SUP>c</SUP><SUB>x</SUB>(t‖x(t), &xgr;<SUP>c</SUP><SUB>x</SUB>)&lgr;<SUP>c</SUP><SUB>&thgr;</SUB>(t‖&phgr;(t), &xgr;<SUP>c</SUP><SUB>&thgr;</SUB>), (4)
where xi c = [xi xc, xi theta c]. The maximum instantaneous firing rate of place cell c is exp{alpha c + beta c} and occurs at x(t) = µc and phi (t) = phi c. The instantaneous firing rate model in Equation 4 does not consider the modulation of place cell firing propensity attributable to the interaction between position and theta phase known as phase precession (O'Keefe and Reece, 1993). We assume that individual place cells form an ensemble of conditionally independent Poisson processes. That is, the place cells are independent given their model parameters. In principle, it is possible to give a more detailed formulation of ensemble place cell spiking activity that includes possible interdependencies among cells (Ogata, 1981). Such a formulation is not considered here. The inhomogeneous Poisson model defined in Equations 1-4 was fit to the spike train data of each place cell by maximum likelihood (Cressie, 1993). The importance of the theta phase model component was assessed using likelihood ratio tests (Cassella and Berger, 1990) and Akaike's Information Criterion (AIC) (Box et al., 1994).

After model fitting we evaluated validity of the Poisson assumption in two ways using the fact that a Poisson process defined on an interval is also a Poisson process on any subinterval of the original interval (Cressie, 1993). First, based on the estimated Poisson model parameters, we computed for each place cell the 95% confidence interval for the true number of spikes in the entire experiment, in the encoding stage and in the decoding stage. In each case, we assessed agreement with the Poisson model by determining whether the recorded number of spikes was within the 95% confidence interval estimated from the model.

Second, for each place cell we identified between 10 to 65 subpaths on which the animal traversed the field of that cell for at least 0.5 sec. The region of the place field we sampled was the ellipse located at the place cell center, which contained 67% of the volume of the fitted Gaussian function in Equation 1. This is equivalent to the area within 1 SD of the mean of a one-dimensional Gaussian probability density. The entrance and exit times for the fields were determined using the actual path of the animal. From the estimate of the exact Poisson probability distribution on each subpath we computed the p value to measure how likely the observed number of spikes was under the null hypothesis of a Poisson model. A small p value would suggest that the data are not probable under the Poisson model, whereas a large p value would suggest that the data are probable and, hence, consistent with the model. If the firing pattern along the subpaths truly arose from a Poisson process, then the histogram of p values should be approximately uniform. A separate analysis was performed for subpaths in the encoding and decoding stages of each animal.

Encoding stage: the path model. We assume that the path of the animal during the experiment may be approximated as a zero mean two-dimensional Gaussian random walk. The random walk assumption means that given any two positions on the path, say x(tk-1) and x(tk), the path increments, x(tk- x(tk-1), form a sequence of independent, zero mean Gaussian random variables with covariance matrix:
W<SUB>x</SUB>(&Dgr;<SUB>k</SUB>)=<FENCE><AR><R><C>&sfgr;<SUP>2</SUP><SUB>x<SUB>1</SUB></SUB></C><C>&rgr;&sfgr;<SUB>x<SUB>1</SUB></SUB>&sfgr;<SUB>x<SUB>2</SUB></SUB></C></R><R><C>&rgr;&sfgr;<SUB>x<SUB>1</SUB></SUB>&sfgr;<SUB>x<SUB>2</SUB></SUB></C><C>&sfgr;<SUP>2</SUP><SUB>x<SUB>2</SUB></SUB></C></R></AR></FENCE>&Dgr;<SUB>k</SUB>, (5)
where sigma x12, sigma x22 are the variances of x1 and x2 components of the increments, respectively, rho  is the correlation coefficient, and Delta k = tk - tk-1. These model parameters were also estimated by maximum likelihood. Following model fitting, we evaluated the validity of the Gaussian random walk assumption by a chi 2 goodness-of-fit test and by a partial autocorrelation analysis. In the goodness-of-fit analysis, the Gaussian assumption was tested by comparing the joint distribution of the observed path increments with the bivariate Gaussian density defined by the estimated model parameters. The partial autocorrelation function is an accepted method for detecting autoregressive dependence in time series data (Box et al., 1994). Like the autocorrelation function, the partial autocorrelation function measures correlations between time points in a time series. However, unlike the autocorrelation function, the partial autocorrelation function at lag k measures the correlation between points k time units apart, correcting for correlations at lags k - 1 and lower. An autoregressive model of order p will have a nonzero partial autocorrelation function up through lag p and a partial autocorrelation function of zero at lags p + 1 and higher. Therefore, a Gaussian random walk with independent increments should have uncorrelated increments at all lags and, hence, its partial autocorrelation function should be statistically indistinguishable from zero at all lags (Box et al., 1994).

Decoding stage. To develop our decoding algorithm we first explain some additional notation. Define a sequence of times in (teT], te <=  t0 < t1 < t2, ... , tk < tk+1, ... , < tK <=  T, where te is the end of the encoding stage. The tk values are an arbitrary time sequence in the decoding stage, which includes the spike times of all the place cells. We define Ic(tk) as the indicator of a spike at time tk for cell c. That is, Ic(tk) is 1 if there is a spike at tk from cell c and 0 otherwise. Let I(tk) = [I1(tk), ... , IC(tk)]' be the vector of indicator variables for the C place cells for time tk. The objective of the decoding stage is to find for each tk the best prediction of x(tk) in terms of a probability density, given C place cells, their place field and theta rhythm parameters, and the firing pattern of the place cell ensemble from te up through tk. Because the tk values are arbitrary, the prediction of x(tk) will be defined in continuous time. An approach suggested by signal processing theory for computing the probability density of x(tk) given the spikes in (tetk] is to perform the calculations sequentially. Under this approach Bayes' rule is used to compute recursively the probability density of the current position from the probability densities of the previous position and that of the new spike train data measured since the previous position prediction was made (Mendel, 1995). The recursion relation is defined in terms of two coupled probability densities termed the posterior and one-step prediction probability densities. For our decoding problem these two probability densities are defined as:

Posterior probability density:
<UP>Pr</UP>(x(t<SUB>k</SUB>)‖<IT>spikes in </IT><FENCE>t<SUB>e</SUB>, t<SUB>k</SUB></FENCE>)= (6)
<FR><NU><UP>Pr</UP>(x(t<SUB>k</SUB>)‖<IT>spikes in </IT><FENCE>t<SUB>e</SUB>, t<SUB>k<UP>−</UP>1</SUB></FENCE>)×<UP>Pr</UP>(<IT>spikes at </IT>t<SUB>k</SUB>‖x(t<SUB>k</SUB>), t<SUB>k<UP>−</UP>1</SUB>)</NU><DE><UP>Pr</UP>(<IT>spikes at</IT> t<SUB>k</SUB>‖<IT>spikes in</IT> <FENCE>t<SUB>e</SUB>, t<SUB>k<UP>−</UP>1</SUB></FENCE>)</DE></FR>;
One-step prediction probability density:
<UP>Pr</UP>(x(t<SUB>k</SUB>)‖<IT>spikes in</IT> <FENCE>t<SUB>e</SUB>, t<SUB>k<UP>−</UP>1</SUB></FENCE>) (7)
=<LIM><OP>∫</OP></LIM><UP>Pr</UP>(x(t<SUB>k<UP>−</UP>1</SUB>)‖<IT>spikes in</IT> <FENCE>t<SUB>e</SUB>, t<SUB>k<UP>−</UP>1</SUB></FENCE>)×<UP>Pr</UP>(x(t<SUB>k</SUB>)‖x(t<SUB>k<UP>−</UP>1</SUB>))dx(t<SUB>k<UP>−</UP>1</SUB>).

Before deriving the explicit form of our decoding algorithm, we explain the terms in Equations 6 and 7 and the logic behind them. The first term on the right side of Equation 6, Pr(x(tk)|spikes in (te, tk-1]), is the one-step prediction probability density from Equation 7. It defines the predictions of where the animal is likely to be at time tk given the spike train data up through time tk-1. Equation 7 shows that the one-step prediction probability density is computed by "averaging over" the animal's most likely locations at time tk-1, given the data up to time tk-1 and the most likely set of moves it will make in tk-1 to tk. The animal's most likely position at time tk-1, the first term of the integrand in Equation 7, is the posterior probability density at tk-1. The animal's most likely set of moves from tk-1 to tk, Pr(x(tk)|x(tk-1)), is defined by the random walk probability model in Equation 5 and again below in Equation 8. The formulae are recursive because Equation 7 uses the posterior probability density at time tk-1 to generate the one-step prediction probability density at tk, which, in turn, allows computation of the new posterior probability at time tk given in Equation 6. The second term on the right side of Equation 6, Pr(spikes at tk|x(tk), tk-1), defines the probability of a spike at tk given the animal's position at tk is x(tk) and that the last observation was at time tk-1. This term is the joint probability mass function of all the spikes at tk and is defined by the inhomogeneous Poisson model in Equations 1-4 and below in Equation 9. Pr(spikes at tk|spikes in (te, tk-1]) is the integral of the numerator on the right side of Equation 6 and defines the normalizing constant, which ensures that the posterior probability density integrates to 1.

Under the assumption that the individual place cells are conditionally independent Poisson processes and that the path of the rat during foraging in an open environment is a Gaussian random walk, Equations 6 and 7 yield the following recursive neural spike train decoding algorithm:

State equation:
x(t<SUB>k</SUB>)−x(t<SUB>k<UP>−</UP>1</SUB>)∼N(0, W<SUB>x</SUB>(&Dgr;<SUB>k</SUB>)); (8)

Observation equation:
f(<UP><B>I</B></UP>(t<SUB>k</SUB>)‖x(t<SUB>k</SUB>), t<SUB>k<UP>−</UP>1</SUB>) (9)
=<LIM><OP>∏</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> {[&lgr;<SUP>c</SUP>[x(t<SUB>k</SUB>)]&Lgr;<SUP>c</SUP>[&thgr;(&Dgr;<SUB>k</SUB>)]}<SUP>I<SUB>c</SUB>(t<SUB>k</SUB>)</SUP><UP>exp</UP>{<UP>−</UP>&lgr;<SUP>c</SUP>[x(t<SUB>k</SUB>)]&Lgr;<SUP>c</SUP>[&thgr;(&Dgr;<SUB>k</SUB>)]};

One-step prediction equation:
<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k<UP>−</UP>1</SUB>)=<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k<UP>−</UP>1</SUB>‖t<SUB>k<UP>−</UP>1</SUB>); (10)

One-step prediction variance:
W(t<SUB>k</SUB>‖t<SUB>k<UP>−</UP>1</SUB>)=W<SUB>x</SUB>(&Dgr;<SUB>k</SUB>)+W(t<SUB>k<UP>−</UP>1</SUB>‖t<SUB>k<UP>−</UP>1</SUB>); (11)

Posterior mode
<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>)=<FENCE>W(t<SUB>k</SUB>‖t<SUB>k<UP>−</UP>1</SUB>)<SUP><UP>−</UP>1</SUP>+<LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> A<SUB>c</SUB>[<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>), &thgr;(&Dgr;<SUB>k</SUB>)]W<SUP><UP>−</UP>1</SUP><SUB>c</SUB></FENCE><SUP><UP>−</UP>1</SUP> (12)
×<FENCE>W(t<SUB>k</SUB>‖t<SUB>k<UP>−</UP>1</SUB>)<SUP><UP>−</UP>1</SUP><A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k<UP>−</UP>1</SUB>)+<LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> A<SUB>c</SUB>[<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>), &thgr;(&Dgr;<SUB>k</SUB>)]W<SUP><UP>−</UP>1</SUP><SUB>c</SUB> &mgr;<SUB>c</SUB></FENCE>;

Posterior variance:
W(t<SUB>k</SUB>‖t<SUB>k</SUB>)=<FENCE>W(t<SUB>k</SUB>‖t<SUB>k<UP>−</UP>1</SUB>)<SUP><UP>−</UP>1</SUP>+<LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> A<SUB>c</SUB>[<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>), &thgr;(&Dgr;<SUB>k</SUB>)]W<SUP><UP>−</UP>1</SUP><SUB>c</SUB></FENCE> (13)
<UP>+</UP><LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> &lgr;<SUP>c</SUP>[<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>)]&Lgr;<SUP>c</SUP>[&thgr;(&Dgr;<SUB>k</SUB>)]W<SUP><UP>−</UP>1</SUP><SUB>c</SUB>(<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>)−&mgr;<SUB>c</SUB>)(<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>)−&mgr;<SUB>c</SUB>)′W<SUP><UP>−</UP>1</SUP><SUB>c</SUB>]<SUP><UP>−</UP>1</SUP>,
where the notation ~N(0, Wx(Delta k)) denotes the Gaussian probability density with mean 0 and covariance matrix Wx(Delta k), f(I(tk)|x(tk), tk-1) is the joint probability mass function of the spikes at time tk and x(tk|tk) denotes the position prediction at time tk given the spike train up through time tk. We also define:
A<SUB>c</SUB>[x(t<SUB>k</SUB>‖t<SUB>k</SUB>), &thgr;(&Dgr;<SUB>k</SUB>)]=I<SUB>c</SUB>(t<SUB>k</SUB>)−&lgr;<SUP>c</SUP>[x(t<SUB>k</SUB>‖t<SUB>k</SUB>)]&Lgr;<SUP>c</SUP>[&thgr;(&Dgr;<SUB>k</SUB>)]; (14)
&Lgr;<SUP>c</SUP>[&thgr;(&Dgr;<SUB>k</SUB>)]=<LIM><OP>∫</OP><LL>t<SUB>k<UP>−</UP>1</SUB></LL><UL>t<SUB>k</SUB></UL></LIM> <UP>exp</UP>{&bgr;<SUB>c</SUB><UP>cos</UP>(&phgr;(t)−&phgr;<SUB>c</SUB>)}dt, (15)
where Lambda c[theta (Delta k)] is the integral of the theta rhythm process (Eq. 3) on the interval (tk-1, tk], and lambda c[x(tk|tk)] = lambda xc[tk|x(tk), xi xc] is given in Equation 1. The prediction x(tk|tk) in Equation 12 is the mode of the posterior probability density, and therefore, defines the most probable position prediction at tk given the ensemble firing pattern of the C place cells from te up through tk. We term x(tk|tk), the Bayes' filter prediction and the algorithm in Equations 8-13 the Bayes' filter algorithm. As stated above, the algorithm defines a recursion that begins with Equation 10. Under the random walk model, given a prediction x(tk-1|tk-1) at tk-1, the best prediction of position at tk, i.e., one step ahead, is the prediction at tk-1. The error in that prediction, given in Equation 11, reflects both the uncertainty in the prediction at tk-1, defined by W(tk-1|tk-1), and uncertainty of the random walk in (tk-1, tk], defined by Wx(Delta k). Once the spikes at tk are recorded, the position prediction at tk is updated to incorporate this new information (Eq. 12). The uncertainty in this posterior prediction is given by Equation 13. The algorithm then returns to Equation 10 to begin the computations for tk+1. The derivation of the Bayes' filter algorithm follows the arguments used in the maximum aposteriori estimate derivation of the Kalman filter (Mendel, 1995) and is outlined in . If the posterior probability density of x(tk) is approximately symmetric, then x(tk|tk) is also both its mean and median. In this case, the Bayes' filter is an approximately optimal filter in both a mean square and an absolute error sense. Equation 12 is a nonlinear function of x(tk|tk) that is solved iteratively using a Newton's procedure. The previous position prediction at each step serves as the starting value. Using Equation 13 and a Gaussian approximation to the posterior probability density of x(tk) (Tanner, 1993), an approximate 95% confidence (highest posterior probability density) region for x(tk) can be defined by the ellipse:
[x(t<SUB>k</SUB>)−<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>)]′W(t<SUB>k</SUB>‖t<SUB>k</SUB>)[x(t<SUB>k</SUB>)−<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>)]≤6, (16)
where 6 is the 0.95th quantile of the chi 2 distribution with 2 df.

Interpretation of the Bayes' filter algorithm. The Bayes' filter algorithm has a useful analytic interpretation. Equation 12 shows explicitly how the discrete spike times, Ic(tk) values, are mapped into a continuous position prediction x(tk|tk). This equation shows that the current position prediction, x(tk|tk), is a weighted average of the one-step position prediction, x(tk|tk-1), and the place cell centers. The weight on the one-step prediction is the inverse of the one-step prediction covariance matrix (Eq. 11). If the one-step prediction error is high, the one-step prediction receives less weight, whereas if the one-step prediction error is small, the one-step prediction receives more weight. The weight on the one-step prediction also decreases as Delta k increases (Eq. 11).

The weight on each place cell's center is determined by the product of a dynamic or data-dependent component attributable to Ac in Equation 14 and a fixed component attributable to the inverse of the scale matrices, the Wc values, in Equation 2. For small Delta k, it follows from the definition of the instantaneous rate function of a Poisson process that Ac may be reexpressed as:
A<SUB>c</SUB>[x(t<SUB>k</SUB>), &thgr;(&Dgr;<SUB>k</SUB>)]=I<SUB>c</SUB>(t<SUB>k</SUB>)−<UP>Pr</UP><SUP>c</SUP>(<IT>spike at</IT> t<SUB>k</SUB>‖x(t<SUB>k</SUB>), &thgr;(&Dgr;<SUB>k</SUB>)). (17)
Eq. 17 shows that Ac is equal to either 0 or 1 minus the probability of a spike from cell c at tk given the position at tk and the modulation of the theta rhythm in Delta k. Thus, for small Delta k, Ac gives a weight in the interval (-1, 1). A large positive weight is obtained if a spike is observed when a place cell has a low probability of a spike at tk given its geometry and the current phase of the theta rhythm. This is a rare event. A large negative weight is obtained if no spike is observed when a cell has a high probability of firing. This is also a rare event. Equation 12 shows that even when no cell fires the algorithm still provides information about the animal's most probable position. For example, if no place cell fires at tk, then all the place cell means receive negative weights, and the algorithm interprets the new information in the firing pattern as suggesting where the animal is not likely to be. The inverse of the scale matrices are the fixed components of the weights on the place cell means and reflect the geometry of the place fields. Place cells whose scale matrices have small scale factors---highly precise fields---will be weighted more in the new position prediction. Conversely, place cells with large scale factors---diffuse place fields---will be weighted less. Viewed as a function of c and tk, Ac defines for cell c at time tk the point process equivalent of the innovations in the standard Kalman filter algorithm (Mendel, 1995).

At each step the Bayes' filter algorithm provides two estimates of position and for each an associated estimate of uncertainty. The one-step position prediction and error estimates are computed before observing the spikes at tk, whereas the posterior position prediction and error estimates are computed after observing the spikes at tk. Because the tk values are arbitrary, the Bayes' filter provides predictions of the animal's position in continuous time. The recursive formulation of this algorithm ensures that all spikes in (tetk] are used to compute the prediction x(tk|tk). The Newton's method of implementation of the algorithm shows the expected quadratic convergence in two to four steps when the previous position is the initial guess for predicting the new position. Because the previous position prediction is a good initial guess, and the distance between the initial guess and the final new position prediction is small, a fast, linear version of Equation 12 can be derived by taking only the first Newton's step of the procedure. This is equivalent to replacing x(tk|tk) on the right side of Equation 12 with x(tk|tk-1).

The representation of our decoding algorithm in Equations 8-13 shows the relation of our methods to the well known Kalman filter (Mendel, 1995). Although the equations appear similar to those of the standard Kalman filter, there are important differences. Both the observation and the state equations in the standard Kalman filter are continuous linear functions of the state variable. In the current problem, the state equation is a continuous function of the state variable, the animal's position. However, the observation process, the neural spike trains, is a multivariate point process and a nonlinear function of the state variable. Our algorithm provides a solution to the problem of estimating a continuous state variable when the observation process is a point process.

Bayes' smoother algorithm. The acausal decoding algorithms of Bialek and colleagues (1991) are derived in the frequency domain using Wiener kernel methods. These acausal algorithms give an estimate of x(tk|T) rather than x(tk|tk) because they use all spikes observed during the decoding stage of the experiment to estimate the signal at each tk. To compare our algorithm directly with the acausal Wiener kernel methods, we computed the corresponding estimate of x(tk|T) in our paradigm. The estimates of x(tk|T) and W(tk|T) can be computed directly from x(tk|tk), x(tk|tk-1), W(tk|tk), and W(tk|tk-1) by the following linear algorithm:
A<SUB>k</SUB>=W(t<SUB>k</SUB>‖t<SUB>k</SUB>)W(t<SUB>k<UP>+</UP>1</SUB>‖t<SUB>k</SUB>)<SUP><UP>−</UP>1</SUP>; (18)
<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖T)=<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>‖t<SUB>k</SUB>)+A<SUB>k</SUB>[<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k<UP>+</UP>1</SUB>‖T)−<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k<UP>+</UP>1</SUB>‖t<SUB>k</SUB>)]; (19)
W(t<SUB>k</SUB>‖T)=W(t<SUB>k</SUB>‖t<SUB>k</SUB>)+A<SUB>k</SUB>[W(t<SUB>k<UP>+</UP>1</SUB>‖T)−W(t<SUB>k<UP>+</UP>1</SUB>‖t<SUB>k</SUB>)]A′<SUB>k</SUB>, (20)
where the initial conditions are x(T|T) and W(T|T) obtained from the last step of the Bayes' filter. Equations 18-20 are the well known fixed- interval smoothing algorithm (Mendel, 1995). To distinguish x(tk|T) from x(tk|tk), we term the former Bayes' smoother prediction.

Non-Bayes decoding algorithms. Linear and maximum likelihood (ML) decoding algorithms can be derived as special cases of Equation 12. These are:
<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>)<SUB>L</SUB>=<FENCE><LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> n<SUB>c</SUB>(&Dgr;<SUP>*</SUP><SUB>k</SUB>)W<SUP><UP>−</UP>1</SUP><SUB>c</SUB></FENCE><SUP><UP>−</UP>1</SUP> <LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> n<SUB>c</SUB>(&Dgr;<SUP>*</SUP><SUB>k</SUB>)W<SUP><UP>−</UP>1</SUP><SUB>c</SUB> &mgr;<SUB>c</SUB>, (21)
and
<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>)<SUB><UP>ML</UP></SUB>=<FENCE><LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> A<SUP>*</SUP><SUB>c</SUB>[<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>)<SUB><UP>ML</UP></SUB>]W<SUP><UP>−</UP>1</SUP><SUB>c</SUB></FENCE><SUP><UP>−</UP>1</SUP> <LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM> A<SUP>*</SUP><SUB>c</SUB>[<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>)<SUB><UP>ML</UP></SUB>]W<SUP><UP>−</UP>1</SUP><SUB>c</SUB> &mgr;<SUB>c</SUB>, (22)
where Delta *k is the 1 sec interval ending at tk, nc(Delta *k) is the number of spikes from cell c in Delta *k, and A*c is Ac in Equation 14 with Ic(tk) replaced by nc(Delta *k). The term A*c has approximately the same interpretation as Ac in Equation 14. The derivation of these algorithms is also explained in .

For comparison with the findings of Wilson and McNaughton (1993), we also decoded using their maximum correlation (MC) method. This algorithm is defined as follows. Let lambda ijc denote the value of the occupancy- normalized histogram of spikes from cell c on pixel ij. The MC prediction at tk is the pixel that has the largest correlation with the observed firing pattern of the place cells in Delta *k. It is defined as:
<A><AC>x</AC><AC>ˆ</AC></A>(t<SUB>k</SUB>)<SUB><UP>MC</UP></SUB>=<LIM><OP><UP>max</UP></OP><LL>i, j</LL></LIM><FENCE><FR><NU><LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM>(n<SUB>c</SUB>(&Dgr;<SUP>*</SUP><SUB>k</SUB>)−<A><AC>n(&Dgr;<SUP>*</SUP><SUB>k</SUB>)</AC><AC>&cjs1171;</AC></A>)(&lgr;<SUP>c</SUP><SUB>ij</SUB>−<A><AC>&lgr;<SUB>ij</SUB></AC><AC>&cjs1171;</AC></A>)</NU><DE><FENCE><LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM>(n<SUB>c</SUB>(&Dgr;<SUP>*</SUP><SUB>k</SUB>)−<A><AC>n(&Dgr;<SUP>*</SUP><SUB>k</SUB>)</AC><AC>&cjs1171;</AC></A>)<SUP>2</SUP></FENCE><SUP>1/2</SUP><FENCE><LIM><OP>∑</OP><LL>c<UP>=</UP>1</LL><UL>C</UL></LIM>(&lgr;<SUP>c</SUP><SUB>ij</SUB>−<A><AC>&lgr;<SUB>ij</SUB></AC><AC>&cjs1171;</AC></A>)<SUP>2</SUP></FENCE><SUP>1/2</SUP></DE></FR></FENCE>, (23)
where lambda ij is the average firing rate over the C cells at pixel location ij, and <OVL><IT>n</IT>(<IT>&Dgr;<SUP>*</SUP></IT><SUB><IT>k</IT></SUB>)</OVL> is the average of the spike counts over the C place cells in Delta *k.

Implementation of the decoding algorithms. Position decoding was performed using the Bayes' filter, ML, linear, MC, and Bayes' smoother algorithms. Decoding with the Bayes' filter was performed with and without the theta rhythm component of the model. With the exception of the MC algorithm, position predictions were determined in all decoding analyses at 33 msec intervals, the frame rate of the tracking camera. For the MC algorithm the decoding was performed in 1 sec nonoverlapping intervals. The ML prediction at tk was computed from the spikes in Delta *k, the 1 sec time window ending at tk. To carry out the ML decoding at the frame rate of the camera and to give a fair comparison with the Bayes' procedures, this time window was shifted along the spike trains every 33 msec for each ML prediction. Hence, there was a 967 msec overlap in the time window used for adjacent ML predictions. The same 1 sec time window and 33 msec time shift were used to compute the linear decoding predictions. We tested time windows of 0.25, 0.5, and 1 sec and chose the latter because the low spike counts for the place cells gave very unstable position predictions for the shorter time intervals even when the intervals were allowed to overlap. For integration time windows longer than 1 sec, the assumption that the animal remained in the same position for the entire time window was less valid. Zhang et al. (1998) found a 1 sec time window to be optimal for their Bayes' procedures.

Relationship among the decoding algorithms. The Bayes' filter and the non-Bayes' algorithms represent distinct approaches to studying neural computation. Under the Bayes' filter, an estimate of a behavioral state variable, e.g., position at a given time, is computed from the ensemble firing pattern of the CA1 place cells and stored along with an error estimate. The next estimate is computed using the previous estimate, and the information in the firing patterns about how the state variable has changed since the last estimate was computed. For the non-Bayes' algorithms the computational logic is different. The position estimate is computed from the place cell firing patterns during a short time window. The time window is then shifted 33 msec and the position representation is recomputed. The Bayes' filter relies both on prior and new information, whereas the non-Bayes' algorithms use only current information. Because the Bayes' filter sequentially updates the position representation, it may provide a more biologically plausible description of how position information is processed in the rat's brain. On the other hand, the non-Bayes' algorithms provide a tool for studying the spatial information content of the ensemble firing patterns in short overlapping and nonoverlapping time intervals.

The Bayes' filter is a nonlinear recursive algorithm that gives the most probable position estimate at tk given the spike trains from all the place cells and theta rhythm information up to through tk. The ML algorithm yields the most probable position given only the data in a time window ending at tk. Because this ML algorithm uses a 1 sec time window, it is not the ML algorithm that would be derived from the Bayes' filter by assuming an uninformative prior probability density. The latter ML algorithm would have a time window of 33 msec. Given the low firing rates of the place cells, an ML algorithm with a 33 msec integration window would yield position predictions that were significantly more erratic than those obtained with a 1 sec window (see Fig. 4). Theta phase information is also not likely to improve prediction accuracy of the ML algorithm, because the 1 sec integration window averages approximately eight theta cycles. In contrast, the Bayes' filter has the potential to improve the accuracy of its prediction by taking explicit account of the theta phase information. For the Bayes' filter with Delta k = 33 msec and an average theta cycle length of 125 msec, each tk falls on average in one of four different phases of the theta rhythm.

Equation 16 shows that the local linear decoding algorithm uses no information about previous position or the probability of a place cell firing to determine the position prediction. It simply weights the place cell centers by the product of the number of spikes in the time interval Delta *k and the inverse of the scale matrices. If no cell fires, there is no position prediction. Because the algorithm uses no information about the place cell firing propensities, it is expected to perform less well than either the Bayes or the ML algorithms. The MC algorithm estimates the place cell geometries empirically with occupancy-normalized histograms instead of with a parametric statistical model. The position estimate determined by this algorithm is a nonlinear function of the observed firing pattern, and the weighting scheme is determined on a pixel-by-pixel basis by the correlation between the observed firing pattern and the estimated place cell intensities. The MC algorithm is the most computationally intensive of the algorithms studied here because it requires a search at each time step over all pixels in the environment.

The Bayes' smoother derives directly from the Bayes' filter by applying the well known fixed-interval smoothing algorithm. Of the five algorithms presented, it uses the most information from the firing pattern to estimate the animal's position. However, because it uses all future and all past place cell spikes to compute each position estimate, it is the least likely to have a biological interpretation. The Bayes' smoother is helpful more as an analytic tool than as an actual decoding algorithm because it shows algebraically how current and future information are combined to make its position predictions. This algorithm makes explicit the relation between the Bayes' filter and a two-sided filter such as the acausal Wiener filter procedures of Bialek et al. (1991).

    RESULTS
Top
Abstract
Introduction
Materials & Methods
Results
Discussion
References

Encoding stage: evaluation of the Poisson model fit to the place cell firing patterns

The inhomogeneous Poisson model was successfully fit to the place cell spike trains of both animals. Twenty-six of 34 place cells for animal 1 and 24 of 33 cells for animal 2 had place fields located along the border of the environment (Fig. 1). Seven of 34 cells for animal 1 and five of the place cells for animal 2 fired preferentially in regions near the center of the environment. The remaining single cell for animal 1 and four cells for animal 2 had split fields. The split fields could be explained by these place cells having two distinct regions of maximal firing and/or errors in assigning spikes from the tetrode recordings to particular cells. For each of the three types of place field patterns the fits of the position modulation components of the Poisson model were consistent with the occupancy-normalized histograms in terms of shape and location of regions of maximum and minimum firing propensity. For animal 1 (animal 2) 31 (32) of the 34 (33) place cells had statistically significant estimates of the place parameters µc,1, µc,2, sigma c,12, and sigma c,22, and 27 (33) of the 34 (33) place cells had statistically significant estimates of alpha .


View larger version (70K):
[in this window]
[in a new window]
 
Figure 1.   Pseudocolor maps of the fits of the inhomogeneous Poisson model to the place fields of three representative place cells from animal 2. The panels show A, a field lying along the border of the environment; B, a field near the center of the environment; and C, a split field with two distinct regions of maximal firing. Most of the place cells for both animals were like that of cell A (see Encoding stage: evaluation of the Poisson model fit to the place cell firing patterns). The color bars along the right border of each panel show the color map legend in spikes per second. The spike rate near the center of cell A is 25 spikes/sec compared with 12 spikes/sec for cells B and C. Each place field has a nonzero spike rate across a sizable fraction of the circular environment.

In addition to position modulation, there was statistically significant theta phase modulation of the place cell firing patterns for both animals. For animal 1 (2) the theta phases of maximal firing were mostly between 180 and 360° (160 and 360°) with a maximum near 360° (270°). We evaluated the statistical improvement in the fit of the inhomogeneous Poisson model attributable to inclusion of the theta rhythm by likelihood ratio tests and AIC. The likelihood ratio tests showed that 25 of the 34 cells for animal 1 and 31 of the 33 cells for animal 2 had statistically significant improvements in the model fits with the inclusion of the theta component. For 28 of the 34 cells for animal 1 and 31 of the 33 place cells for animal 2, the better model by AIC included the phase of theta rhythm. The individual parameter estimates allow a direct quantitative assessment of the relative importance of position and theta phase on place cell firing. For example, in Fig. 1, place cell B had an estimated maximal firing rate of exp(alpha  + beta ) = exp(1.91 + 0.56) = 11.82 spikes/sec at coordinates x1(t) = 40.37, x2(t) = 43.63, and theta phase phi (t) = 0.61 radians. In the absence of theta phase modulation the approximate maximum firing rate would be exp(alpha ) = exp(1.91) = 6.75 spikes/sec, whereas in the absence of position modulation the maximum firing rate would be exp(beta ) = exp(0.56) = 1.75 spikes/second. With the exception of five place cells for animal 1 and one cell for animal 2, all the alpha  values were positive and in the range of 0.45-4.5. The beta values were all in a narrow range between 0.06 and 0.5 for animal 1 and 0.03 and 1.1 for animal 2. For 25 of 34 place cells for animal 1 and 32 of 33 place cells for animal 2, alpha  was larger than beta . The single place cell for animal 2 and the five of seven place cells for animal 1 for which beta  was larger than alpha  all fired <= 200 spikes during the encoding stage. The median (mean) ratio of exp(alpha ) to exp(beta ) was 2.9 (5.0) for animal 1 and 5.3 (7.5) for animal 2. Because the median is a more representative measure of central tendency in small groups of numbers (Velleman and Hoaglin, 1981), these findings suggest that position is a three to five times stronger modulator of place cell spiking activity than theta phase under the current model.

Encoding stage: fit of the random walk model to the path

For both animals there was close agreement between the variance components of the Gaussian random walk estimated from the first part of the path (encoding stage) and those estimated from the full path (Eq. 5). The estimated variance components were sigma x1 = 0.283 (0.440), sigma x2 = 0.302 (0.393), and rho  = 0.024 (0.033) from the encoding stage for animal 1 (animal 2). The estimated means were all close to zero, and the small values of the correlation coefficient rho  suggested that the x1 and x2 components of the random walk are approximately uncorrelated.

Encoding stage: assessment of model assumptions

We present here the results of our goodness-of-fit analyses of the inhomogeneous Poisson model fits to the place cell spike train data and the random walk model fits to the animals' paths. The implications of these results for our decoding analysis and overall modeling strategy are presented in the Discussion (see Encoding stage: lessons from the random walk and goodness-of-fit analyses).

Evaluation of the inhomogeneous Poisson model goodness-of-fit

In this analysis the one place cell for animal 1 and the four place cells for animal 2 with split fields were treated as separate units. The separate units for the place field were determined by inspecting the place field plot, drawing a line separating the two parts, and then assigning the spikes above the line to unit 1 and the ones below the line to unit 2. Hence, for animal 1, there are 35 = 34 + 1 place cell units, and for animal 2, there are 37 = 33 + 4 place cell units. We first assessed the Poisson model goodness-of-fit for the individual place cell units. We considered the number of recorded spikes to agree with the prediction from the Poisson model if the number recorded was within the 95% confidence interval estimated from the model. In the encoding stage, for 30 of 35 place cell units for animal 1 and for 37 of 37 units for animal 2, the number of recorded spikes agreed with the model prediction. This finding was expected because the model parameters were estimated from the encoding stage data. In the decoding stage, for only 8 of 35 place cell units for animal 1 and for 6 of 37 units for animal 2 did the number of recorded spikes agree with the model predictions. Over the full experiment, for only 7 of 35 place cells for animal 1 and for 9 of 37 place cells for animal 2 did the recorded and predicted numbers of spikes agree.

As stated in Statistical methods, for the goodness-of-fit analysis of the Poisson model on the subpaths, we computed the p value for the observed number of spikes on each subpath under the null hypothesis that the true model was Poisson and that the true model parameters were those determined in the encoding stage (Fig. 2). If the firing patterns along all the subpaths truly arose from a Poisson process then, the histogram of p values should be approximately uniform. In the encoding stage, 33% of the 893 subpaths for animal 1 and 46% of the 885 subpaths for animal 2 had p <=  0.05 (see the first bins of the histograms in Fig. 2, Encoding Stage). Similarly, in the decoding stage, 37% of the 595 subpaths for animal 1 and 43% of the 475 subpaths for animal 2 had p <=  0.05 (see the first bins of the histograms in Fig. 2, Decoding Stage). As expected, both animals in both stages had several subpaths with p >=  0.95 because the expected number of spikes on those trajectories was two or less. The large number of subpaths with small p values suggests that the place cell firing patterns of both animals were more variable than would be predicted by the Poisson model.


View larger version (18K):
[in this window]
[in a new window]
 
Figure 2.   Histograms of p values for the goodness-of-fit analyses of the inhomogeneous Poisson model on the subpaths for the encoding (left column) and decoding (right column) stages for animals 1 and 2. Each p