 |
Previous Article | Next Article 
The Journal of Neuroscience, September 15, 1998, 18(18):7411-7425
A Statistical Paradigm for Neural Spike Train Decoding Applied to
Position Prediction from Ensemble Firing Patterns of Rat Hippocampal
Place Cells
Emery N.
Brown1,
Loren
M.
Frank2,
Dengda
Tang1,
Michael C.
Quirk2, and
Matthew A.
Wilson2
1 Statistics Research Laboratory, Department of
Anesthesia and Critical Care, Harvard Medical School, Massachusetts
General Hospital, Boston, Massachusetts 02114-2698, and
2 Department of Brain and Cognitive Sciences, Massachusetts
Institute of Technology, Cambridge, Massachusetts 02139
 |
ABSTRACT |
The problem of predicting the position of a freely foraging rat
based on the ensemble firing patterns of place cells recorded from the
CA1 region of its hippocampus is used to develop a two-stage statistical paradigm for neural spike train decoding. In the first, or
encoding stage, place cell spiking activity is modeled as an inhomogeneous Poisson process whose instantaneous rate is a function of
the animal's position in space and phase of its theta rhythm. The
animal's path is modeled as a Gaussian random walk. In the second, or
decoding stage, a Bayesian statistical paradigm is used to derive a
nonlinear recursive causal filter algorithm for predicting the position
of the animal from the place cell ensemble firing patterns. The algebra
of the decoding algorithm defines an explicit map of the discrete spike
trains into the position prediction. The confidence regions for the
position predictions quantify spike train information in terms of the
most probable locations of the animal given the ensemble firing
pattern. Under our inhomogeneous Poisson model position was a three to
five times stronger modulator of the place cell spiking activity than
theta phase in an open circular environment. For animal 1 (2) the
median decoding error based on 34 (33) place cells recorded during 10 min of foraging was 8.0 (7.7) cm. Our statistical paradigm provides a
reliable approach for quantifying the spatial information in the
ensemble place cell firing patterns and defines a generally applicable
framework for studying information encoding in neural systems.
Key words:
hippocampal place cells; Bayesian statistics; information
encoding; decoding algorithm; nonlinear recursive filter; random walk; inhomogeneous Poisson process; point process.
 |
INTRODUCTION |
Neural systems encode their
representations of biological signals in the firing patterns of neuron
populations. Mathematical algorithms designed to decode these firing
patterns offer one approach to deciphering how neural systems represent
and transmit information. To illustrate, the spiking activity of CA1
place cells in the rat hippocampus correlates with both the rat's
position in its environment and the phase of the theta rhythm as the
animal performs spatial behavioral tasks (O'Keefe and Dostrovsky,
1971 ; O'Keefe and Reece, 1993 ; Skaggs et al., 1996 ). Wilson and
McNaughton (1993) used occupancy-normalized histograms to represent
place cell firing propensity as a function of a rat's position in its environment and a maximum correlation algorithm to decode the animal's
position from the firing patterns of the place cell ensemble. Related
work on population-averaging and tuning curve methods has been reported
by Georgopoulos et al. (1986) , Seung and Sompolinsky (1993) , Abbott
(1994) , Salinas and Abbott (1994) , and Snippe (1996) .
Spike train decoding has also been studied in a two-stage approach
using Bayesian statistical methods (Bialek and Zee, 1990 ; Bialek et
al., 1991 ; Warland et al., 1992 ; Sanger 1996 ; Rieke et al., 1997 ; Zhang
et al., 1998 ). The first, or encoding stage, characterizes the
probability of neural spiking activity given the biological signal,
whereas the second, or decoding stage, uses Bayes' rule to determine
the most probable value of the signal given the spiking activity. The
Bayesian approach is a general analytic framework that, unlike either
the maximum correlation or population-averaging methods, has an
associated paradigm for statistical inference (Mendel, 1995 ). To date
four practices common to the application of the Bayesian paradigm in
statistical signal processing have yet to be fully applied in decoding
analyses. These are (1) using a parametric statistical model to
represent the dependence of the spiking activity on the biological
signal and to test specific biological hypotheses; (2) deriving
formulae that define the explicit map of the discrete spike trains into the continuous signal predictions; (3) specifying confidence regions for the signal predictions derived from ensemble spike train activity; and (4) implementing the decoding algorithm recursively. Application of
these practices should yield better quantitative descriptions of how
neuron populations encode information.
For example, the estimated parameters from a statistical model would
provide succinct, interpretable representations of salient spike train
properties. As a consequence, statistical hypothesis tests can be used
to quantify the relative biological importance of model components and
to identify through goodness-of-fit analyses spike train properties the
model failed to describe. A formula describing the mapping of spike
trains into the signal would demonstrate exactly how the decoding
algorithm interprets and converts spike train information into signal
predictions. Confidence statements provide a statistical measure of
spike train information in terms of the uncertainty in the algorithm's
prediction of the signal. Under a recursive formulation, decoding would
be conducted in a causal manner consistent with the sequential way
neural systems update; the current signal prediction is computed from
the previous signal prediction plus the new information in the spike
train about the change in the signal since the previous prediction.
We use the problem of position prediction from the ensemble firing
patterns of hippocampal CA1 place cells recorded from freely foraging
rats to develop a comprehensive, two-stage statistical paradigm for
neural spike train decoding that applies the four signal processing
practices stated above. In the encoding stage we model place cell
spiking activity as an inhomogeneous Poisson process whose
instantaneous firing rate is a function of the animal's position in
the environment and phase of the theta rhythm. We model the animal's
path during foraging as a Gaussian random walk. In the decoding stage
we use Bayesian statistical theory to derive a nonlinear, recursive
causal filter algorithm for predicting the animal's position from
place cell ensemble firing patterns. We apply the paradigm to place
cell, theta phase, and position data from two rats freely foraging in
an open environment.
 |
MATERIALS AND METHODS |
Experimental methods
Two approximately 8-month-old Long-Evans rats (Charles River
Laboratories, Wilmington, MA) were implanted with microdrive arrays
housing 12 tetrodes (four wire electrodes) (Wilson and McNaughton,
1993 ) using surgical procedures in accordance with National Institutes
of Health and Massachusetts Institute of Technology guidelines.
Anesthesia was induced with ketamine 50 mg/kg, xylazine 6 mg/kg, and
ethanol 0.35 cc/kg in 0.6 cc/kg normal saline and maintained with
1-2% isoflurane delivered by mask. The skin was incised, the skull
was exposed, and six screw holes were drilled. The skull screws were
inserted to provide an anchor for the microdrive assembly. An
additional hole was drilled over the right CA1 region of the
hippocampus (coordinates, 3.5 anteroposterior, 2.75 lateral). The
dura was removed, the drive was positioned immediately above the brain
surface, the remaining space in the hole was filled with bone wax, and
dental acrylic was applied to secure the microdrive assembly holding
the tetrodes to the skull. Approximately 2 hr after recovery from
anesthesia and surgery, the tetrodes were advanced into the brain. Each
tetrode had a total diameter of ~45 µm, and the spacing between
tetrodes was 250-300 µm. The tips of the tetrodes were cut to a
blunt end and plated with gold to a final impedance of 200-300
K .
Over 7 d, the electrodes were slowly advanced to the pyramidal
cell layer of the hippocampal CA1 region. During this period the
animals were food-deprived to 85% of their free-feeding weight and
trained to forage for randomly scattered chocolate pellets in a black
cylindrical environment 70 cm in diameter with 30-cm-high walls (Muller
et al., 1987 ). Two cue cards, each with different black-and-white
patterns, were placed on opposite sides of the apparatus to give the
animals stable visual cues. Training involved exposing the animal to
the apparatus and allowing it to become comfortable and explore freely.
After a few days, the animals began to forage for chocolate and soon
moved continuously through the environment.
Once the electrodes were within the cell layer, recordings of the
animal's position, spike activity, and EEG were made during a 25 min
foraging period for animal 1 and a 23 min period for animal 2. Position
data were recorded by a tracking system that sampled the position of a
pair of infrared diode arrays on each animal's head. The arrays were
mounted on a boom attached to the animal's head stage so that from the
camera's point of view, the front diode array was slightly in front of
the animal's nose and the rear array was above the animal's neck.
Position data were sampled at 60 Hz with each diode array powered on
alternate camera frames; i.e., each diode was on for 30 frames/sec, and
only one diode was illuminated per frame. The camera sampled a 256 × 364 pixel grid, which corresponded to a rectangular view of
153.6 × 218.4 cm. The animal's position was computed as the mean
location of the two diode arrays in two adjacent camera frames. To
remove obvious motion artifact, the raw position data were smoothed
off-line with a span 30 point (1 sec) running average filter. Missing
position samples that occurred when one of the diode arrays was blocked were filled in by linear interpolation from neighboring data in the
off-line analysis.
Signals from each electrode were bandpass-filtered between 600 Hz and 6 kHz. Spike waveforms were amplified 10,000 times and sampled at 31.25 kHz/channel and saved to disk. A recording session consisted of the
foraging period bracketed by 30-40 min during which baseline spike
activity was recorded while the animal rested quietly. At the
completion of the recording session, the data were transferred to a
workstation where information about peak amplitudes and widths of the
spike waveforms on each of the four channels of the tetrode was used to
cluster the data into individual units, and assign each spike to a
single cell. For animal 1 (2), 33 (34) place cells were recorded during
its 25 (23) min foraging period and used in the place field encoding
and decoding analysis.
Continuous EEG data were taken from the same electrodes used for unit
recording. One wire from each tetrode was selected for EEG recordings,
and the signal was filtered between 1 Hz and 3 kHz, sampled at 2 kHz/channel and saved to disk. The single EEG channel showing the most
robust theta rhythm was identified and resampled at 250 Hz, and the
theta rhythm was extracted by applying a Fourier filter with a pass
band of 6-14 Hz. The phase of the theta rhythm was determined by
identifying successive peaks in the theta rhythm and assuming that
successive peaks represented a complete theta cycle from 0 and 2 .
Each point between the peaks was assigned a phase between 0 and 2
proportional to the fraction of the distance the point lay between the
two peaks (Skaggs et al., 1996 ). The theta rhythm does not have the
same phase at different sites of the hippocampus; however, the phase
difference between sites is constant. Hence, it is sufficient to model
theta phase modulation of place cell spiking activity with the EEG
signal recorded from a single site (Skaggs et al., 1996 ).
Statistical methods
The hippocampus encodes information about the position of the
animal in its environment in the firing patterns of its place cells. We
develop a statistical model to estimate the encoding process and a
statistical algorithm to decode the position of the animal in its
environment using our model estimate of the encoding process. We divide
the experiment in two parts and conduct the statistical paradigm in two
stages: the encoding and decoding stages. We define the encoding stage
as the first 15 and 13 min of spike train, path, and theta rhythm data
for animals 1 and 2, respectively, and estimate the parameters of the
inhomogeneous Poisson process model for each place cell and the random
walk model for each animal. We define the decoding stage as the last 10 min of the experiment for each animal and use the ensemble spike train
firing patterns of the place cells and random walk parameters
determined in the encoding stage to predict position.
To begin we define our notation. Let (0, T] denote the
foraging interval for a given animal and assume that within this
interval the spike times of C place cells are simultaneously
recorded. For animals 1 and 2, T = 25 and 23 min
respectively. Let tic denote the spike
recorded from cell c at time ti in
(0, T], where c = 1, ... , C, and
C is the total number of place cells. Let x(t) = [x1(t), x2(t)]' be the 2 × 1 vector denoting the animal's position at time t, and let
(t) be the phase of the theta rhythm at time t. The notation x(t)' denotes the transpose of
the vector x(t).
Encoding stage: the place cell model. Our statistical model
for the place field is defined by representing the spatial and theta
phase dependence of the place cell firing propensity as an
inhomogeneous Poisson process. An inhomogeneous Poisson process is a
Poisson process in which the rate parameter is not constant (homogeneous) but varies as a function of time and/or some other physical quantity such as space (Cressie, 1993 ). Here, the rate parameter of the inhomogeneous Poisson process is modeled as a function
of the animal's position in the environment and phase of the theta
rhythm. The position component for cell c is modeled as a
Gaussian function defined as:
|
(1)
|
where µc = [µc,1,
µc,2]' is the 2 × 1 vector whose
components are the x1 and
x2 coordinates of the place field center,
c is the location intensity parameter,
|
(2)
|
is a scale matrix whose scale parameters in the
x1 and x2 directions are
c,12 and
c,22, respectively, and
xc = [ c,
µc, Wc]. Our original
formulation of the place cell model included non-zero off-diagonal
terms of the scale matrix to allow varying spatial orientations of the
estimated place fields (Brown et al., 1996 , 1997a ). Because we found
these parameters to be statistically indistinguishable from zero in our
previous analyses, we omit them from the current model. The theta phase
component of cell c is modeled as a cosine function defined
as:
|
(3)
|
where c is a modulation factor,
c is the theta phase of maximum instantaneous
firing rate for cell c, and  c = [ c, c]. The instantaneous
firing rate function for cell c is the product of the
position component in Equation 1 and the theta rhythm component
in Equation 3 and is given as:
|
(4)
|
where c = [ xc,  c].
The maximum instantaneous firing rate of place cell c is
exp{ c + c} and occurs at
x(t) = µc and (t) = c. The instantaneous firing rate model in Equation 4 does not consider the modulation of place cell firing propensity
attributable to the interaction between position and theta phase known
as phase precession (O'Keefe and Reece, 1993 ). We assume that
individual place cells form an ensemble of conditionally independent
Poisson processes. That is, the place cells are independent given their
model parameters. In principle, it is possible to give a more detailed
formulation of ensemble place cell spiking activity that includes
possible interdependencies among cells (Ogata, 1981 ). Such a
formulation is not considered here. The inhomogeneous Poisson model
defined in Equations 1-4 was fit to the spike train data of each place
cell by maximum likelihood (Cressie, 1993 ). The importance of the theta
phase model component was assessed using likelihood ratio tests
(Cassella and Berger, 1990 ) and Akaike's Information Criterion (AIC)
(Box et al., 1994 ).
After model fitting we evaluated validity of the Poisson assumption in
two ways using the fact that a Poisson process defined on an interval
is also a Poisson process on any subinterval of the original interval
(Cressie, 1993 ). First, based on the estimated Poisson model
parameters, we computed for each place cell the 95% confidence
interval for the true number of spikes in the entire experiment, in the
encoding stage and in the decoding stage. In each case, we assessed
agreement with the Poisson model by determining whether the recorded
number of spikes was within the 95% confidence interval estimated from
the model.
Second, for each place cell we identified between 10 to 65 subpaths on
which the animal traversed the field of that cell for at least 0.5 sec.
The region of the place field we sampled was the ellipse located at the
place cell center, which contained 67% of the volume of the fitted
Gaussian function in Equation 1. This is equivalent to the area within
1 SD of the mean of a one-dimensional Gaussian probability density. The
entrance and exit times for the fields were determined using the actual
path of the animal. From the estimate of the exact Poisson probability distribution on each subpath we computed the p value to
measure how likely the observed number of spikes was under the null
hypothesis of a Poisson model. A small p value would suggest
that the data are not probable under the Poisson model, whereas a large
p value would suggest that the data are probable and, hence,
consistent with the model. If the firing pattern along the subpaths
truly arose from a Poisson process, then the histogram of p
values should be approximately uniform. A separate analysis was
performed for subpaths in the encoding and decoding stages of each
animal.
Encoding stage: the path model. We assume that the path of
the animal during the experiment may be approximated as a zero mean
two-dimensional Gaussian random walk. The random walk assumption means
that given any two positions on the path, say
x(tk 1) and
x(tk), the path increments,
x(tk) x(tk 1), form a sequence of independent, zero mean Gaussian random variables with covariance matrix:
|
(5)
|
where x12,
x22 are the variances of
x1 and x2 components of
the increments, respectively, is the correlation coefficient, and
k = tk tk 1.
These model parameters were also estimated by maximum likelihood.
Following model fitting, we evaluated the validity of the Gaussian
random walk assumption by a 2 goodness-of-fit test and
by a partial autocorrelation analysis. In the goodness-of-fit analysis,
the Gaussian assumption was tested by comparing the joint distribution
of the observed path increments with the bivariate Gaussian density
defined by the estimated model parameters. The partial autocorrelation
function is an accepted method for detecting autoregressive dependence
in time series data (Box et al., 1994 ). Like the autocorrelation
function, the partial autocorrelation function measures correlations
between time points in a time series. However, unlike the
autocorrelation function, the partial autocorrelation function at lag
k measures the correlation between points k time
units apart, correcting for correlations at lags k 1
and lower. An autoregressive model of order p will have a
nonzero partial autocorrelation function up through lag p
and a partial autocorrelation function of zero at lags p + 1 and higher. Therefore, a Gaussian random walk with independent
increments should have uncorrelated increments at all lags and, hence,
its partial autocorrelation function should be statistically
indistinguishable from zero at all lags (Box et al., 1994 ).
Decoding stage. To develop our decoding algorithm we first
explain some additional notation. Define a sequence of times in (te, T], te t0 < t1 < t2, ... , tk < tk+1, ... , < tK T, where
te is the end of the encoding stage. The
tk values are an arbitrary time sequence in the
decoding stage, which includes the spike times of all the place cells.
We define Ic(tk) as the indicator of a spike at time tk for cell
c. That is, Ic(tk)
is 1 if there is a spike at tk from cell
c and 0 otherwise. Let I(tk) = [I1(tk), ... ,
IC(tk)]' be the vector of
indicator variables for the C place cells for time
tk. The objective of the decoding stage is to
find for each tk the best prediction of
x(tk) in terms of a probability density,
given C place cells, their place field and theta rhythm
parameters, and the firing pattern of the place cell ensemble from
te up through tk. Because
the tk values are arbitrary, the prediction of
x(tk) will be defined in continuous time.
An approach suggested by signal processing theory for computing the
probability density of x(tk) given the
spikes in (te, tk] is to
perform the calculations sequentially. Under this approach Bayes' rule
is used to compute recursively the probability density of the current
position from the probability densities of the previous position and
that of the new spike train data measured since the previous position
prediction was made (Mendel, 1995 ). The recursion relation is defined
in terms of two coupled probability densities termed the posterior and
one-step prediction probability densities. For our decoding problem
these two probability densities are defined as:
Posterior probability density:
|
(6)
|
One-step prediction probability density:
|
(7)
|
Before deriving the explicit form of our decoding algorithm, we
explain the terms in Equations 6 and 7 and the logic behind them. The
first term on the right side of Equation 6,
Pr(x(tk)|spikes in
(te, tk 1]), is the
one-step prediction probability density from Equation 7. It defines the
predictions of where the animal is likely to be at time
tk given the spike train data up through time
tk 1. Equation 7 shows that the one-step
prediction probability density is computed by "averaging over" the
animal's most likely locations at time tk 1,
given the data up to time tk 1 and the most
likely set of moves it will make in tk 1 to
tk. The animal's most likely position at time tk 1, the first term of the integrand in
Equation 7, is the posterior probability density at
tk 1. The animal's most likely set of moves
from tk 1 to tk,
Pr(x(tk)|x(tk 1)), is
defined by the random walk probability model in Equation 5 and again
below in Equation 8. The formulae are recursive because Equation 7 uses
the posterior probability density at time tk 1 to generate the one-step prediction probability density at
tk, which, in turn, allows computation of
the new posterior probability at time tk given
in Equation 6. The second term on the right side of Equation 6,
Pr(spikes at tk|x(tk),
tk 1), defines the probability of a spike at
tk given the animal's position at
tk is x(tk) and
that the last observation was at time tk 1. This term is the joint probability mass function of all the spikes at
tk and is defined by the inhomogeneous Poisson
model in Equations 1-4 and below in Equation 9. Pr(spikes at
tk|spikes in (te,
tk 1]) is the integral of the numerator on the right
side of Equation 6 and defines the normalizing constant, which ensures
that the posterior probability density integrates to 1.
Under the assumption that the individual place cells are conditionally
independent Poisson processes and that the path of the rat during
foraging in an open environment is a Gaussian random walk, Equations 6
and 7 yield the following recursive neural spike train decoding
algorithm:
State equation:
|
(8)
|
Observation equation:
|
(9)
|
One-step prediction equation:
|
(10)
|
One-step prediction variance:
|
(11)
|
Posterior mode
|
(12)
|
Posterior variance:
|
(13)
|
where the notation ~N(0,
Wx( k)) denotes the Gaussian
probability density with mean 0 and covariance matrix
Wx( k),
f(I(tk)|x(tk), tk 1) is the joint probability mass function of the
spikes at time tk and
(tk|tk) denotes the
position prediction at time tk given the spike
train up through time tk. We also define:
|
(14)
|
|
(15)
|
where
c[ ( k)] is the
integral of the theta rhythm process (Eq. 3) on the interval
(tk 1, tk], and
c[x(tk|tk)] = xc[tk|x(tk),
xc] is given in Equation 1. The prediction
(tk|tk) in Equation 12 is the mode of the posterior probability density, and therefore, defines the most probable position prediction at
tk given the ensemble firing pattern of the
C place cells from te up through tk. We term
(tk|tk), the Bayes'
filter prediction and the algorithm in Equations 8-13 the Bayes'
filter algorithm. As stated above, the algorithm defines a recursion
that begins with Equation 10. Under the random walk model, given a
prediction x(tk 1|tk 1) at
tk 1, the best prediction of position at
tk, i.e., one step ahead, is the
prediction at tk 1. The error in that prediction, given in Equation 11, reflects both the uncertainty in the
prediction at tk 1, defined by
W(tk 1|tk 1), and uncertainty of
the random walk in (tk 1, tk],
defined by Wx( k). Once the
spikes at tk are recorded, the position
prediction at tk is updated to incorporate this
new information (Eq. 12). The uncertainty in this posterior prediction
is given by Equation 13. The algorithm then returns to Equation 10 to
begin the computations for tk+1. The derivation
of the Bayes' filter algorithm follows the arguments used in the
maximum aposteriori estimate derivation of the Kalman filter
(Mendel, 1995 ) and is outlined in . If the posterior
probability density of x(tk) is
approximately symmetric, then
(tk|tk) is also both
its mean and median. In this case, the Bayes' filter is an
approximately optimal filter in both a mean square and an absolute
error sense. Equation 12 is a nonlinear function of
x(tk|tk) that is solved
iteratively using a Newton's procedure. The previous position
prediction at each step serves as the starting value. Using Equation 13
and a Gaussian approximation to the posterior probability density of x(tk) (Tanner, 1993 ), an approximate 95%
confidence (highest posterior probability density) region for
x(tk) can be defined by the ellipse:
|
(16)
|
where 6 is the 0.95th quantile of the 2
distribution with 2 df.
Interpretation of the Bayes' filter algorithm. The Bayes'
filter algorithm has a useful analytic interpretation. Equation 12
shows explicitly how the discrete spike times,
Ic(tk) values, are mapped
into a continuous position prediction
(tk|tk). This equation shows that the current position prediction,
(tk|tk), is a
weighted average of the one-step position prediction,
(tk|tk 1), and the place
cell centers. The weight on the one-step prediction is the inverse of
the one-step prediction covariance matrix (Eq. 11). If the one-step
prediction error is high, the one-step prediction receives less weight,
whereas if the one-step prediction error is small, the one-step
prediction receives more weight. The weight on the one-step prediction
also decreases as k increases (Eq. 11).
The weight on each place cell's center is determined by the product of
a dynamic or data-dependent component attributable to
Ac in Equation 14 and a fixed component
attributable to the inverse of the scale matrices, the
Wc values, in Equation 2. For small
k, it follows from the definition of
the instantaneous rate function of a Poisson process that
Ac may be reexpressed as:
|
(17)
|
Eq. 17 shows that Ac is equal to either 0 or 1 minus the probability of a spike from cell c at
tk given the position at
tk and the modulation of the theta rhythm in
k. Thus, for small k, Ac gives a weight in
the interval ( 1, 1). A large positive weight is obtained if a spike
is observed when a place cell has a low probability of a spike at
tk given its geometry and the current phase of
the theta rhythm. This is a rare event. A large negative weight is
obtained if no spike is observed when a cell has a high probability of
firing. This is also a rare event. Equation 12 shows that even when no
cell fires the algorithm still provides information about the animal's
most probable position. For example, if no place cell fires at
tk, then all the place cell means receive negative weights, and the algorithm interprets the new information in
the firing pattern as suggesting where the animal is not likely to be.
The inverse of the scale matrices are the fixed components of the
weights on the place cell means and reflect the geometry of the place
fields. Place cells whose scale matrices have small scale
factors highly precise fields will be weighted more in the new
position prediction. Conversely, place cells with large scale factors diffuse place fields will be weighted less. Viewed as a
function of c and tk,
Ac defines for cell c at time
tk the point process equivalent of the
innovations in the standard Kalman filter algorithm (Mendel, 1995 ).
At each step the Bayes' filter algorithm provides two estimates of
position and for each an associated estimate of uncertainty. The
one-step position prediction and error estimates are computed before
observing the spikes at tk, whereas the
posterior position prediction and error estimates are computed after
observing the spikes at tk. Because the
tk values are arbitrary, the Bayes' filter
provides predictions of the animal's position in continuous time. The
recursive formulation of this algorithm ensures that all spikes in
(te, tk] are used to
compute the prediction
(tk|tk). The
Newton's method of implementation of the algorithm shows the expected
quadratic convergence in two to four steps when the previous position
is the initial guess for predicting the new position. Because the
previous position prediction is a good initial guess, and the distance
between the initial guess and the final new position prediction is
small, a fast, linear version of Equation 12 can be derived by taking
only the first Newton's step of the procedure. This is equivalent to
replacing (tk|tk) on
the right side of Equation 12 with
(tk|tk 1).
The representation of our decoding algorithm in Equations 8-13 shows
the relation of our methods to the well known Kalman filter (Mendel,
1995 ). Although the equations appear similar to those of the standard
Kalman filter, there are important differences. Both the observation
and the state equations in the standard Kalman filter are continuous
linear functions of the state variable. In the current problem, the
state equation is a continuous function of the state variable, the
animal's position. However, the observation process, the neural spike
trains, is a multivariate point process and a nonlinear function of the
state variable. Our algorithm provides a solution to the problem of
estimating a continuous state variable when the observation process is
a point process.
Bayes' smoother algorithm. The acausal decoding algorithms
of Bialek and colleagues (1991) are derived in the frequency domain using Wiener kernel methods. These acausal algorithms give an estimate
of x(tk|T) rather than
x(tk|tk) because they use
all spikes observed during the decoding stage of the experiment to estimate the signal at each tk. To compare our
algorithm directly with the acausal Wiener kernel methods, we computed
the corresponding estimate of x(tk|T) in our
paradigm. The estimates of x(tk|T) and
W(tk|T) can be computed directly from
(tk|tk),
(tk|tk 1), W(tk|tk), and
W(tk|tk 1) by the following
linear algorithm:
|
(18)
|
|
(19)
|
|
(20)
|
where the initial conditions are (T|T) and
W(T|T) obtained from the last step of the Bayes' filter.
Equations 18-20 are the well known fixed- interval smoothing algorithm
(Mendel, 1995 ). To distinguish (tk|T)
from (tk|tk), we
term the former Bayes' smoother prediction.
Non-Bayes decoding algorithms. Linear and maximum likelihood
(ML) decoding algorithms can be derived as special cases of Equation 12. These are:
|
(21)
|
and
|
(22)
|
where *k is the 1 sec interval
ending at tk,
nc( *k) is the number of
spikes from cell c in
*k, and
A*c is Ac in
Equation 14 with Ic(tk)
replaced by
nc( *k). The term
A*c has approximately the same
interpretation as Ac in Equation 14. The
derivation of these algorithms is also explained in .
For comparison with the findings of Wilson and McNaughton (1993) , we
also decoded using their maximum correlation (MC) method. This
algorithm is defined as follows. Let ijc
denote the value of the occupancy- normalized histogram of spikes from
cell c on pixel ij. The MC prediction at
tk is the pixel that has the largest correlation
with the observed firing pattern of the place cells in
*k. It is defined as:
|
(23)
|
where ij is the average
firing rate over the C cells at pixel location
ij, and
is the average of the spike counts over the C place cells in
*k.
Implementation of the decoding algorithms. Position decoding
was performed using the Bayes' filter, ML, linear, MC, and Bayes' smoother algorithms. Decoding with the Bayes' filter was performed with and without the theta rhythm component of the model. With the
exception of the MC algorithm, position predictions were determined in
all decoding analyses at 33 msec intervals, the frame rate of the
tracking camera. For the MC algorithm the decoding was performed in 1 sec nonoverlapping intervals. The ML prediction at
tk was computed from the spikes in
*k, the 1 sec time window
ending at tk. To carry out the ML decoding at
the frame rate of the camera and to give a fair comparison with the
Bayes' procedures, this time window was shifted along the spike trains
every 33 msec for each ML prediction. Hence, there was a 967 msec
overlap in the time window used for adjacent ML predictions. The same 1 sec time window and 33 msec time shift were used to compute the linear
decoding predictions. We tested time windows of 0.25, 0.5, and 1 sec
and chose the latter because the low spike counts for the place cells
gave very unstable position predictions for the shorter time intervals
even when the intervals were allowed to overlap. For integration time
windows longer than 1 sec, the assumption that the animal remained in
the same position for the entire time window was less valid. Zhang et
al. (1998) found a 1 sec time window to be optimal for their Bayes'
procedures.
Relationship among the decoding algorithms. The Bayes'
filter and the non-Bayes' algorithms represent distinct approaches to
studying neural computation. Under the Bayes' filter, an estimate of a
behavioral state variable, e.g., position at a given time, is computed
from the ensemble firing pattern of the CA1 place cells and stored
along with an error estimate. The next estimate is computed using the
previous estimate, and the information in the firing patterns about how
the state variable has changed since the last estimate was computed.
For the non-Bayes' algorithms the computational logic is different.
The position estimate is computed from the place cell firing patterns
during a short time window. The time window is then shifted 33 msec and
the position representation is recomputed. The Bayes' filter relies
both on prior and new information, whereas the non-Bayes' algorithms
use only current information. Because the Bayes' filter sequentially updates the position representation, it may provide a more biologically plausible description of how position information is processed in the
rat's brain. On the other hand, the non-Bayes' algorithms provide a
tool for studying the spatial information content of the ensemble
firing patterns in short overlapping and nonoverlapping time
intervals.
The Bayes' filter is a nonlinear recursive algorithm that gives the
most probable position estimate at tk given the
spike trains from all the place cells and theta rhythm information up to through tk. The ML algorithm yields the most
probable position given only the data in a time window ending at
tk. Because this ML algorithm uses a 1 sec time
window, it is not the ML algorithm that would be derived from the
Bayes' filter by assuming an uninformative prior probability density.
The latter ML algorithm would have a time window of 33 msec. Given the
low firing rates of the place cells, an ML algorithm with a 33 msec
integration window would yield position predictions that were
significantly more erratic than those obtained with a 1 sec window (see
Fig. 4). Theta phase information is also not likely to improve
prediction accuracy of the ML algorithm, because the 1 sec integration
window averages approximately eight theta cycles. In contrast, the
Bayes' filter has the potential to improve the accuracy of its
prediction by taking explicit account of the theta phase information.
For the Bayes' filter with k = 33 msec and
an average theta cycle length of 125 msec, each
tk falls on average in one of four different phases of the theta rhythm.
Equation 16 shows that the local linear decoding algorithm uses no
information about previous position or the probability of a place cell
firing to determine the position prediction. It simply weights the
place cell centers by the product of the number of spikes in the time
interval *k and the inverse of the
scale matrices. If no cell fires, there is no position prediction.
Because the algorithm uses no information about the place cell firing
propensities, it is expected to perform less well than either the Bayes
or the ML algorithms. The MC algorithm estimates the place cell
geometries empirically with occupancy-normalized histograms instead of
with a parametric statistical model. The position estimate determined
by this algorithm is a nonlinear function of the observed firing
pattern, and the weighting scheme is determined on a pixel-by-pixel
basis by the correlation between the observed firing pattern and the
estimated place cell intensities. The MC algorithm is the most
computationally intensive of the algorithms studied here because it
requires a search at each time step over all pixels in the
environment.
The Bayes' smoother derives directly from the Bayes' filter by
applying the well known fixed-interval smoothing algorithm. Of the five
algorithms presented, it uses the most information from the firing
pattern to estimate the animal's position. However, because it uses
all future and all past place cell spikes to compute each position
estimate, it is the least likely to have a biological interpretation.
The Bayes' smoother is helpful more as an analytic tool than as an
actual decoding algorithm because it shows algebraically how current
and future information are combined to make its position predictions.
This algorithm makes explicit the relation between the Bayes' filter
and a two-sided filter such as the acausal Wiener filter procedures of
Bialek et al. (1991) .
 |
RESULTS |
Encoding stage: evaluation of the Poisson model fit to the place
cell firing patterns
The inhomogeneous Poisson model was successfully fit to the place
cell spike trains of both animals. Twenty-six of 34 place cells for
animal 1 and 24 of 33 cells for animal 2 had place fields located along
the border of the environment (Fig. 1).
Seven of 34 cells for animal 1 and five of the place cells for animal 2 fired preferentially in regions near the center of the environment. The
remaining single cell for animal 1 and four cells for animal 2 had
split fields. The split fields could be explained by these place cells
having two distinct regions of maximal firing and/or errors in
assigning spikes from the tetrode recordings to particular cells. For
each of the three types of place field patterns the fits of the
position modulation components of the Poisson model were consistent
with the occupancy-normalized histograms in terms of shape and location
of regions of maximum and minimum firing propensity. For animal 1 (animal 2) 31 (32) of the 34 (33) place cells had statistically
significant estimates of the place parameters µc,1, µc,2,
c,12, and
c,22, and 27 (33) of the 34 (33) place
cells had statistically significant estimates of .

View larger version (70K):
[in this window]
[in a new window]
|
Figure 1.
Pseudocolor maps of the fits of the inhomogeneous
Poisson model to the place fields of three representative place cells
from animal 2. The panels show A, a field lying along the
border of the environment; B, a field near the center of the
environment; and C, a split field with two distinct regions
of maximal firing. Most of the place cells for both animals were like
that of cell A (see Encoding stage: evaluation of the
Poisson model fit to the place cell firing patterns). The color
bars along the right border of each panel show the
color map legend in spikes per second. The spike rate near the center
of cell A is 25 spikes/sec compared with 12 spikes/sec for
cells B and C. Each place field has a nonzero
spike rate across a sizable fraction of the circular environment.
|
|
In addition to position modulation, there was statistically significant
theta phase modulation of the place cell firing patterns for both
animals. For animal 1 (2) the theta phases of maximal firing were
mostly between 180 and 360° (160 and 360°) with a maximum near
360° (270°). We evaluated the statistical improvement in the fit of
the inhomogeneous Poisson model attributable to inclusion of the theta
rhythm by likelihood ratio tests and AIC. The likelihood ratio tests
showed that 25 of the 34 cells for animal 1 and 31 of the 33 cells for
animal 2 had statistically significant improvements in the model fits
with the inclusion of the theta component. For 28 of the 34 cells for
animal 1 and 31 of the 33 place cells for animal 2, the better model by
AIC included the phase of theta rhythm. The individual parameter
estimates allow a direct quantitative assessment of the relative
importance of position and theta phase on place cell firing. For
example, in Fig. 1, place cell B had an estimated maximal firing rate
of exp( + ) = exp(1.91 + 0.56) = 11.82 spikes/sec at
coordinates x1(t) = 40.37, x2(t) = 43.63, and theta phase (t) = 0.61 radians. In the
absence of theta phase modulation the approximate maximum firing rate
would be exp( ) = exp(1.91) = 6.75 spikes/sec, whereas in the absence
of position modulation the maximum firing rate would be exp( ) = exp(0.56) = 1.75 spikes/second. With the exception of five place cells
for animal 1 and one cell for animal 2, all the values were
positive and in the range of 0.45-4.5. The values were all in a
narrow range between 0.06 and 0.5 for animal 1 and 0.03 and 1.1 for
animal 2. For 25 of 34 place cells for animal 1 and 32 of 33 place
cells for animal 2, was larger than . The single place cell for
animal 2 and the five of seven place cells for animal 1 for which was larger than all fired 200 spikes during the encoding stage.
The median (mean) ratio of exp( ) to exp( ) was 2.9 (5.0) for
animal 1 and 5.3 (7.5) for animal 2. Because the median is a more
representative measure of central tendency in small groups of numbers
(Velleman and Hoaglin, 1981 ), these findings suggest that position is a
three to five times stronger modulator of place cell spiking activity
than theta phase under the current model.
Encoding stage: fit of the random walk model to the path
For both animals there was close agreement between the variance
components of the Gaussian random walk estimated from the first part of
the path (encoding stage) and those estimated from the full path (Eq. 5). The estimated variance components were x1 = 0.283 (0.440),
x2 = 0.302 (0.393), and = 0.024 (0.033) from the encoding stage for animal 1 (animal 2). The estimated
means were all close to zero, and the small values of the correlation
coefficient suggested that the x1 and
x2 components of the random walk are
approximately uncorrelated.
Encoding stage: assessment of model assumptions
We present here the results of our goodness-of-fit analyses of the
inhomogeneous Poisson model fits to the place cell spike train data and
the random walk model fits to the animals' paths. The implications of
these results for our decoding analysis and overall modeling strategy
are presented in the Discussion (see Encoding stage: lessons from the
random walk and goodness-of-fit analyses).
Evaluation of the inhomogeneous Poisson model goodness-of-fit
In this analysis the one place cell for animal 1 and the four
place cells for animal 2 with split fields were treated as separate units. The separate units for the place field were determined by
inspecting the place field plot, drawing a line separating the two
parts, and then assigning the spikes above the line to unit 1 and the
ones below the line to unit 2. Hence, for animal 1, there are 35 = 34 + 1 place cell units, and for animal 2, there are 37 = 33 + 4 place cell units. We first assessed the Poisson model
goodness-of-fit for the individual place cell units. We considered the
number of recorded spikes to agree with the prediction from the Poisson
model if the number recorded was within the 95% confidence interval
estimated from the model. In the encoding stage, for 30 of 35 place
cell units for animal 1 and for 37 of 37 units for animal 2, the number
of recorded spikes agreed with the model prediction. This finding was
expected because the model parameters were estimated from the encoding
stage data. In the decoding stage, for only 8 of 35 place cell units
for animal 1 and for 6 of 37 units for animal 2 did the number of
recorded spikes agree with the model predictions. Over the full
experiment, for only 7 of 35 place cells for animal 1 and for 9 of 37 place cells for animal 2 did the recorded and predicted numbers of
spikes agree.
As stated in Statistical methods, for the goodness-of-fit analysis of
the Poisson model on the subpaths, we computed the p value
for the observed number of spikes on each subpath under the null
hypothesis that the true model was Poisson and that the true model
parameters were those determined in the encoding stage (Fig.
2). If the firing patterns along all the
subpaths truly arose from a Poisson process then, the histogram of
p values should be approximately uniform. In the encoding
stage, 33% of the 893 subpaths for animal 1 and 46% of the 885 subpaths for animal 2 had p 0.05 (see the first bins
of the histograms in Fig. 2, Encoding Stage). Similarly, in
the decoding stage, 37% of the 595 subpaths for animal 1 and 43% of
the 475 subpaths for animal 2 had p 0.05 (see the
first bins of the histograms in Fig. 2, Decoding Stage). As
expected, both animals in both stages had several subpaths with
p 0.95 because the expected number of spikes on
those trajectories was two or less. The large number of subpaths with
small p values suggests that the place cell firing patterns
of both animals were more variable than would be predicted by the
Poisson model.

View larger version (18K):
[in this window]
[in a new window]
|
Figure 2.
Histograms of p values for the
goodness-of-fit analyses of the inhomogeneous Poisson model on the
subpaths for the encoding (left column) and decoding
(right column) stages for animals 1 and 2. Each p |
|
|