The Journal of Neuroscience, August 20, 2003, 23(20):7630-7641
Previous Article | Next Article 
Neural Noise and Movement-Related Codes in the Macaque Supplementary Motor Area
Bruno B. Averbeck and
Daeyeol Lee
Department of Brain and Cognitive Sciences and Center for Visual Science,
University of Rochester, Rochester, New York 14627
 |
Abstract
|
|---|
We analyzed the variability of spike counts and the coding capacity of
simultaneously recorded pairs of neurons in the macaque supplementary motor
area (SMA). We analyzed the mean-variance functions for single neurons, as
well as signal and noise correlations between pairs of neurons. All three
statistics showed a strong dependence on the bin width chosen for analysis.
Changes in the correlation structure of single neuron spike trains over
different bin sizes affected the mean-variance function, and signal and noise
correlations between pairs of neurons were much smaller at small bin widths,
increasing monotonically with the width of the bin. Analyses in the frequency
domain showed that the noise between pairs of neurons, on average, was most
strongly correlated at low frequencies, which explained the increase in noise
correlation with increasing bin width.
The coding performance was analyzed to determine whether the temporal
precision of spike arrival times and the interactions within and between
neurons could improve the prediction of the upcoming movement. We found that
in
62% of neuron pairs, the arrival times of spikes at a resolution
between 66 and 40 msec carried more information than spike counts in a 200
msec bin. In addition, in 19% of neuron pairs, inclusion of within (11%)- or
between-neuron (8%) correlations in spike trains improved decoding accuracy.
These results suggest that in some SMA neurons elements of the spatiotemporal
pattern of activity may be relevant for neural coding.
Key words: spike count variability; correlated noise; monkey; decoding; temporal code; rate code
 |
Introduction
|
|---|
Behavioral performance is constrained by the information processing
capacity of the nervous system. For many tasks, this processing capacity can
be understood by studying the coding properties of cortical neural networks. A
first step in the study of these networks is to understand the way in which
individual neurons or ensembles of neurons encode information. The number of
possible codes that neurons may be using, independently or as ensembles, is
immense (Perkel and Bullock,
1969
), and therefore a systematic approach to the investigation of
putative neural codes is important. One can approach this question by
assessing whether patterns in neural responses within or between neurons are
correlated with behavioral variables. After having identified a potential
code, one can then ask whether the code is behaviorally relevant. The total
information in the code can be considered an upper limit with respect to
behavior, because the networks mediating the behavior may extract only a
portion of the information available.
The question of which parameters of the neuronal signal carry information
has an extensive theoretical and empirical history. Early theoretical papers
often considered the limit of the information content of various neural codes
(MacKay and McCulloch, 1952
;
Barlow, 1963
), whereas early
empirical work by Mountcastle and colleagues
(Mountcastle et al., 1968
;
Talbot et al., 1968
) initiated
the study of the neuronal basis of psychophysical performance and explicitly
explored the question of which parameter of the neural signal accounted for
sensory detection thresholds (Mountcastle et al.,
1968
,
1990
). Other empirical studies
have estimated the amount of information that neurons can carry about stimulus
or behavioral parameters (Heggelund and
Albus, 1978
; Parker and
Hawken, 1985
; Bradley et al.,
1987
; Hernandez et al.,
2000
) and compared the coding performance of single neurons with
the psychophysical performance of behaving animals
(Newsome et al., 1989
;
Vogels and Orban, 1990
).
Our main goal in this paper was to explore the information content of
several related neural codes, as well as the statistical structure of the
neural signal. The mean-variance function as well as the correlation in the
signal and the correlation in the noise were found to depend strongly on the
bin width chosen for analysis. The results of the decoding analyses suggest
that information about upcoming movements is coded in the arrival times of
spikes on a time scale as small as
40 msec and that using some form of
correlation between pairs of neurons or between different time points within a
given neuron improved the prediction of the movement for 19% of the pairs of
neurons in our sample.
 |
Materials and Methods
|
|---|
Neurons analyzed in the present study were recorded from the left caudal
supplementary motor area (SMA-proper or F3) in two rhesus macaques performing
a series of visually guided reaching movements. The detailed methods of
recording as well as the behavioral task have been published previously
(Lee and Quessy, 2003
). All of
the procedures used in this study were approved by the University of Rochester
Committee on Animal Research and conformed to the principles outlined in the
NIH Guide for the Care and Use of Laboratory Animals (publication no.
85-23, revised 1985).
Behavioral task
Two animals were trained on a serial reaction time (SRT) task. They sat
facing a computer monitor on which a series of targets was presented. There
were 16 possible target locations defined by a 4 x 4 grid. A touch
screen placed horizontally in front of the animal was used for behavioral
input. The animals indicated acquisition of each target by contacting the
corresponding location on the touch screen. Each subsequent target in the
sequence appeared 250 msec after the previous target had been acquired. A
trial consisted of a sequence of 10 target acquisitions. If the 10 targets
were acquired successfully, a juice reward was given. Within the task, four
different types of sequences were presented
(Lee and Quessy, 2003
). In the
random condition, the sequence of target locations was selected pseudorandomly
for every trial. In the primary condition, the monkey executed a repeating
sequencing of three targets (i.e., a single trial was three repeats of the
three target sequence), with the first target of the sequence repeated at the
end of the sequence, (for example, ABCABCABCA). In the secondary condition,
the monkey executed a different repeating sequence of three targets. In the
final condition, the monkey began executing the primary sequence and then
switched to the secondary sequence from seventh target onward. New primary and
secondary sequences were selected pseudorandomly for each day's session. A
block of trials consisted of five sequences from the primary condition, and
one sequence from each of the remaining conditions. Trial types were presented
in a randomized block design. In this paper, we analyzed only the data from
the primary condition, because trials in this condition provided a large
amount of data with consistent visual stimuli and behavioral responses.
Data analysis
Analysis of neuronal variability. The data for each behavioral
trial were split into epochs corresponding to each of the 10 movements, 1 for
each target in the movement. Data from the first movement were not considered
because they followed the intertrial interval and varied from trial to trial.
For the analysis of variability in the neural activity, spikes occurring
during a 600 msec interval from 300 msec before to 300 msec after target
presentation were binned using bin sizes of 5, 10, 20, 25, 33, 40, 50, 66,
100, and 200 msec.
We calculated the mean and variance of the neural activity for each neuron
in different time bins relative to stimulus onset. We also calculated the
correlation in the mean response and the correlation in the residual response
between pairs of neurons. The correlation in the mean response, or signal
correlation (Gawne and Richmond,
1993
; Lee et al.,
1998
), was calculated by first concatenating the poststimulus time
histogram (PSTH) for each movement for each pair of neurons. This resulted in
a vector with 3n elements for each neuron, where n was the
number of bins into which the 600 msec epoch was divided, and there were three
different movements in the primary condition. Correlations were calculated
between these vectors. Correlation in the residual response, or noise
correlation, was calculated by first subtracting the mean response from each
trial, giving the residual response. The correlation in these vectors between
neurons was calculated separately for each movement as an estimate of the
correlation in the noise (Gawne and
Richmond, 1993
; Zohary et al.,
1994
; Lee et al.,
1998
).
We also performed three analyses in the frequency domain. Because a large
quantity of data were available, no smoothing in the frequency domain was
necessary (Jarvis and Mitra,
2001
). Also, the rectangular window was used in the time domain,
because it has the smallest main lobe and therefore gives the best frequency
resolution, although at the expense of larger side lobes
(Oppenheim and Schafer, 1989
).
Using other windowing functions would lead to a broadening of the peaks in the
power and coherence plots. All frequency domain values presented in this paper
were calculated across the 600 msec window beginning 300 msec before target
onset, at a 1 msec resolution. Analyses were implemented in C ++,
using compiled versions of the fft and cohere functions from Matlab (The
Mathworks, Inc., Natick, MA).
Estimates of the population periodogram of the mean response (signal) were
calculated by averaging across the individual periodograms of each neuron. The
periodogram, Pxx(k), of the mean
response for each neuron was calculated by taking the fast Fourier transform
(FFT) of the PSTH and normalizing by the length of the signal
(Papoulis, 1991
):
 | (1) |
where X(k) is the FFT of the signal, in this case the PSTH, and
N is its length, equal to 600. We also estimated the periodogram of
the noise, by calculating the periodogram of the residual of each trial, with
the residual calculated as defined above, and then averaging across trials for
individual neurons and finally across neurons. In the final frequency domain
analysis, we analyzed the coherence between residuals of neuron pairs. The
coherence is defined as:
 | (2) |
where Pxy(k) is the cross spectrum
(Papoulis, 1991
). Each trial
was treated as a data segment. These estimates were also averaged across the
entire population to produce the population coherence plots. Because the
rectangular window used in the time domain can result in power bleeding
between frequencies (Oppenheim and
Schafer, 1989
), we also examined periodograms and coherence
functions calculated with the mean removed. There was little difference in the
non-DC components, so we show the plots with the DC information intact,
because it is informative. In Results, we will make comparisons between the
frequency domain analyses and the binned analyses in the time domain. Because
calculating histograms in the time domain leads to aliasing in the frequency
domain, the time domain analyses could have been performed by first low-pass
filtering the spike trains and then subsampling at the corresponding bin
width; however, binning is a much more common practice in the analysis of
neurophysiological data. Furthermore, we performed many of the decoding and
noise analyses by filtering and then subsampling, and the main results of the
paper were not changed. Therefore, we present the results from the binning
analyses to make comparisons between studies easier.
Decoding analyses. After we examined the statistical structure of
the neural activity, we developed decoding algorithms that used the neural
activity of pairs of simultaneously recorded neurons to estimate the target to
which the monkey reached. We performed the decoding analyses using a 200 msec
window that began at target onset. We restricted our decoding analyses to this
window so that we could explore relatively small bin sizes without generating
too many degrees of freedom in our model.
We will discuss our analysis in terms of a Bayesian framework
(Oram et al., 1998
;
Zhang et al., 1998
). Within
our task, however, the prior probability of each movement was the same, and
therefore Bayesian and maximum likelihood decoding frameworks are equivalent.
In the decoding analysis, the target (or corresponding movement) was predicted
by selecting the target with the maximum probability over the joint
distribution of neural activity and possible targets. This can be formalized
as:
 | (3) |
where
is the estimated target
for the subsequent movement, and
p(
|n1,n2) is the
conditional distribution of
given the response of two neurons across
several bins. The conditional probability of
is given by Bayes rule:
 | (4) |
where p(
) is the prior probability of a given target,
n is the vector of neural responses, and p(n) is a
normalizing constant, calculated as:
 | (5) |
In pilot studies we explored both Poisson and Gaussian distribution for the
data. We will only discuss results from the Gaussian distribution, because it
generally provided better decoding performance. The multivariate Gaussian
distribution is given by:
 | (6) |
where n is a vector of spike counts for each bin and each neuron for
a given trial, r is the corresponding vector of mean spike counts for
a given target,
is the covariance matrix of the spike counts for each
bin and each neuron, || indicates the determinant of the matrix, and
d is the dimensionality of the firing rate vector being considered.
In the analyses, we manipulated r by changing the number of bins into
which the 200 msec epoch was divided. Thus, r was always a vector of
the response across two neurons, but in the case of 50 msec bins, r
had eight elements, four for each bin for each neuron, whereas in the case of
200 msec bins, r had only two elements. The covariance matrix was
calculated accordingly. A separate version of Equation 6 was estimated for
each target. Thus the decoding procedure was as follows. For a given neural
response, n, the probability of each target was assessed using
Equation 6, with the mean, r, and covariance,
, which
corresponded to each target. The target with the maximum probability was then
selected as the estimate.
Some neurons failed to fire spikes in one or more of their response bins.
This resulted in a column of zeros in the data matrix, which leads to a
noninvertible covariance matrix. We corrected the problem by eliminating those
columns from the data matrix and still treating the model as if it had all of
its parameters.
We assessed the ability of specific interaction terms to improve the
prediction of the subsequent target. We restricted our analyses to a set of
specific hypotheses. This was done by setting all or a subset of the
off-diagonal terms of the covariance matrix,
, in Equation 6 to zero,
and, in the case of the variance equals mean (VEM) model, by restricting the
diagonal terms of the matrix to be equal to the mean. A matrix with all but
the diagonal matrix entries set to zero will be referred to as a diagonal
matrix. For the main analyses, only one covariance matrix was estimated for
each pair of neurons, for all targets. The VEM model had a diagonal covariance
matrix, with the diagonal elements set equal to the mean, similar to a Poisson
distribution. The "independent" model had a diagonal covariance
matrix, with each variance along the diagonal estimated from the data. The
"between" model included off-diagonal elements corresponding to
interactions between identical time bins between the neurons in the pair. The
"within" model included the off-diagonal elements of the
covariance matrix corresponding to interactions between adjacent time bins of
the neural spike train. Finally, the "full" model included all
off-diagonal elements of the covariance matrix. We also examined models that
had separate covariance matrices for each movement direction. We did this by
first selecting the best model from among the models with a pooled covariance
matrix and then testing whether a model that had a separate covariance matrix
for each movement but the same off-diagonal elements set to zero in each of
the separate matrices performed better. For example, if the model selection
procedure chose the between model as best, we compared the between model that
had a single covariance matrix calculated from the neural response pooled
across movements with a between model that had a separate covariance matrix
for each movement.
Model selection. We used two techniques to decide which of the
various models described above provided the best explanation of the data:
Akaike's information criterion (AIC) and K-fold cross validation
(CV). AIC uses the likelihood of the model parameters, conditioned on the data
and the model to discriminate between models. The likelihood f(X,
Pm) is the product of the likelihood of all data points, as
predicted by the model,
 | (7) |
where X is the data set, with K samples, the likelihood
p(
|n...) is the probability of sample k,
i.e., the probability of the actual target for a given trial, and
Pm indicates that the likelihood is calculated using the
maximum likelihood parameters. AIC is then calculated as follows:
 | (8) |
where m is the number of free parameters of the model under
consideration. The model within the family of models that had the minimum AIC
was selected as the best model.
We also used K-fold CV (Efron
and Tibshirani, 1998
) to assess model performance. CV has been
shown to be asymptotically equivalent to AIC
(Stone, 1977
); however, the
small sample properties of the two approaches are less well understood. In CV,
the data are split repeatedly into two non-overlapping sets. The model is
estimated with one set, and its performance is assessed on the other. In our
implementation, the data were split 10 times (we also tried splits of 3, 5,
and 20, but the results were similar). For each split, 1 of every 10 trials
was placed in the test data set, and the other 9 trials were placed in the
data set used for estimating the model. The model was then estimated, and its
performance was evaluated using the test data set. This was repeated 10 times,
such that all trials were included in one of the test data sets. Results were
compiled across the 10 runs. The model with the highest percentage correct
performance was selected as the best model.
Finally, the finite impulse response high-pass filter, discussed near the
end of Results, had an order of 2000. The high filter order was necessary to
confine the stop band to a rather small set of frequencies. The cutoff
frequency was 0.5 Hz, and the filter had an almost linear roll-off from 1 Hz
to DC, achieving an amplitude response of 0.15 at DC. Thus low frequencies
were strongly suppressed.
 |
Results
|
|---|
Database
The analyses were performed on 19 ensembles of simultaneously recorded
neurons containing a total of 90 single neurons from the SMA of two monkeys
(12 ensembles from monkey 1, 7 ensembles from monkey 2). The distribution of
the number of neurons in each ensemble is shown in
Figure 1. A total of 193 pairs
of neurons were available for the analyses. All ensembles were recorded for at
least 152 trials in the task condition that we analyzed (average number of
trials per ensemble = 267). The SRT task made it possible to collect a large
number of examples of each movement, because the animal repeated each movement
three times in every trial. Therefore at least 456 movements were available
for each of the 3 movements, which resulted in at least 1368 total movements
(average number of movements = 2402).
Figure 2 shows 249 trials, of
each of the three movements, for an example neuron. The rasters are aligned to
target onset. The neural response to movements one and two shows a reduction
in activity before movement onset, which may be a return to baseline or an
inhibition. For the noise analyses, we analyzed the neural activity from 300
msec before target onset to 300 msec after target onset. For the decoding
analyses we restricted the time window to the period from 0 to 200 msec after
target onset. The time window was divided into bins of 5, 10, 20, 25, 33, 40,
50, 66, 100, and 200 msec, which approximately divided the epoch into an
integer number of bins. Individual time bins at a specific time relative to
target onset from a single neuron for a single movement were considered as
separate random variables.

View larger version (54K):
[in this window]
[in a new window]
|
Figure 2. Single-cell raster for 249 trials of each of the three movements. These
example movements were the first three of each behavioral trial. Rasters are
aligned to target onset (0 msec).
|
|
Noise analyses
We began by exploring the variance of the spike count in a bin, as a
function of the mean, for different bin sizes. In
Figure 3, the mean spike count
is plotted against the variance of the spike count for several different bin
sizes for the entire population. The minimum variance possible, given that
spikes are discrete events, is given by f(1-f)
(de Ruyter van Steveninck et al.,
1997
), where f is the fraction of the firing rate over
the largest integer smaller than the firing rate. This lower bound is
indicated by the "scalloped" lines along the bottom of the plots
in Figure 3. A line with a
slope of 1 is also plotted. For the small and intermediate bin sizes it can be
seen that some of the data fell near the line given by the minimum obtainable
variance. For large bin sizes, the data were scattered broadly around the line
indicating a linear relation between mean and variance.

View larger version (23K):
[in this window]
[in a new window]
|
Figure 3. Mean-variance relation for spike counts across the population for several
bin widths. The left column shows the relation in log-log coordinates; the
right column shows the relation in linear coordinates. Bin widths of 5, 40,
and 200 msec are shown, as indicated by the number in each panel of the left
column. The mean for each bin is plotted on the abscissa, and the variance is
plotted on the ordinate. The scalloped line that defines the bottom of each
distribution is the minimum obtainable variance, given that spikes are
discrete events. The line with a slope of 1 defines the mean-variance function
expected of a Poisson process, in which the mean equals the variance. For
small bin sizes, most data fell below the unity line, and many data points
actually had the minimum variance obtainable. For larger bin sizes, the
mean-variance relation was distributed about unity. The data plotted included
all bins, before and after target onset. When the analysis was restricted to
the period after target onset, the results were similar. The slope of the best
fit line (data not shown) was assessed using the following equation: log(var)
= a + b*log(mean). The coefficients a
and b were as follows: a = -0.21, b = 0.96;
a = -0.58, b = 0.83; a = -0.05, b = 0.87
for the 5, 40, and 200 msec bin sizes, respectively.
|
|
In Figure 4A we
plot the matrix of correlation coefficients among 1 msec time bins for an
example cell from our population, for the period from 0 to 200 msec after
stimulus onset. This cell shows strong negative correlation near the main
diagonal and positive correlation over larger intervals, violating the
independence assumption of a Poisson process. This structure in the
correlation matrix can also be seen in the interspike interval histogram,
which is plotted in Figure
4B. We can see how correlations affect the measured
variance for a large bin of neural activity, by separating the variance into
terms attributable to the mean of a time bin and correlations between events
within a bin. To do this we divide a large bin into 1 msec bins and calculate
the means of and correlations among the 1 msec bins. The total variance of the
large bin is given by:
 | (9) |
where
tt' is the correlation coefficient between the
1 msec time bins and
t2 is the variance of the 1
msec bin at time t. From this, it can be seen that if a large bin of
neural activity covers an interval over which the correlation coefficients are
negative, the variance of the bin will decrease below the mean, attributable
to the second sum in Equation 9, and if the bin extends over an interval
during which the balance of the correlation is positive, the variance of the
bin will increase. The plot in Figure
4C shows the variance caused by each component of
Equation 9, as a function of bin width, for the cell with its correlation
matrix plotted in Figure
4A. The line labeled "Total"
(Fig. 4C) corresponds
to
2 on the left-hand side of Equation 9. The line labeled
"Independent" is the variance attributable to the first sum of
Equation 9, which is the sum of the variances of each separate 1 msec bin, and
the line labeled "Correlated" is the variance attributable to the
correlation between bins, given by the second sum in Equation 9. For this
example the effect of the correlations between bins increased the variance for
bin widths >60 msec. Therefore, the scatter in the mean-variance plots
shown in Figure 3 can be
accounted for by correlations between 1 msec bins that make up a larger bin.
This implies that if there are correlations in the spike trains, the size of
the bin chosen to estimate the variance will affect the estimate of the
variance, a fact that has been shown previously
(Oram et al., 2001
). This is
important for measures of neuronal variability and models that try to account
for variability in neuronal responses
(Salinas and Sejnowski, 2000
),
because these measures may depend strongly on correlations in the neuronal
response within the large time bin being considered.
We also explored the covariance structure of the noise between pairs of
neurons in our data set. The question of correlated responses has been
approached from two perspectives. If the total response of the neurons is
considered, the total correlation in their responses can be calculated
(Kruger and Aiple, 1988
). If
the mean response of the neurons to a behavioral event is estimated, however,
the correlation can be split into a signal and a noise component, with the
correlation in the mean response taken as the correlation in the signal, and
the correlation in the trial to trial residual taken as the correlation in the
noise (Gawne and Richmond,
1993
; Zohary et al.,
1994
; Lee et al.,
1998
). Subtracting the mean response removes the effect of
first-order nonstationarity on the correlation between neurons. Residual
correlation, therefore, cannot be accounted for by the interval histogram or
correlations in the mean responses of the two neurons. In our case, the
correlation in the mean was calculated as the correlation in the PSTHs across
neurons, and the correlation in the noise was calculated as the correlation in
the residual after the PSTH was subtracted, with both measures being pooled
across bins (see Materials and Methods). The correlation in the residual was
calculated separately for each movement. In
Figure 5 we show the
distribution of the correlation in the noise (residual) as a function of bin
size, and in Figure 6 we show
the distribution of the correlation in the signal (PSTH) as a function of bin
size for the population. The width of the distribution of correlations at each
bin size was measured and plotted as the variance of the distribution (Figs.
5D,
6D). This measures the
amount of correlation in the population, because wider distributions imply
larger absolute values of correlation. It can be seen that the correlation in
the noise changed considerably as a function of bin size. The correlation in
the signal changed as well, although less. This suggests that the noise
correlation among the neurons was not broadband but was stronger at low
frequencies (see below).

View larger version (28K):
[in this window]
[in a new window]
|
Figure 5. Correlation in residual (noise correlation) as a function of bin size.
A-C show the distribution of correlation coefficients for
the population of pairs of neurons analyzed for three different bin widths: 5,
40, and 200 msec. D, Mean and the variance of the distribution of
correlation coefficients as a function of bin size. Dashed line is mean (M);
solid line is variance (V).
|
|

View larger version (24K):
[in this window]
[in a new window]
|
Figure 6. Correlation in PSTH (signal correlation). Conventions are as in
Figure 5. There was little
systematic effect of bin size on the mean of the distribution of correlations
in the mean response (data not shown). The variance of the distribution of
correlations in the mean is plotted in D. As with the correlation in
the noise, increasing the bin size increases the variance of the signal
correlation.
|
|
In Figure 7, we plot the
function that relates the correlation in the noise to the correlation in the
signal. There was a positive correlation between these two measures, such that
neurons with a more strongly correlated mean response also tended to have a
more strongly correlated noise response. Thus there was some tendency for
correlations to be stronger locally (in tuning function space, not in the
space of the cortical surface), which in general is deleterious to information
coding (see Discussion); however, it was not a strong relationship. The
strength of this correlation also tended to increase as a function of bin
width as shown in Figure
7D.

View larger version (31K):
[in this window]
[in a new window]
|
Figure 7. Correlation between signal correlation and noise correlation for pairs of
neurons. Conventions are as in Figure
5. The signal and noise correlation were positively correlated.
Furthermore, the correlation increased as a function of bin size. Note the
difference in the scale for the vertical axes in A-C,
reflecting the narrower distribution of correlated noise for smaller bin
sizes.
|
|
In Figure 8, we plot the
population average periodograms for the signal, the noise, and the coherence
between the residuals for pairs of neurons. It can be seen that the signal
power and coherence were strongest at low frequencies. The noise power
displayed a dip at low frequencies, which has been described previously
(Bair et al., 1994
). This dip
may be the result of the neural refractory period or a network level
inhibitory mechanism (Mar et al.,
1999
). The fact that the coherence was strongest at low
frequencies accounts for the change in correlation between neurons as a
function of bin width. Larger bins filter more of the high-frequency
variability in the neural response, leaving only the low-frequency
variability, which is more strongly correlated.
Spike count distributions
In the decoding analyses we assumed a multivariate Gaussian distribution as
an approximation to the distribution of spike counts
(Oram et al., 1998
). Other
studies have assumed a Poisson distribution
(Zhang et al., 1998
). We
characterized the empirical distributions by assessing the fit of Gaussian and
Poisson distributions to all samples in our data set. An example is shown in
Figure 9.
Figure 9A shows the
distributions fit to the data for a 20 msec bin, and
Figure 9B shows the
distributions fit to the data for a 100 msec bin. In the example shown in
Figure 9A, the Poisson
distribution could not be rejected [Kolmogorov-Smirnor (KS) test; p
> 0.05], whereas for the example shown in
Figure 9B, the
Gaussian distribution could not be rejected (KS test; p > 0.05).
Figure 10 shows the proportion
of individual bins (each cell contributed multiple bins to this plot) as a
function of bin size, which was fit by each parametric distribution. For small
bin sizes the Poisson distribution fit most bins; however, for larger bin
sizes, the ability of the Poisson distribution to fit the data decreased to
the level of the Gaussian distribution. The Gaussian distribution fit the data
better for intermediate bin sizes. For the largest bin sizes the distributions
did about equally well.

View larger version (24K):
[in this window]
[in a new window]
|
Figure 9. Examples of parametric distributions fit to empirical spike count
distributions. The asterisk in the corner of a plot in each column indicates
the distribution that fit the data, i.e., could not be rejected at a
p value of 0.05 with the KS test. A, Normal and Poisson
distributions fit to the data from a single 20 msec bin. B, Same
distributions fit to the data from a single 100 msec bin.
|
|

View larger version (17K):
[in this window]
[in a new window]
|
Figure 10. Proportion of times that a Gaussian or Poisson distribution successfully
fit the empirical distribution of spike counts as a function of bin size. Each
neuron contributed several bins. For example, for a single neuron at a bin
width of 10 msec there were 60 bins (600 msec, 10 msec bins) for each of three
movements. Thus the parametric distributions considered were fit to 180
separate empirical distributions for each neuron, at the 10 msec bin width.
Bins with a mean spike rate of 0 were not considered.
|
|
Decoding models
The noise analyses suggested that most of the mean responses, as well as
the correlation between neurons, were concentrated at low frequencies.
Conversely, the noise was stronger at high frequencies. The next question that
we addressed was whether any of this variability carried information about the
target for movement. We approached this problem directly, by comparing models
that decoded information using different parameters of the neural signal. For
these analyses, we used data in the 200 msec window beginning at target onset
to predict the subsequent target toward which the monkey would move. The
sequence of movements was deterministic; therefore, increasing the data window
backward or forward in time would have allowed us to achieve better decoding
performance, because the previous and subsequent targets were perfectly
correlated with the current target. We were not directly interested in the
absolute performance of the models, however, but rather in the relative
performance of different models. We chose the 200 msec window for several
reasons. First, pilot analyses showed that many of the pairs of neurons
reached their peak predictive capacity at the end of this window. Second, the
reaction time in the task was on average
240 msec
(Lee and Quessy, 2003
), so the
neural activity during this window is preparatory. Third, the 200 msec window
allowed us to construct a series of models of increasing complexity yet
limited the number of model parameters (NMPs).
We characterized the models using two heuristic dimensions. The first
dimension was the size of the bin used in the analysis. The bin widths
considered were the same as those used in the noise analyses, except we did
not consider the 5 or 10 msec bin size, because the models became too complex
(i.e., too many free parameters for our data set size), and pilot analyses had
shown that there was little information to be gained by using bin widths this
small. By increasing the bin width we reduced the temporal precision of the
neural signal. If information was not lost by eliminating the temporal
information, one would conclude that the temporal structure of the spike
arrival times was not important.
The second dimension was the structure of the covariance within and between
neurons. We constructed several model structures, all on the basis of the
multivariate Gaussian distribution. We used the multivariate Gaussian for two
reasons: (1) the univariate Gaussian distribution provided a reasonable fit to
the data over a range of relevant bin sizes, and (2) it was straightforward to
test the hypotheses that we were interested in by manipulating the covariance
matrix. The VEM model had a diagonal covariance matrix (i.e., all off-diagonal
elements were set to zero), with the variance equal to the mean as in an
independent Poisson model. The independent model had a diagonal covariance
matrix with the variance estimated from the data instead of being set equal to
the mean. The between model included only off-diagonal elements of the
covariance matrix that corresponded to interactions between neurons at the
same time point. The within model included only off-diagonal elements that
corresponded to interactions between adjacent time bins within neurons, and
the full model used a full covariance matrix.
In Figure 11A, we
plot the average decoding performance of the models, as a function of bin
width, for all pairs of neurons. It can be seen that the performance of the
models decreased relatively linearly as a function of the bin width
(decreasing number of bins). Of the various models considered, the model with
a full covariance matrix gave the best prediction, although it was only
marginally better than the other models. There was a wide distribution of
performance across pairs of neurons, as can be seen from
Figure 11B, which
shows the distribution of percentage correct performance across the population
at a bin width of 40 msec for the independent model. Although the raw
performance of the models is a useful benchmark, we would like to know the
generalization performance of the models, that is, how they would perform on
an unseen data set. We approached this problem using two model selection
procedures: K-fold CV and AIC. We performed CV by splitting the data
set 10 times, in each case using 1 of every 10 trials for the test set and the
other trials for estimating the model.

View larger version (17K):
[in this window]
[in a new window]
|
Figure 11. Percentage correct for all analyzed pairs of neurons. A, Mean
percentage correct as a function of bin width and covariance model considered.
VEM (solid black line) indicates the model with the variance set equal to
mean, independent (dashed black line) is the model with the variance estimated
from the data, between (dotted black line) considers interaction between
neurons, within (solid gray line) considers interactions between time bins for
a single neuron, and full (dashed gray line) is the full covariance matrix.
B, An example distribution of percentage correct across the
population at a bin width of 40 msec for the independent model.
|
|
In Figure 12 we plot the
population average percentage correct performance of the models, assessed by
CV. This population average was compiled by computing the percentage correct
for each pair of neurons, then averaging across all pairs, for each model. It
can be seen that cross validation corrects for model complexity, because the
most complex model no longer has the best performance (compare with
Fig. 11A). At the
population level, using the percentage correct as a measure of model
performance, the CV analysis suggests that either the independent or between
model, at a bin width of 33-40 msec, has the best performance. In
Figure 13, we plot the
population average of AIC, as a function of bin width for each covariance
structure. The AIC measure is a composite of the likelihood of the data under
the model and the model complexity (see Eq. 8). The model that has the minimum
AIC is selected as the best model within this framework. From the population
data, the independent model would be selected as the best at a bin width of 50
msec. The between model, which considered interactions between neurons, did
almost as well. On the other hand, the full model did rather poorly, which is
in contrast to its raw percentage correct performance
(Fig. 11A). In
Figure 14, the population
average AIC is plotted as a function of the NMPs. It can be seen that the AIC
first decreased quickly as a function of NMPs and then increased more slowly
with model complexity. If the AIC decreased and then increased again before
finally decreasing, we would have been within a regimen of the criteria within
which we were ineffectively penalizing overly complex models. Furthermore, the
AIC decreased quickly and then increased with a more gradual slope. In general
terms, this indicates that the model family is reasonably well matched to the
data.

View larger version (12K):
[in this window]
[in a new window]
|
Figure 14. Relation between the population average of AIC and the number of model
parameters. The AIC first dropped quickly to a minimum at 40 model
parameters and then increased again slowly. The shape of this function is
characteristic of a model that is not poorly matched to the structure of the
data. Furthermore the AIC value did not begin to decrease again for large
numbers of parameters, and thus we did not inadvertently select overly complex
models.
|
|
The AIC and CV methods make slightly different predictions about the
optimal model at the population level. We compared the performance of these
two criteria by testing them on a surrogate data set. The surrogate data set
was generated by independently shuffling individual bins across trials. This
preserved the mean and variance of each bin of the original data set but
destroyed all real correlations. We selected the best model, for each pair of
neurons, using the surrogate data set. A conservative model selection
criterion should not select between, within, or full models because the
correlations have been destroyed in the surrogate data set. In Tables
1 and
2, we show the performance of
AIC and CV. It can be seen that AIC selected no spurious models, whereas CV
selected spurious models 98 of 193 (51%) times. We obtained similar results
when the likelihood, instead of the percentage correct, was used as a
selection statistic for the CV analysis. These results are consistent with the
fact that CV can select overly complex models
(Larsen and Goutte, 1999
).
Furthermore, the performance of CV is always limited by the fact that the
sample sizes used to estimate and test the models are smaller than those
available to other model selection procedures
(Kearns et al., 1995
). Because
AIC seems to be a more conservative criterion, the remaining results will be
on the basis of model selection using AIC.
View this table:
[in this window]
[in a new window]
|
Table 2. Best model selected from entire parameter space using maximum percentage
correct and CV, on surrogate dataset
| |
We performed the AIC model selection analysis on all pairs of neurons,
selecting the best model for each pair.
Table 3 shows the number of
times each model was selected as best for all neuron pairs. There are several
relevant points. (1) The simplest model (bin width = 200 msec; VEM) was
selected in only 22 cases (11%). (2) Many pairs of neurons (62%) had their
best performance at bin sizes between 40 and 66 msec. (3) Many pairs of
neurons (19%) also preferred covariance structures more complex than the
independent model, including interactions between neurons (between, 8%) or
between time bins for a single neuron (within, 11%). (4) Overall, 92% of the
neuron pairs could be treated as independent without a loss of
information.
We also considered which model was best, within one dimension of the
analysis, while holding the other dimension constant. For example, at a bin
width of 200 msec, what was the best covariance structure for each neuron?
Table 4, which shows the best
bin size for each covariance structure, shows that for many neuron pairs
temporal precision down to 40 msec improved the decoding performance, for all
models except for the full model. Furthermore, the within model showed a
slightly greater preference for the 20 msec bin size (eight pairs) than the
other models, consistent with the fact that the noise power was stronger at
higher frequencies. Table 5
shows that when the best covariance structure for a given bin size was
considered, the between and within models were often significant (93 and 108
neuron pairs, respectively), but the full model was almost never significant.
The profile of the within model, in Table
5, followed the profile of the noise power relatively well: there
was a tendency for the within model to be selected for large bins, which
dropped off for the intermediate-sized bins (40-66 msec) but then became
strong again for smaller bins. Analyses of the between model in
Table 5 also followed the
profile of the coherence curve (Fig.
8) to a certain extent, because it was selected more often for
large bin sizes but then less often for small bin sizes. Overall the simplest
models (VEM and independent) were preferred most often.
All of the models that we have considered thus far have assumed that the
neuronal variability and interneuronal correlation are constant across
different targets. It is possible, however, that either the variability or the
correlation may have changed across different targets, and taking this change
into account could improve our results. We assessed this possibility by
performing an analysis in which we compared, for each pair of neurons, the
performance of the model selected as best above (i.e., with a pooled
covariance matrix) with the performance of a model that used the same
interactions, but estimated them separately for each movement. We found that
these more complex models were selected as best only four times. Thus, for our
data set, the model with a pooled covariance matrix performed better than the
models that used separate covariance matrices for each target, a result that
has been shown previously (Averbeck et al.,
2003
).
Neural activity shows variation across multiple time scales
(Bair et al., 2001
). We have
also shown this with our coherence analysis
(Fig. 8) because the coherence
values at different frequencies imply correlation at different time scales;
low-frequency coherence is long time scale correlation. We repeated the
decoding analysis on neural signals that had their low-frequency components
removed, because these are the components that are assumed to be unimportant
for the purposes of neural information transmission and are probably more
related to slow changes in neuronal responsiveness
(Bair et al., 2001
;
Lee and Quessy, 2003
). We
performed the analysis by high-pass filtering the spike trains (see Materials
and Methods) before the binning process. Interestingly, the decoding
performance averaged over all models (compared with
Fig. 11A) improved by
3% for bin sizes <200 msec (e.g., from
0.60 to
0.63 at a bin
size of 100 msec). The VEM model did poorly; because most of the mean response
was removed by the filter, the variance would be set to a very small value.
Thus the lowest frequencies were carrying more noise than signal.
In the final analysis, we considered whether the presence of correlation
between neurons would be related to the best model selected for pairs of
neurons. If the AIC criterion is selecting models properly, it should select
the between model for neurons that are correlated more strongly. The absolute
value of the correlation, calculated as the integral of the coherence curve,
is 0.085 (SEM = 0.016; n = 67), 0.085 (SEM = 0.009; n = 88),
0.257 (SEM = 0.048; n = 16), and 0.108 (SEM = 0.032; n = 22)
for the VEM, independent, between, and within models, respectively. The
difference in the correlation between neurons as a function of the covariance
model selected was highly significant (ANOVA; F = 10.276; n
= 192; p < 0.0005). Post hoc tests confirmed that pairs
of neurons that had a best model structure of between had significantly more
correlation than neurons that had a best model structure of VEM (Tukey's HSD;
p < 0.0005) or independent (Tukey's HSD; p < 0.0005).
Thus, the model selected as best used the covariance structure between the
neurons.
 |
Discussion
|
|---|
Noise analyses
The function relating the mean spike count to its trial-to-trial variance
has been investigated in the visual
(Schiller et al., 1976
;
Tolhurst et al., 1983
;
Gur et al., 1997
;
Wiener et al., 2001
),
somatosensory (Werner and Mountcastle,
1963
), auditory (Teich and
Khanna, 1985
), and motor (Lee
et al., 1998
) systems. Most of these studies used static stimuli
and relatively large bin sizes. These studies found a roughly linear
mean-variance relation in log-log coordinates with a slope near 1, which would
be expected of a Poisson process. In our data, the variance was often less
than the mean for small bin sizes, likely because of negative correlations
over small time scales. Several authors have recently reported very small
variances in the visual system of various species
(Bair and Koch, 1996
;
Berry et al., 1997
;
de Ruyter van Steveninck et al.,
1997
). Our results do not show the extreme consistency
demonstrated in these studies, perhaps because of the fact that the movements
in our task are less consistent than the stimuli used in studies of the visual
system.
Our analyses of correlation in the mean and variability of neuronal
responses in pairs of neurons have shown that their estimates are affected by
the time scale over which they are measured, a finding that has also been
reported in V1 (Reich et al.,
2001a
). Because most of the coherence between neurons is at low
frequencies, correlations are higher between neural responses for larger bin
widths. Thus, it is important when considering neural coding in ensembles of
neurons to explicitly consider the time scale at which the information is
being represented, because the noise characteristics of the ensemble will be
dependent on the time scale.
There is an important difference between the question of whether a model
(or a downstream neuron) that takes into account covariance can outperform a
model that does not (Nirenberg et al.,
2001
; Wu et al.,
2001
), a question that we addressed with the decoding analyses and
the question of which covariance structures can carry the most information,
(Johnson, 1980
;
Zohary et al., 1994
;
Abbott and Dayan, 1999
;
Wilke and Eurich, 2002
;
Pola et al., 2003
). Uniform
noise correlation improves the ability of a population of neurons to carry
information, if there is an inverse relation between the correlated signal and
the correlated noise. Conversely, if the correlation in the signal and noise
are both either positive or negative, the correlation will be deleterious.
Local noise correlation, in tuning function space, is deleterious in general.
We found that noise correlation tended to be stronger locally, which suggests
that the correlations in our data affect information encoding deleteriously.
This analysis has limitations, however, because variations in the kinematics
of the movement from trial to trial may cause neurons that have similar tuning
functions to have more strongly correlated noise. A similar problem in the
visual system is caused by small eye movements; however, it has been shown
previously that there was no difference in noise correlation between a center
hold epoch and a movement epoch (Lee et
al., 1998
). Also, the data that we analyzed came mostly from a
time period when the monkey's hand was stationary, which should have minimized
the effect of correlation attributable to kinematic variability. Similarly,
correlated noise in the prefrontal cortex has been shown to be relatively
independent of small eye movements
(Constantinidis and Goldman-Rakic,
2002
).
Decoding analyses
We explored a hierarchy of decoding models with respect to two questions.
First, did the times at which spikes occurred within a 200 msec window matter?
Second, which interactions between bins within a neuron or across neurons
should be accounted for to optimize decoding? At the population level, we
found that spike arrival times at a resolution of
50 msec were optimal.
The findings for individual pairs of neurons were similar, except that the
best model for the largest number of pairs was 66 msec. There was little
evidence for signal in bin sizes below 40 msec. The preferred bin size of 50
msec corresponds to an upper cutoff frequency of 10 Hz. Including frequencies
above 10 Hz did not add significant information about the subsequent movement.
This is consistent with the results of the noise analysis, given that the
signal-to-noise ratio was highest at low frequencies, and the coherence was
also strongest below 10 Hz (Fig.
8). Thus, the model that used a bin size of 50 msec might
represent a useful tradeoff between signal and noise, because at >10 Hz
there was little signal power but the noise power remained larger.
We found evidence that some form of correlation was important for 19% of
the pairs of neurons; however, correlations between neurons in our study
improved decoding performance in only 8% of the individual pairs of neurons.
Previous results have been split on the question of whether there is
information in correlations between neurons. Some studies have found that
interactions can improve decoding performance
(Dan et al., 1998
;
Maynard et al., 1999
), whereas
others have found that neurons could be treated independently without losing
much information (Nirenberg et al.,
2001
; Petersen et al.,
2001
; Averbeck et al.,
2003
; Rolls et al.,
2003
). Our results support the claim that ignoring correlation
between neurons, in some systems, does not dramatically affect the information
extracted from the neural code.
The question of whether spike arrival times can carry information has been
approached with several analytical techniques. One approach is to assume that
the cross-correlation between a pair of neurons carries information
(Vaadia et al., 1995
;
Riehle et al., 1997
). It is
important, however, to control for the effect of spike rate when looking for
information in cross-correlations (Oram et
al., 2001
), which our analytical approach has done. Many other
studies in the visual system (Buracas et
al., 1998
; Reinagel and Reid,
2000
; Reich et al.,
2001b
) and the somatosensory system
(Panzeri et al., 2001
;
Petersen et al., 2001
) have
shown that the arrival times of spikes within single neurons can carry
information. Studies using static visual stimuli have shown that the arrival
times of spikes add additional information, down to a temporal resolution on
the order of 30-50 msec (Heller et al.,
1995
; Victor and Purpura,
1998
), whereas other studies, using dynamic visual stimuli, have
shown that spike arrival times in the submillisecond range can carry
information (Dan et al., 1998
;
Strong et al., 1998
;
Reinagel and Reid, 2000
).
An additional question is whether we have modeled the behavior
appropriately, because we have treated the movement as a categorical variable.
It seems likely that the underlying behavioral process is dynamic, in which
case the temporal properties of the code that we have identified could be
linear representations of a dynamically evolving behavior
(Golomb et al., 1994
). This
consideration effectively places a limit on studies of the neural coding of
cognitive factors or decisions, because one cannot directly measure the
dynamics of cognitive processes.
An important question is whether the cortex is decoding neural responses
optimally. Theoretical work has shown that biologically plausible networks can
optimally decode neural responses, when the noise is uncorrelated across
neurons (Deneve et al., 1999
).
As stated in Introduction, the information present in the neural response is
an upper bound on the information actually used by the cortical networks
mediating behavior. The decoding approach cannot address the question of
whether downstream neurons are capable of extracting the same patterns as the
decoding algorithm, but the analysis does set a lower limit on the
sophistication of a downstream neuron if it is to extract all the information
in the neural response.
 |
Footnotes
|
|---|
Received May. 20, 2003;
revised Jun. 27, 2003;
accepted Jul. 7, 2003.
This study was supported by National Institutes of Health (NIH) Grants
R01-MH59216 and P30-EY01319 (D.L.) and NIH Postdoctoral Training Grant T32
EY07125 (B.B.A.). We are grateful to Rita Farrell, Ryan Murray, and Stephan
Quessy for their help with the experiment, as well as Robbie A. Jacobs, David
Knill, Walter Makous, and Alex Pouget for discussions on the data analysis and
Michelle Conroy for comments on this manuscript.
Correspondence should be addressed to Dr. Daeyeol Lee, Department of Brain
and Cognitive Sciences, Center for Visual Science, University of Rochester,
Rochester, NY 14627. E-mail:
dlee{at}cvs.rochester.edu.
Copyright © 2003 Society for Neuroscience
0270-6474/03/237630-12$15.00/0
 |
References
|
|---|
Abbott LF, Dayan P (1999) The effect of correlated
variability on the accuracy of a population code. Neural Comput
11: 91-101.[Web of Science][Medline]
Averbeck BB, Crowe DA, Chafee MV, Georgopoulos AP
(2003) Neural activity in prefrontal cortex during copying
geometrical shapes. II. Decoding shape segments from neural ensembles.
Exp Brain Res 150:
142-153.[Web of Science][Medline]
Bair W, Koch C (1996) Temporal precision of spike
trains in extrastriate cortex of the behaving macaque monkey. Neural
Comput 8:
1185-1202.[Web of Science][Medline]
Bair W, Koch C, Newsome W, Britten K (1994) Power
spectrum analysis of bursting cells in area MT in the behaving monkey.
J Neurosci 14:
2870-2892.[Abstract]
Bair W, Zohary E, Newsome WT (2001) Correlated firing
in macaque visual area MT: time scales and relationship to behavior. J
Neurosci 21:
1676-1697.[Abstract/Free Full Text]
Barlow HB (1963) The information capacity of nervous
transmission. Kybernetik 2:
1.
Berry MJ, Warland DK, Meister M (1997) The structure
and precision of retinal spike trains. Proc Natl Acad Sci USA
94: 5411-5416.[Abstract/Free Full Text]
Bradley A, Skottun BC, Ohzawa I, Sclar G, Freeman RD
(1987) Visual orientation and spatial frequency discrimination: a
comparison of single neurons and behavior. J Neurophysiol
57: 755-772.[Abstract/Free Full Text]
Buracas GT, Zador AM, DeWeese MR, Albright TD (1998)
Efficient discrimination of temporal patterns by motion-sensitive neurons in
primate visual cortex. Neuron 20:
959-969.[Web of Science][Medline]
Constantinidis C, Goldman-Rakic PS (2002) Correlated
discharges among putative pyramidal neurons and interneurons in the primate
prefrontal cortex. J Neurophysiol 88:
3487-3497.[Abstract/Free Full Text]
Dan Y, Alonso JM, Usrey WM, Reid RC (1998) Coding of
visual information by precisely correlated spikes in the lateral geniculate
nucleus. Nat Neurosci 1:
501-507.[Web of Science][Medline]
de Ruyter van Steveninck RR, Lewen GD, Strong SP, Koberle R, Bialek
W (1997) Reproducibility and variability in neural spike trains.
Science 275:
1805-1808.[Abstract/Free Full Text]
Deneve S, Latham PE, Pouget A (1999) Reading
population codes: a neural implementation of ideal observers. Nat
Neurosci 2:
740-745.[Web of Science][Medline]
Efron B, Tibshirani RJ (1998) An introduction
to the bootstrap. New York: Chapman and Hall.
Gawne TJ, Richmond BJ (1993) How independent are the
messages carried by adjacent inferior temporal cortical neurons? J
Neurosci 13:
2758-2771.[Abstract]
Golomb D, Kleinfeld D, Reid RC, Shapley RM, Shraiman BI
(1994) On temporal codes and the spatiotemporal response of
neurons in the lateral geniculate nucleus. J Neurophysiol
72: 2990-3003.[Abstract/Free Full Text]
Gur M, Beylin A, Snodderly DM (1997) Response
variability of neurons in primary visual cortex (V1) of alert monkeys.
J Neurosci 17:
2914-2920.[Abstract/Free Full Text]
Heggelund P, Albus K (1978) Response variability and
orientation discrimination of single cells in striate cortex of cat.
Exp Brain Res 32:
197-211.[Web of Science][Medline]
Heller J, Hertz JA, Kjaer TW, Richmond BJ (1995)
Information flow and temporal coding in primate pattern vision. J
Comput Neurosci 2:
175-193.[Web of Science][Medline]
Hernandez A, Zainos A, Romo R (2000) Neuronal
correlates of sensory discrimination in the somatosensory cortex. Proc
Natl Acad Sci USA 97:
6191-6196.[Abstract/Free Full Text]
Jarvis MR, Mitra PP (2001) Sampling properties of the
spectrum and coherency of sequences of action potentials. Neural
Comput 13:
717-749.[Web of Science][Medline]
Johnson KO (1980) Sensory discrimination: neural
processes preceding discrimination decision. J Neurophysiol
43: 1793-1815.[Free Full Text]
Kearns M, Mansour Y, Ng AY, Ron D (1995) An
experimental and theoretical comparison of model selection methods. In:
Proceedings of the 7th workshop on computational learning
theory. New York: ACM Press.
Kruger J, Aiple F (1988) Multimicroelectrode
investigation of monkey striate cortex: spike train correlations in the
infragranular layers. J Neurophysiol 60:
798-828.[Abstract/Free Full Text]
Larsen J, Goutte C (1999) On optimal data split for
generalization estimation and model selection. In: Proceedings of the
IEEE workshop on neural networks for signal processing IX (Hu YH,
Larsen J, Wilson E, Douglas S, eds), pp 225-234.
Piscataway, NJ: IEEE.
Lee D (2003) Coherent oscillations in neuronal
activity of the supplementary motor area during a visuomotor task. J
Neurosci 23:
6798-6809.[Abstract/Free Full Text]
Lee D, Quessy S (2003) Activity in the supplementary
motor area related to learning and performance during a sequential visuomotor
task. J Neurophysiol 89:
1039-1056.[Abstract/Free Full Text]
Lee D, Port NL, Kruse W, Georgopoulos AP (1998)
Variability and correlated noise in the discharge of neurons in motor and
parietal areas of the primate cortex. J Neurosci
18: 1161-1170.[Abstract/Free Full Text]
MacKay DM, McCulloch WS (1952) The limiting
information capacity of a neuronal link. Bull Math Biophys
14: 127-135.
Mar DJ, Chow CC, Gerstner W, Adams RW, Collins JJ
(1999) Noise shaping in populations of coupled model neurons.
Proc Natl Acad Sci USA 96:
10450-10455.[Abstract/Free Full Text]
Maynard EM, Hatsopoulos NG, Ojakangas CL, Acuna BD, Sanes JN,
Normann RA, Donoghue JP (1999) Neuronal interactions improve
cortical population coding of movement direction. J Neurosci
19: 8083-8093.[Abstract/Free Full Text]
Mountcastle VB, Talbot WH, Sakata H, Hyvarinen J
(1968) Cortical neuronal mechanisms in flutter-vibration studied
in unanesthetized monkeys. Neuronal periodicity and frequency discrimination.
J Neurophysiol 32:
452-484.[Web of Science]
Mountcastle VB, Steinmetz MA, Romo R (1990) Frequency
discrimination in the sense of flutter: psychophysical measurements correlated
with post-central events in behaving monkeys. J Neurosci
10: 3032-3044.[Abstract]
Newsome WT, Britten KH, Movshon JA (1989) Neuronal
correlates of a perceptual decision. Nature
341: 52-54.[Medline]
Nirenberg S, Carcieri SM, Jacobs AL, Latham PE (2001)
Retinal ganglion cells act largely as independent encoders.
Nature 411:
698-701.[Medline]
Oppenheim A, Schafer R (1989) Discrete-time
signal processing, Ed 1. Englewood Cliffs, NJ: Prentice
Hall.
Oram MW, Foldiak P, Perrett DI, Sengpiel F (1998) The
"ideal homunculus": decoding neural population signals.
Trends Neurosci 21:
259-265.[Web of Science][Medline]
Oram MW, Hatsopoulos NG, Richmond BJ, Donoghue JP
(2001) Excess synchrony in motor cortical neurons provides
redundant direction information with that from coarse temporal measures.
J Neurophysiol 86:
1700-1716.[Abstract/Free Full Text]
Panzeri S, Petersen RS, Schultz SR, Lebedev M, Diamond ME
(2001) The role of spike timing in the coding of stimulus
location in rat somatosensory cortex. Neuron
29: 769-777.[Web of Science][Medline]
Papoulis A (1991) Probability, random variables
and stochastic processes, Ed 3. New York: McGraw-Hill.
Parker A, Hawken M (1985) Capabilities of monkey
cortical cells in spatial-resolution tasks. J Opt Soc Am [A]
2: 1101-1114.[Web of Science][Medline]
Perkel DH, Bullock TH (1969) Neural coding. In:
Neurosciences research symposium summaries (Schmitt FO,
Melnechuk T, Quarton GC, Adelman G, eds), pp 405-527.
Cambridge, MA: MIT.
Petersen RS, Panzeri S, Diamond ME (2001) Population
coding of stimulus location in rat somatosensory cortex. Neuron
32: 503-514.[Web of Science][Medline]
Pola G, Thiele A, Hoffmann KP, Panzeri S (2003) An
exact method to quantify the information transmitted by different mechanisms
of correlational coding. Network 14:
35-60.[Web of Science][Medline]
Reich DS, Mechler F, Victor JD (2001a) Independent and
redundant information in nearby cortical neurons. Science
294: 2566-2568.[Abstract/Free Full Text]
Reich DS, Mechler F, Victor JD (2001b) Temporal coding
of contrast in primary visual cortex: when, what, and why. J
Neurophysiol 85:
1039-1050.[Abstract/Free Full Text]
Reinagel P, Reid RC (2000) Temporal coding of visual
information in the thalamus. J Neurosci
20: 5392-5400.[Abstract/Free Full Text]
Riehle A, Grun S, Diesmann M, Aertsen A (1997) Spike
synchronization and rate modulation differentially involved in motor cortical
function. Science 278:
1950-1953.[Abstract/Free Full Text]
Rolls ET, Franco L, Aggelopoulos NC, Reece S (2003)
Application of an information theoretic approach to analyzing the
contributions of the firing rates and the correlations between the firing of
neurons. J Neurophysiol 15:
2810-2822.
Salinas E, Sejnowski TJ (2000) Impact of correlated
synaptic input on output firing rate and variability in simple neuronal
models. J Neurosci 20:
6193-6209.[Abstract/Free Full Text]
Schiller PH, Finlay BL, Volman SF (1976) Short-term
response variability of monkey striate neurons. Brain Res
105: 347-349.[Web of Science][Medline]
Stone M (1977) An asymptotic equivalence of choice of
model by cross-validation and Akaike's criterion. J Roy Statist Soc Ser
B 39:
44-47.
Strong SP, Koberle R, de Ruyter van Steveninck RR, Bialek W
(1998) Entropy and information in neural spike trains.
Phys Rev Lett 80:
197-200.
Talbot WH, Darian-Smith I, Kornhuber HH, Mountcastle VB
(1968) The sense of flutter-vibrations: comparison of the human
capacity with the response patterns of mechanoreceptive afferents from the
monkey's hand. J Neurophysiol 31:
301-334.[Free Full Text]
Teich MC, Khanna SM (1985) Pulse-number distribution
for the neural spike train in the cat's auditory nerve. J Acoust Soc
Am 77:
1110-1128.[Web of Science][Medline]
Tolhurst DJ, Movshon JA, Dean AF (1983) The
statistical reliability of signals in single neurons in cat and monkey visual
cortex. Vision Res 23:
775-785.[Web of Science][Medline]
Vaadia E, Haalman I, Abeles M, Bergman H, Prut Y, Slovin H, Aertsen
A (1995) Dynamics of neuronal interactions in monkey cortex in
relation to behavioural events. Nature
373: 515-518.[Medline]
Victor JD, Purpura KP (1998) Spatial phase and the
temporal structure of the response to gratings in V1. J
Neurophysiol 80:
554-571.[Abstract/Free Full Text]
Vogels R, Orban GA (1990) How well do response changes
of striate neurons signal differences in orientation: a study in the
discriminating monkey. J Neurosci 10:
3543-3558.[Abstract]
Werner G, Mountcastle VB (1963) The variability of
central neural activity in a sensory system, and its implications for the
central reflections of sensory events. J Neurophysiol
26: 958-977.[Free Full Text]
Wiener MC, Oram MW, Liu Z, Richmond BJ (2001)
Consistency of encoding in monkey visual cortex. J Neurosci
21: 8210-8221.[Abstract/Free Full Text]
Wilke SD, Eurich CW (2002) Representational accuracy
of stochastic neural populations. Neural Comput
14: 155-189.[Web of Science][Medline]
Wu S, Nakahara H, Amari S (2001) Population coding
with correlation and an unfaithful model. Neural Comput
13: 775-797.[Web of Science][Medline]
Zhang K, Ginzburg I, McNaughton BL, Sejnowski TJ
(1998) Interpreting neuronal population activity by
reconstruction: unified framework with application to hippocampal place cells.
J Neurophysiol 79:
1017-1044.[Abstract/Free Full Text]
Zohary E, Shadlen MN, Newsome WT (1994) Correlated
neuronal discharge rate and its implications for psychophysical performance.
Nature 370:
140-143.[Medline]
This article has been cited by other articles:

|
 |

|
 |
 
N. S. Narayanan and M. Laubach
Delay Activity in Rodent Frontal Cortex During a Simple Reaction Time Task
J Neurophysiol,
June 1, 2009;
101(6):
2859 - 2871.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. R. Cohen and W. T. Newsome
Estimates of the Contribution of Single Neurons to Perception Depend on Timescale and Noise Correlation
J. Neurosci.,
May 20, 2009;
29(20):
6635 - 6648.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. L. Jacobs, G. Fridman, R. M. Douglas, N. M. Alam, Peter. E. Latham, G. T. Prusky, and S. Nirenberg
Ruling out and ruling in neural codes
PNAS,
April 7, 2009;
106(14):
5936 - 5941.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. C. Osborne, S. E. Palmer, S. G. Lisberger, and W. Bialek
The Neural Basis for Combinatorial Coding in a Cortical Population Response
J. Neurosci.,
December 10, 2008;
28(50):
13522 - 13531.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. B. Hamed, M. H. Schieber, and A. Pouget
Decoding M1 Neurons During Multiple Finger Movements
J Neurophysiol,
July 1, 2007;
98(1):
327 - 333.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
F. Montani, A. Kohn, M. A. Smith, and S. R. Schultz
The Role of Correlations in Direction and Contrast Coding in the Primary Visual Cortex
J. Neurosci.,
February 28, 2007;
27(9):
2338 - 2348.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. B. Averbeck and D. Lee
Effects of Noise Correlations on Information Encoding and Decoding
J Neurophysiol,
June 1, 2006;
95(6):
3633 - 3644.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. M. Churchland, B. M. Yu, S. I. Ryu, G. Santhanam, and K. V. Shenoy
Neural variability in premotor cortex provides a signature of motor preparation.
J. Neurosci.,
April 5, 2006;
26(14):
3697 - 3712.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Vladusich, M. P. Lucassen, and F. W. Cornelissen
Do Cortical Neurons Process Luminance or Contrast to Encode Surface Properties?
J Neurophysiol,
April 1, 2006;
95(4):
2638 - 2649.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. E. Latham and S. Nirenberg
Synergy, Redundancy, and Independence in Population Codes, Revisited
J. Neurosci.,
May 25, 2005;
25(21):
5195 - 5206.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Kohn and M. A. Smith
Stimulus Dependence of Neuronal Correlation in Primary Visual Cortex of the Macaque
J. Neurosci.,
April 6, 2005;
25(14):
3661 - 3673.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Mazzoni, E. Garcia-Perez, D. Zoccolan, S. Graziosi, and V. Torre
Quantitative Characterization and Classification of Leech Behavior
J Neurophysiol,
January 1, 2005;
93(1):
580 - 593.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Cohen and M. A. L. Nicolelis
Reduction of Single-Neuron Firing Uncertainty by Cortical Ensembles during Motor Skill Learning
J. Neurosci.,
April 7, 2004;
24(14):
3574 - 3582.
[Abstract]
[Full Text]
[PDF]
|
 |
|