Abstract
The relationship between spiking activities in motor cortex and movement kinematics has been well studied in neurologically intact nonhuman primates. We examined the relationship between spiking activities in primary motor cortex (M1) and intended movement kinematics (position and velocity) using 96-microelectrode arrays chronically implanted in two humans with tetraplegia. Study participants were asked to perform two different tasks: imagined pursuit tracking of a cursor moving on a computer screen and a “neural cursor center-out” task in which cursor position was controlled by the participant's neural activity. In the pursuit tracking task, the majority of neurons were significantly tuned: 90% were tuned to velocity and 86% were tuned to position in one participant; 95% and 84%, respectively, in the other. Additionally, velocity and position of the tracked cursor could be decoded from the ensemble of neurons. In the neural cursor center-out task, tuning to direction of the intended target was well captured by a log-linear cosine function. Neural spiking soon after target appearance could be used to classify the intended target with an accuracy of 95% in one participant, and 80% in the other. It was also possible to extract information about the direction of the difference vector between the target position and the instantaneous neural cursor position. Our results indicate that correlations between spiking activity and intended movement velocity and position are present in human M1 after the loss of descending motor pathways, and that M1 spiking activities share many kinematic tuning features whether movement is imagined by humans with tetraplegia, or is performed as shown previously in able-bodied nonhuman primates.
Introduction
Neural activity in the primate primary motor cortex (M1) is known to correlate with movement kinematics. Previous studies in nonhuman primates have described M1 tuning to position, velocity and acceleration of the hand during point-to-point reaching, pursuit tracking, and drawing movements (Georgopoulos et al., 1982; Kettner et al., 1988; Schwartz, 1994, 2004; Fu et al., 1995; Moran and Schwartz et al., 1999; Paninski et al., 2004). Although the interpretation and identification of the mechanisms underlying these tuning functions remain controversial (Kalaska et al., 1989; Todorov, 2000; Scott et al., 2001; Scott, 2004), their statistical characterization has been important in informing and constraining models of M1's participation in motor control. Studies of the relationships between spiking activities and movement control have been, however, largely limited to the study of nonhuman primates because of the need for an invasive sensor to record neuronal action potentials.
The launch of pilot clinical trials to develop neural interface systems based on signals derived from intracortical microelectrode arrays provides a unique opportunity to evaluate the properties of M1 neurons in humans. We have previously reported neural prosthetic results (Hochberg et al., 2006a) for the first participant in a pilot clinical trial of an intracortically based neural interface system [BrainGate Neural Interface System (NIS); Cyberkinetics Neurotechnology Systems, Foxborough, MA]. Among the many unknowns, it was unclear whether and how the presumed arm area in M1 [the “knob” (Yousry et al., 1997)] would be related to intended or attempted movement kinematics (position and velocity), rather than actual movements, in patients with lost or highly impaired voluntary control of limb motion. This participant could successfully control the position of external devices (e.g., a computer cursor) via a linear filter that related M1 spiking to position. Although the type of implemented tasks, training procedures and online neural decoding algorithms in the pilot trial were primarily designed with neuroprosthesis goals in mind, analysis of the collected data provides the opportunity to investigate more general encoding properties in human M1, also furthering neural interface system development. Here, we studied the relationship between spiking activities and intended movement velocity and position in two trial participants during two tasks: (1) imagined pursuit tracking of a computer cursor moving on a screen, and (2) a “neural cursor center-out” task executed by the participant using a neural interface system.
Materials and Methods
Participants, surgical procedures, and the BrainGate system
An investigational device exemption (IDE) for these studies was obtained from the United States Food and Drug Administration. Participant 1 (SCI001, henceforth referred to as S1) in this study was a 24-year-old male who, 3 years before trial enrollment, sustained a knife wound that transected the spinal cord between cervical vertebrae C3 and C4, resulting in complete tetraplegia (C4 ASIA A). Participant 2 (SCI003, henceforth referred to as S3) was a 53-year-old female who sustained a pontine stroke 9 years before trial enrollment, resulting in loss of speech and locked-in syndrome, which later resolved to incomplete tetraplegia. Unlike S1, this participant has intact sensory pathways, with normal perception of light touch, pinprick, and kinesthetic stimuli. After informed consent and medical and surgical screening procedures, a small array of electrodes was implanted into the arm area of the dominant precentral gyrus of each participant using a pneumatic technique (Rousche and Normann, 1992; Suner et al., 2005). The M1 hand/arm area was identified anatomically as the “knob” region (Yousry et al., 1997; Hochberg et al., 2006a) of the precentral gyrus in preoperative magnetic resonance imaging (MRI). Although the posterior region of the precentral gyrus was targeted, it is possible that some of the array recordings included neurons from Brodmann area 6.
The recording device, preamplifiers, data acquisition systems, and computer are part of the investigational NIS. The sensor is a 10 × 10 array of silicon microelectrodes that protrude 1 mm (S1) or 1.5 mm (S3) from a 4 × 4 mm platform (Guillory and Norman, 1999). Ninety-six electrodes were available for signal acquisition. At manufacture, electrodes had an impedance of 322 ± 138 kΩ (mean ± SD, at 1 kHz) for S1 and 310 ± 125 kΩ for S3. After placement, electrodes penetrate into the cortex to record neurons in intermediate layers. Recorded electrical signals pass externally through a titanium percutaneous connector, which is secured to the skull. Cabling attached to the connector during recording sessions routes signals to external amplifiers and then to a series of computers in a cart that process the signals and convert them into an output or control signal. We call this a “neural cursor” when the output is used to control a cursor on a computer monitor displayed to the participant.
Neural recordings
Research sessions were scheduled at least once per week at the participant's residence. Sessions would commence with neural recording and spike discrimination, followed by a pursuit tracking task whose purpose in the context of the clinical trial was the building of decoding filters. In the two sessions for each participant reported in this study, a neural cursor center-out task followed next, which used the decoder built during the pursuit tracking task (see below for details). Only those neural signals (units) selected immediately before decoder building were used for that session's neural cursor center-out task. Units were manually discriminated by a technician using visual features to place time-amplitude windows on waveforms displayed within 1.6 ms windows triggered when the signal crossed a manually adjusted threshold (Cyberkinetics Central Software) while the participant was at rest. All on-line experiments used spike events sorted in this manner. For the off-line analyses described here, spikes were re-sorted using Off-line Sorter (Plexon, Dallas, TX). Sorted spikes were treated as single units depending on several criteria including analysis of autocorrelation functions, waveforms and signal-to-noise ratios (Suner et al., 2005). The analyses were performed on data from sessions corresponding to postoperative days 86 and 90 (S1) and to days 54 and 55 (S3). Each of these was treated as a separate data set with no assumptions about which neurons may have been recorded in both sessions. We considered that the recorded spikes were generated by the same neuron during the session if the corresponding time amplitude window discriminator for that neuron remained the same across the session. However, day to day small motions of the microelectrode array could result in variation of recorded waveforms from the same neuron or the measurement of a different neuron that had similar autocorrelation properties and similar recorded waveforms at the same electrode. For these reasons, the determination of whether any particular unit recorded on one day was the same as or different from that recorded on a subsequent day is a difficult problem.
Behavioral tasks
We studied M1 tuning to intended movement position and velocity in two different tasks: imagined visually guided pursuit tracking and neural cursor center-out tasks.
Visually guided pursuit tracking.
Participants were asked to track a computer cursor moving on a monitor, as if they were controlling its position by intending to move their dominant arm or hand, similar to controlling a computer cursor via a hand held mouse. We refer to this moving computer cursor as the “tracked” cursor (TC). The TC trajectory was controlled by a technician, positioned beside the participant, via a computer mouse. The TC was moved through a succession of randomly positioned targets on the screen. Examples of TC paths can be seen in Figure 2 and in supplemental videos 1–4 (available at www.jneurosci.org as supplemental material). Target positions were randomly drawn from a uniform distribution over the workspace. The center of a 19 inch computer monitor was ∼59 cm away from the participant's head and the workspace subtended a visual angle of 33.9° (36 cm) and 27.6° (29 cm) for the horizontal and vertical dimensions, respectively. Participants were seated in their wheelchair for all sessions, with their arms positioned on the arm rests of his chair (S1) or on her lap (S3). In each session, the tracking of the cursor was organized in eight or more blocks that lasted 1 min each.
These pursuit tracking blocks had been designed for the purpose of building filters to be used in on-line decoding to drive a neurally controlled cursor in the subsequent center-out and assistive device control tasks. We inferred the participants' intended hand movements on the basis of instructed actions, because measurement of kinematics of overt behavior was not possible. The filter building procedure has been described previously (Serruya et al., 2002; Hochberg et al., 2006a). Briefly, for each session, single and multiunit data were used to create a linear filter decoder that related M1 spiking to position to generate a two-dimensional position output signal. In the beginning of the session, four 1-min-long tracking blocks were presented with only the TC and successively appearing targets visible on the screen. An initial filter was built with the neural activity collected during these four blocks. This filter was used to generate a feedback cursor (FC) about the decoded position. Next, four more blocks were generated, where the target and both the TC and FC were shown on the screen. The purpose of the FC was to familiarize the participants with the presence of a neurally controlled cursor on the screen. During this filter building epoch, participants were instructed not to attempt to correct for errors between tracked and feedback cursor positions. The filter was updated at the end of each block. To create the final filter to be used in the on-line center-out task, only the neural and position data from the last four blocks were used. The position linear filters were constructed from a response matrix containing the firing rate over a 1.4 s history for each neuron (28 50 ms bins), using the pseudoinverse to solve the least-squares regression problem. In this study we focus on the off-line analysis of these last four pursuit tracking blocks. Also, off-line decoding analyses used different decoding algorithms (see below).
Neural cursor center-out with on-line decoding closed-loop control.
A center-out task was performed after a linear filter decoder had been built during the pursuit tracking task performed earlier on the same day. In each session, each participant completed 80 trials of a four-direction center-out task: the participant was instructed to imagine moving a circle-shaped cursor displayed on the screen to one of four peripheral targets, positioned at 0, 90, 180, and 270°, subtending a visual angle of 4.8° (5 cm) from the center of the monitor; the distance from screen center to target center was 9.5 cm (9.2°). The position of this cursor was obtained by regressing spiking activity onto screen position via the linear filter built in the pursuit tracking task. This on-line neurally controlled cursor is henceforth referred to as the neural cursor (NC). The NC subtended a visual angle of 2.43° (2.5 cm diameter). A trial began after the participant held the NC over the center target for 500 ms. At the start of a trial, one of four peripheral targets appeared on the screen. Participants were instructed to move the NC to the target location and to hold it at that location for 500 ms. A trial terminated after 7 s. Trials were considered successful if the target was acquired in 7 s or less; trials were considered failures if the target was not acquired in 7 s. Regardless of whether or not the target was successfully acquired, because the NC was under constant neural control, the participant needed to return the cursor to the center target (and dwell there for 500 ms) before the next trial would begin. In each session, 20 trials were collected for each of the 4 pseudo-randomly presented radial targets. Videos showing examples of these center-out trials are available in the supplemental material (available at www.jneurosci.org).
Data analysis 1: off-line analyses of tuning functions
Modeling of spiking activity as a function of kinematic covariates during pursuit tracking.
We represented the spiking activity (recorded spike times) of each sorted unit as a discrete time neural point process (Truccolo et al., 2005), i.e., as a binary sequence (spike train) obtained by determining whether a spike occurred or not in each consecutive time bin of width Δt = 1 ms. The value of this sequence at a particular time t is here denoted by ΔNt ∈ [0,1]. To investigate the relationship between a neuron's spiking activity and specific kinematic covariates, we modeled, in the log domain, the instantaneous spiking rate as a function of the covariates. Four models were examined: position, velocity, direction, and speed models. For the model of spiking activity as a function of TC position (position model), we used the following: where λtc is the instantaneous spiking rate function (in spikes/s) at time t for the cth neuron (out of C recorded neurons), x and y are the horizontal and vertical positions, respectively, τ is a time lag, and [μ, β1, β2] are model parameters to be estimated. A related position model in three dimensions was presented by Kettner et al. (1988). The chosen form of the model of spiking activity as a function of TC velocity (velocity model) was a variation of the model proposed previously by Moran and Schwartz (1999): where ẋt = [ẋt, ẏt]T is the velocity vector, θt is the movement direction of TC and θ0 is the preferred movement direction (PD) of the neuron. Although the same letters μ and β appear in different models, they relate to different parameter values depending on the model. This particular velocity model has been successfully used before in approximations of M1velocity tuning functions in monkeys performing pursuit tracking and center out tasks with their hand (Truccolo et al., 2005). Also, our model is closer in form to the exponential form based on the von Mises distribution, which has been shown to capture well the tuning to direction in spiking activity in monkeys performing center-out tasks (Amirikian and Georgopoulos, 2000). To separately investigate M1 tuning to direction and speed of the TC, we also considered the direction model: and speed model A unit was considered tuned to a covariate whenever the following two conditions were satisfied (McCullagh and Nelder, 1989): (1) at least one of the coefficients in the model was statistically significant (Wald confidence intervals, p value < 0.05), and (2) a significant reduction in deviance was obtained from a kinematic model when compared with a model including only the estimated mean spiking rate (χ2 test, p value < 0.05). The deviance was computed under the Poisson distribution. (For intuition purposes, one can think of the deviance under the Gaussian distribution; in this case the deviance is simply the residual sum of squares.) Optimal time lags τ were determined by finding the time lag that maximized the log-likelihood function of the point process. All of the above models were fitted using standard functions for generalized linear models in Matlab (MathWorks, Natick, MA). More details about the motivation and maximum likelihood fitting of these nonlinear neural point process models can be found in the study by Truccolo et al. (2005).
Assessment of changes in preferred direction across experimental sessions.
We examined whether the preferred direction of a tuned unit during the pursuit tracking task changed significantly from one session to the other. A bootstrap resampling procedure was used to assess the statistical significance of preferred direction changes (Chestek et al., 2007). In brief, the bootstrap procedure consisted of the following steps. First, for a chosen unit and session, a distribution of preferred directions was generated by bootstrap sampling with replacement of the observed unit's spiking activity and the related velocity covariate at the previously estimated optimal time lag. One thousand bootstrap samples were used. For each of these bootstrap samples, a velocity model (Eq. 2) was fitted and the corresponding preferred direction was obtained as tan−1(β2/β1) resolved to the proper quadrant. Second, once a distribution of PDs for a particular unit and session was generated, the circular mean (Zar, 1999) of the distribution was subtracted from each of the sampled PDs. Third, for a chosen unit, we sampled one PD from the session 1 zero mean distribution and another PD from the session 2 zero mean distribution, and computed their absolute angle difference. By repeating this procedure (1000 samples) the distribution of changes in PD corresponding to the null hypothesis (no change in PD) was obtained. This distribution was used to compute the probability that the actually observed change in PD was statistically zero. Units whose PD difference had a p value < 0.05 [false discovery rate (FDR) corrected for multiple comparisons] (Benjamini and Hochberg, 1995; Benjamini and Yekutieli, 2001) were considered to have a significant change in PD between the two sessions. To attenuate the effect of the spiking activity's temporal dependencies in the bootstrap sampling, we dropped all of the spike train samples that occurred within the spike train autocorrelation length.
Assessment of the power of a model to predict spiking activity during pursuit tracking.
We assessed how much of a neuron's spiking activity variation was explained by the above described models. Here, this assessment is performed by measuring the power of a model to predict the occurrence or not of single spikes. A measure of predictive power can be derived from receiver operating characteristic (ROC) curve analysis (Fawcett, 2006). ROC curves were generated as follows. A spike is predicted to happen at a particular time t whenever the spike probability conditioned on a particular model, in other words: is greater than a specified threshold. A ROC curve can then be generated from the true positive (hit probability) and false positive rates of predicted spikes obtained at different thresholds values. The area under the ROC curve (AUC) gives the probability that, when two different samples (one containing a spike, ΔN = 1, and the other not, ΔN = 0) are randomly drawn from the data, the model will give a higher probability for the sample with a spike. A measure of predictive power can then be derived from the AUC as follows: with values ranging from 0 to 1. If a model does not improve spike prediction beyond chance level, the AUC will be close to 0.5 (i.e., chance level) and the predictive power will be zero as expected for a model without any predictive power. However, if the model always assigns a higher probability to a sample containing a spike, both the AUC and the predictive power will be one, again as expected for a model with maximum predictive power. In other words, there exists in this case a specific threshold such that the prediction would be perfect: the model would have then explained all of the variation of the spiking activity in the recorded data.
Relative importance of different kinematic covariates to M1 spiking during pursuit tracking: model independent assessment.
Additionally, a model independent comparison of the relative importance of a kinematic covariate to the prediction of single neuron spiking activity was attempted by computing the mutual information between the covariate and the spiking activity. However, given the small sample and low spiking rates, the variance of mutual information estimates was high. An alternative, less powerful but still informative measure based on linear second-order correlation was taken by computing the cross-correlation function between sequences of spike counts (50 ms time bins) and position, velocity, direction and speed at multiple time lags. Cross-correlation functions for position and velocity were computed separately for each coordinate. For direction, a circular-linear correlation measure was used (Zar, 1999). We report, for each neuron and covariate, the maximum absolute value of the cross-correlation function. Confidence intervals (95%) for the null hypothesis were obtained via random permutation tests that used phase randomization of the covariate time series (for details, see supplemental text C, available at www.jneurosci.org as supplemental material).
Modeling of spiking activity as a function of target direction during the neural cursor center-out task.
We examined the tuning of M1 neurons to target direction during the initial period after target onset. We used the same direction model (Eq. 3) with the exception that, because the target direction was constant during a trial, the spike trains were transformed into counts over a 400 ms time window. Confidence intervals for the direction tuning functions in the neural cursor center-out task were estimated via bootstrap resampling (5000 bootstrap samples).
Data analysis 2: off-line decoding of instantaneous velocity, position, difference vector direction and intended target
Off-line decoding of instantaneous velocity and position during pursuit tracking.
The observed spike trains from of all of the tuned neurons together with the corresponding velocity or position models (fitted to training data sets) were used to decode off-line the velocity or position of the TC cursor in test data sets. We adopted a state space formulation to decode the TC velocity from the spike trains in the recorded neural ensemble (Eden et al., 2004a; Truccolo et al., 2005). Following Bayes' rule, the posterior probability distribution for velocity at time t + τ can be expressed as follows: where ΔN0:t = {ΔN0:t1,ΔN0:t2,…,ΔN0:tC} corresponds to the collection of spike trains from C observed neurons during an interval from time 0 up to time t. A stochastic state space point process filter can be derived as a recursive decoding algorithm (see supplemental text A, available at www.jneurosci.org as supplemental material). In this algorithm, the decoded velocity is taken to be the estimated mode of the posterior distribution (Eq. 7). The initial value for the decoded velocity at time t = 0 was sampled from a zero mean Gaussian distribution with variance equal to the variance of the observed TC velocity. Note that the true observed velocities never entered the recursive algorithm; the input to the decoding algorithm consisted only of the observed spike trains. For visualization purposes, we plotted the decoded velocity in polar coordinates, i.e., decoded direction and speed, in the Results section. The position model (Eq. 1) and this stochastic state point process filter were used for off-line position decoding.
Correlation coefficients between true and decoded covariate values, as well as specific decoding examples (see Fig. 4), were computed on test data sets. Training and test data sets were obtained as follows. Each 1 min block was selected once to be the testing data set whereas the other remaining three blocks constituted the training data set. For example, if the first three tracking blocks provided 3 min of training data to build the neural encoding models, the test data consisted of the fourth pursuit tracking block, and so on until each block had been selected once for test data. The reported correlation coefficients correspond to the average of the correlation values obtained for each test data set separately.
Off-line decoding of the direction of the instantaneous difference vector during the neural cursor center-out task.
The same direction model (Eq. 3) and stochastic state point process filter described above for velocity were used to decode the direction of the instantaneous difference vector between the NC position vector and the target position vector during the neural cursor center-out task. To avoid confusion with the TC direction denoted above by θt, the direction of the difference vector will be denoted by φt. For details regarding the definition of the difference vector and its direction, see Figure 10A.
Off-line decoding of target direction during neural cursor center-out task.
We were interested in the ability to decode (predict) the location of a newly appearing target in the right, left, down, or up radial position, from the ongoing spiking activity. A naive Bayes classifier was used to decode the target direction. The conditional probabilities p(N1, N2,…, NC), where Nc is the spike count in a given time window for the cth neuron and θ the direction (0, 90, 180, or 270°) of the target in a given center-out presentation, were estimated nonparametrically. The representation of the spiking activity in terms of spike counts was chosen for simplicity given that the covariate of interest, target direction, was constant during the trial. Leave-one-out cross-validation was used to assess the classification performance as function of window length and time with respect to target onset. As an example using a 100 ms long window, we computed the classification performance based on spike counts observed in this time window at several different times with respect to target onset, starting at time 0 and moving forward in steps of 25 ms. In other words, for time 0 ms the classification was based on spiking counts observed in the interval [−100 ms, 0 ms], for time 25 ms the classification was based on [−75 ms, 25 ms], and so on. The length of the explored windows was varied from 100 to 700 ms in steps of 100 ms.
Results
A total of 29 sorted units (S1) and 121 sorted units (S3) per recording sessions were included in the analyses presented here, resulting in 58 (S1) and 242 (S3) neuron recordings. Figure 1 shows the average extra-cellular recorded action potentials for the sorted neurons from each participant. The mean of the peak-to-peak amplitudes of average waveforms across neurons was 73±66 μV (mean ±2 SD) for S1 and 173 ± 312 μV for S3. Larger waveform amplitudes were observed in S3, perhaps reflecting closer proximity of electrode tips to larger neurons in deep layers given that longer electrodes were used in this participant. In a few channels, two or three units were separated.
M1 spiking activity and intended kinematics during pursuit tracking task
Before investigating relationships between neural spiking and kinematics, we briefly describe the main statistical properties of the trajectories of the TC. Figure 2A shows examples of paths of the TC during one of the 1 min tracking blocks. These paths were generated by randomly placing, in sequence, a new target on the screen. Once the target was hit by the TC, the target disappeared and a new one appeared on the screen (see Materials and Methods). The distribution of movement direction and speed of the TC in the four 1 min tracking blocks used to fit velocity models are shown in Figure 2B. Motion of the TC in S1 sessions was generally faster than in S3 sessions. Figure 2C shows that both position and velocity had temporal correlations spanning ±5 s, and tended to show negative correlations at time lags that differed for S1 and S3. Further, position and velocity in the same coordinate were also positively and negatively correlated depending on the time lag. Although the pursuit task was designed to minimize all of these temporal correlations in and among kinematic covariates, the observed correlations are likely to have resulted from edge effects imposed by the workspace and properties of natural motion (the TC was controlled by a technician).
Tuning to velocity during pursuit tracking
M1 neurons were tuned to TC velocity in both participants. We examined the relationship between spiking activity and past, simultaneous and future velocities of the TC (Fig. 3A). The fact that the TC motion was relatively slow and that the TC's target was present on the screen during the entire TC motion could have induced both delay and anticipation effects (see below). For this reason, we fitted the velocity model (Eq. 2) for a wide range of different time lags: τ ∈ [−4, −3.9,…, 0,…, 3.9, 4 s]. For each neuron, we found the optimal time lag (OTL), i.e., the time lag at which the log-likelihood under the velocity model achieved its maximum value, and checked whether the model satisfied the two significance criteria (see Materials and Methods). A neuron was considered tuned to velocity if both significance criteria were met. For further analyses, we used only the velocity tuning functions at the optimal time lag. An example of optimal time lag analysis based on the log-likelihood function is shown in Figure 3B. The distributions for the optimal time lag over the population of tuned neurons are shown in Figure 3, C and D. In S1, optimal time lags over the population were clustered ∼100–200 ms. This could be a reflection of anticipation effects, given the structure of the task: the participant knew the target position at the start of the TC's movement and this movement toward the target was almost rectilinear. Optimal time lags for neurons in S3 were bimodally distributed over the interval, with clustering around positive and negative time lags (see Discussion). A summary of preferred direction and tuning depth at the OTLs over all of the tuned neurons is shown in Figure 3, G and H. The tuning depth was defined as the total rate modulation in the tuning function accounted for by the covariate, in this case velocity. Of the studied neurons across both sessions, 90% and 95% were tuned to velocity in S1 and S3, respectively. The velocity model can be seen as the introduction of speed effects to the simpler direction model (Eq. 3). For this reason we assessed whether the reduction in deviance obtained by the velocity model with respect to the direction model was statistically significant (second criterion for significance) (see Materials and Methods). For all of the recorded units, this reduction in deviance was statistically significant (χ2 test, p < 0.05), indicating that the speed component in the velocity model added to the explanation of spiking activity. A similar analysis comparing the direction and the speed models (Eq. 4) revealed also that, for all of the tuned neurons, the direction model achieved a larger reduction in deviance (χ2 test, p < 0.05), suggesting that the direction model explained more of the spiking activity than the speed model.
Additional evidence supporting velocity tuning in this pursuit tracking task was provided by the off-line neural decoding of TC velocity in cross-validated blocks. We set the time lag of the velocity covariate for all of the neurons to the same value given by the mean of the optimal time lag over the population. Obviously, this choice of a single time lag might not have been optimal, but it was taken, nevertheless, for simplifying the computations. Correlation coefficients between true and decoded velocity computed over all of the four 1 min block test datasets (see Materials and Methods) were relatively small (0.28 and 0.35 in S1, 0.37 and 0.35 in S3, for velocities in x and y coordinates, respectively). Despite these relatively small correlations, reasonably accurate off-line decoding of the direction of the velocity vector was achieved (Fig. 4A,C). Figure 4 shows decoding examples for both participants.
We also examined whether the preferred direction of a tuned unit changed significantly from one session to the other. The two recording sessions for each participant were separated by four and one day(s) in S1 and S3, respectively. As in Chestek et al. (2007), we visually inspected waveforms and interspike distributions to determine whether a recorded unit in a given electrode seemed similar or not in both sessions. Only the recorded units that were tuned to direction in both sessions were included in the analysis. In the case that more than one unit was sorted from a single electrode, only the largest unit entered the analysis. A bootstrap sampling procedure was used to assess the statistical significance of changes in preferred directions (see Materials and Methods). In S1, 5 of 12 examined units (41%) showed a significant change in PD (p < 0.05, FDR correct for multiple comparisons) (Benjamini and Hochberg, 1995). Of these five units, changes of 38.5, 41.5, 59.6, 114.0 and 130.9° were observed. In S3, 33 of 76 (43%) units showed a significant change in PD. The changes in PD ranged from 30.9 to 176.2°, with a median of 125°. Possible interpretations of these results are given in the Discussion.
Tuning to position during pursuit tracking
Our previous work (Hochberg et al., 2006a) demonstrated the availability of position information in the recorded neuronal ensembles in S1, by showing that he was able to successfully control a computer cursor and other devices via a position linear filter decoder. Here, we report more extensive analyses for him and an additional participant (S3) with a different neurological condition (see Materials and Methods). As was done for the analysis of velocity tuning, we fitted the position model (Eq. 1) for a range of different time lags τ ∈ [−4, −3.9,…, 0,…, 3.9, 4 s], and we report analyses based on the position tuning function at the optimal time lag. Figure 5A shows an example of an empirically estimated mean spiking rate conditioned on the TC position, and the corresponding fitted model is shown in Figure 5B. Figure 5, C and E, shows the distribution of PD in the position space for the population of tuned units. This position PD corresponds to the direction, in the two-dimensional position space, in which the spiking rate changes the most with changes in position. The distribution of optimal time lags for both participants is shown in Figure 5, D and F. Optimal time lags for S1 and S3 tended to cluster around τ = 1 s, with wider spread for S3. Summaries for both participants include tuned neurons from both sessions 1 and 2. Of the studied units, 86% and 84% were tuned to position in S1 and S3, respectively. Position decoding has been previously examined (Hochberg et al., 2006a); here we add that off-line position decoding using the point process filter algorithm (see Materials and Methods) (supplemental text A, available at www.jneurosci.org as supplemental material) yielded correlation coefficients between true (TC) and decoded position of 0.41 and 0.29 for S1, and 0.42 and 0.45 for S3 in x and y coordinates, respectively.
Comparison between velocity and position models
Because differences in decoding performance can also be a reflection of the nature and assumptions in the decoding algorithm itself, we used a different approach to compare position and velocity models. We compared the velocity and position models in terms of their power to predict the occurrence of single spikes (1 ms time resolution). As stated in the Materials and Methods section, the predictive power (Eq. 6) is a value between 0 (minimum predictive power) and 1 (maximum predictive power) and it is derived from the area under the ROC curve. A predictive power of 1 would mean that the model explained all of the variation in the spike train.
Position and velocity models were fitted to data from the four 1 min pursuit tracking blocks. ROC curves were computed on the same data. Because the number of parameters in the velocity and position model was the same and this number was very small compared with the number of samples, data overfitting was not an issue and a cross-validation scheme was not necessary. Figure 6A provides an example of ROC curve analysis for a single neuron. Summaries of the predictive power of velocity and position over the population of recorded neurons from both participants are shown in Figure 6, B and C. Differences between the predictive power of position and velocity were minor and did not motivate further detailed statistical testing.
We also performed a model-independent comparison of the strength of the relation between different kinematic covariates (position, velocity, direction, and speed) and M1 spiking activity. Low spike rates and small sample data prevented us from applying standard mutual information analyses. Instead, we adopted a less powerful but still informative measure: cross-correlation functions between the 1 min long spike trains (transformed into 50 ms counts) and the corresponding time series of the kinematic covariate. These functions were computed at multiple lags and averaged across the four pursuit tracking blocks. Cross-correlations for position and velocity were computed separately for x and y coordinates, and the circular–linear correlation (Zar, 1999) was computed for the TC movement direction. Neurons whose cross-correlation function values (at any time lag) achieved statistical significance were included in the analyses. 95% confidence intervals were obtained from distributions generated via phase randomization tests (see Materials and Methods) (supplemental text C, available at www.jneurosci.org as supplemental material). Figure 7 summarizes this analysis for both participants. A sign test performed separately for each session and participant revealed no mean differences between correlation values for position and velocity in each coordinate over the recorded neuron population (p value > 0.05). Highly significant differences were obtained, however, when comparing direction and speed (sign test, p value < 0.0001) in each session and participant. Correlation values were higher for direction in 93 and 96% of the neurons for S1 (sessions 1 and 2, respectively), and 100% and 96% for S3. In summary, both predictive power and model-independent comparisons did not reveal major differences in the relative importance of velocity and position for explanation of M1 spiking activity in this task. Significant differences were detected only for the relative importance of direction and speed.
Relation of M1 neural activity to intended kinematics in a neural cursor center-out task
As shown above, during the imagined pursuit tracking task, M1 neural activity in both participants with tetraplegia was tuned to both TC position and velocity. We were also interested in further investigating neuronal spiking activity when the participant was using a neurally controlled cursor to acquire a series of targets.
Spiking activity during a neural cursor control task
We start by investigating the time course of spiking activities and the properties of their tuning functions to intended target direction during the neural cursor center-out task. Target onset was preceded by a holding phase in which the participant kept the NC at the center of the workspace for at least 500 ms. When this hold period was satisfied, a target appeared at one of four radial positions (right, up, left, and down) and the participant proceeded, following prior instruction, to move the NC toward the target. Most of the analyses described below focus on the initial period [0, 1.5 s] after target onset. Neurons recorded in both participants showed clear modulation of their spiking rates after target onset. Figure 8 shows examples of perievent time histograms (PETHs), centered at target appearance (time 0). Modulation in the PETH tended to occur around 300 ms for S1. In S3, the modulation onset showed a broader range. In some cases, the temporal profile of spike rate modulation varied according to the target.
M1 tuning to intended movement direction during neural cursor center-out task
We studied the above described dependence of spiking rates on different targets in terms of tuning to target direction. It is not obviously expected that M1 spiking activity should be related to intended movement direction in this center-out task. That is because the participants used a position based control interface to perform this task (for more details, see Materials and Methods): the position of the neural cursor was given by the output of a position filter which explicitly extracted position, not velocity, information from the recorded M1 neuronal ensemble via a linear mapping. As exemplified in Figure 9, A and C, tuning to target direction was well captured by the direction model (Eq. 3, Materials and Methods). The percentage of neurons that showed statistically significant tuning to target direction in specified time windows after target onset was 66 and 64% for S1 and S3, respectively. Figure 9, B and D, summarizes the preferred directions in the tuned population.
Because there were only four radial targets, intended target (possibly also location) and intended direction could be confounded during the initial phase (0–1.5 s) of “reaching.” Therefore, the detected direction tuning could actually be a reflection of tuning to intended target identity or position. For this reason, we extended our analyses of direction tuning to later phases of the center-out reaching. As reported previously, the on-line position linear filter decoder used to animate the neural cursor was far from optimal. Although the participants were able to navigate the cursor from the center to the target and back, often in a few seconds, the cursor motion commonly strayed from a straight or a simple curvilinear path linking the center and the target. As a result, there were many more than just four directions in which to move the neural cursor toward the target during the reaching phase. This provided us, therefore, with a much wider range of intended directions for the examination of M1 direction tuning. This examination was performed as follows.
Given target and instantaneous NC position vectors during a center-out trial, the direction of the difference vector between these two vectors can provide a reasonable approximation to the intended movement direction in which to move the NC (for a schematic description of these vectors, see Fig. 10A). In this scenario, the direction of the difference vector would then approximate the direction of the intended velocity vector at each moment in time during the center-out task. We fitted direction models (Eq. 3, with the covariate now representing the direction of the difference vector) to the neural point process data (1 ms time bins) and used these models together with the stochastic state point process filter (see Materials and Methods) to decode the direction of the difference vector. The instantaneous angle error (i.e., angle difference between true and decoded direction of the difference vector) obtained by using this decoding algorithm was relatively small. It can be seen in Figure 10B that the distribution of the angle error concentrated below 45°, demonstrating that decoding of this direction clearly departed from chance levels. Correlation coefficients (circular correlation) (Zar 1999) between the true and decoded difference vector's direction were r = 0.21 and 0.4 for S1 and S3, respectively. An assessment of the power of this covariate to predict spike activity is provided in Figure 10C, where we have used the same ROC approach as done in the analysis of velocity and position during the pursuit tracking task. Both the decoding and predictive power analyses indicated, therefore, that information about the direction of the difference vector, or intended movement direction, was available in the M1 spiking activity during the neural cursor center-out task. Issues regarding neural adaptation, including changes in direction tuning properties attributable to the type of the task (e.g., pursuit tracking vs neural cursor center-out), are addressed in the Discussion.
Off-line decoding of intended target during neural cursor center-out task
We examined how well intended target could be decoded from M1 activity during the initial phases of the reaching center-out trials. We assessed the decoding performance of intended target during the interval [0,1.5 s] of the center-out task for trials that lasted at least 2 s. Figure 11, A and B, shows the distributions of the times taken by the participants to move the neural cursor from the center of the screen to the radial target and to hold at the target for 0.5 s. By limiting our off-line analysis to the first 1.5 s, we ensured that the neural spikes entering the decoding analyses had occurred before target acquisition in all of the examined trials. Figure 11, C and D, shows the temporal evolution of the classification performance based on spike counts in a given moving time window (see Materials and Methods). A naive Bayes classifier was used for classification. Models were fitted on training data and target classification was performed on test data using a leave-one-out cross-validation scheme. In other words, a trial was selected as the test trial, and all of the remaining others were used to fit the model. We varied the window lengths from 100 to 700 ms, in 100 ms steps. For each time and window length, the conditional probability of a neuron's spike count in the window was estimated nonparametrically. In S1, classification performance departed from chance classification (0.25) at ∼200 ms after target onset. In S3, this departure happened later at ∼400 ms. Maximum performance levels and peak time depended on the length of the time window used for the spike counts. The maximum correct classification performance and performance peak time increased monotonically with the window length (Fig. 11E–H). For each participant, the time course in classification performance was about the same in the two analyzed sessions.
Discussion
In two humans, M1 tuning to position and velocity of intended movement shared many of the features previously observed in able-bodied nonhuman primates. In particular, we have demonstrated that M1 neural spiking in these two humans is strongly tuned to intended movement direction in both pursuit tracking and point-to-point (center-out) ‘reaching’ tasks. It is notable that these features were observed years after severe damage to descending motor pathways alone (S3) or combined damage to both descending and ascending pathways (S1), although it cannot be ruled out that these features emerged as a result of the neurologic injury or adaptation to specific task constraints (e.g., “massless” cursor and pure kinematics control) in the given experimental tasks. Although the analyses do not point to a particular mechanism underlying the nature of M1 representations and their role in motor control in the intact nervous system, the existence of these tuning features in humans with tetraplegia has implications for both models of motor control and for the development of neuroprostheses for persons with paralysis or limb loss. Aspects of these results warrant further elaboration.
Tuning to kinematics during pursuit tracking
The ability to decode movement direction continuously during 1 min long pursuit tracking blocks was notable: in contrast to previous studies in able-bodied monkeys performing pursuit tracking with actual hand movements (Paninski et al., 2004; Truccolo et al., 2005), here the participants were unable to move their hands or arms. Although we propose that the underlying source of tracking-related velocity signals in M1 was an ‘imagined’ action or intention to move, a few alternative explanations should be also considered. First, although the two participants sustained damage to descending motor pathways, voluntary motion of the head, neck and eyes remained. These motions could have accompanied the imagined tracking. Our previous analysis (Hochberg et al. 2006a) showed, however, that head motion and trajectories of the neural cursor were poorly correlated, indicating that this remaining motion played a minor role, if any. Furthermore, open loop experiments where the participant (S1) was asked to simply imagine different arm and hand movements yielded diverse and imagined-movement-specific modulation in M1 neuronal activity that could not have resulted from visuomotor cues [Hochberg et al. (2006a), their Fig. 3 and supplemental Fig. 1]. Future studies involving participants with advanced ALS (i.e., persons with almost no remaining head, trunk, or appendicular movements) (Hochberg et al., 2006b) may help to answer this question directly. Other alternative explanations would be that signals about the direction of the tracked cursor could have been made available in M1 via indirect visual inputs originating, for example, from area MT, or that M1 tuning reflected gaze direction effects (Baker et al., 1999). Additional experimental controls, beyond those adopted under the priorities and constraints of this pilot clinical trial, would have been required to sort out these alternative explanations. Although the precise origin of these kinematic signals is unclear, our findings have nevertheless demonstrated their robust persistence in M1 neuronal spiking activities in humans with tetraplegia.
Optimal time lags for the tracked cursor velocity clustered around 400 ms for S1, suggesting anticipatory effects. These effects might have easily taken place in this task because the target was present at all times during cursor motion, and that the cursor's motion toward the target was almost rectilinear. Similar clustering was observed for position in S1. A different pattern was seen in S3 for whom velocity and position optimal time lags were more spread out and a bimodal distribution with clusters around −1 and 2 s was observed for velocity time lags. For comparison, note that in able-bodied monkeys actually performing a hand tracking task, optimal time lags for hand velocity tend to cluster around 100 ms, as expected for neurons controlling the actual hand movement, whereas for hand position, time lags tend to distribute more uniformly in a broad interval from (−1,2) s (Paninski et al., 2004) (Fig. 11D,E). It should be also noted that the observed distribution of optimal time lags might reflect more the properties of the tasks, different cognitive strategies used by the participants, or their neurological condition and associated compensations, than general M1 functional properties.
The direction of the cursor's velocity vector during the tracking task contributed more to the explanation of neuronal spiking activity than did speed. A similar observation has been made in M1 spiking activity in monkeys manually executing pursuit tracking (Truccolo et al., 2005) and reaching tasks (Moran and Schwartz, 1999). It should be noted, however, that this weaker contribution of speed in the human data reported here could also reflect the small range of speeds of the tracked cursor in the pursuit tracking task. Our analyses suggest also a balanced relative contribution of position and velocity to the explanation of M1 spiking activity. However, because these analyses relied on model based (ROC) and linear correlation measures, the issue of the relative explanatory or predictive power of these two covariates needs further exploration.
Tuning to intended movement direction and target during neural cursor center-out task
Because there were only four radial targets, their respective identity and direction were confounded at the beginning of each target presentation. Therefore, the observed tuning to direction (Fig. 9) could have resulted from tuning to the intended target, or conversely, the classification of intended target (Fig. 11) could have been based on intended direction signals. Future experiments that prevent such confounding (e.g., by using two layers of radial targets, where a straight line between the center and outer target would intercept an inner target) will be required to fully resolve this ambiguity. Nevertheless, the decoding analysis of the difference vector's direction provided additional evidence for the existence of information about intended movement direction in M1 spiking during this (position control) neural cursor center-out task. The direction of the difference vector, computed throughout the center-out trials, was largely independent of target identity or target position in our data: the neural cursor did not follow, in general, a straight path from the center to the target; the paths were usually very wiggly. In this way, the difference vector's direction provided an approximation to the intended movement direction at any instant during the center-out task. (We say an approximation because the intended direction might have also depended on other factors such as velocity and acceleration of the neural cursor.) We also note that tuning to intended direction should not be expected a priori in this task, because participants controlled the neural cursor via an on-line linear position filter. This filter required explicit information about intended position, not direction or velocity, from M1 spiking activity. The fact that many successful trials lasted longer than the length of the on-line position linear filter (i.e., 1.4 s) argues against a filter that was actually outputting position by integrating velocity information available in the neuronal ensemble activity.
Differences between participants
Although most of our results regarding tuning to kinematics were qualitatively similar between the two participants, there were a few important differences. As mentioned, the distribution of optimal time lags contrasted between the two participants. Additionally, off-line neural decoding of direction during pursuit tracking and of the intended target during the center-out task was better in S1 than in S3. Also, S3 required longer times for target acquisition using the on-line position linear filter decoder. This occurred although this participant's recordings had many more significantly tuned neurons than S1. We can only speculate about a series of factors that may have contributed to these differences: recording from different cortical layers (electrodes were 1.0 mm long in S1 and 1.5 mm in S3), possible differences in the cognitive and control strategies adopted by the participants when handling the position linear filter interface and their different medical conditions, i.e., spinal cord transection in S1 with complete deafference and defference of the limb, and pontine stroke with intact sensory pathways and intermittent bilateral upper extremity flexor spasms in S3.
Stability of recorded neuronal signals: changes in recorded neuronal populations and changes in tuning properties
Neurons were recorded stably across single sessions: tracking spike waveforms continuously across a session (1–2 h) revealed that there was no observed change in the number or identity of recorded neurons. However, because recordings occurred for only a few hours each day, and thus continuous tracking of waveforms across sessions could not be performed, we do not know whether waveforms discriminated from same electrode across days were from the same or a different neuron. Additionally, we also observed that, in some cases, a unit well isolated in one session could not be discriminated in the other session. (This was one of the motivations for building new decoding filters at the beginning of each session.) We believe that small vertical motions of the microelectrode array, on the order of a few tens of micrometers with respect to the cortical tissue, could be a source of these fluctuations across sessions. Further, there has been interest in assessing the stability of single neuron tuning properties over hours or days. Several previous studies have reported changes in these properties and related them to learning (Li et al., 2001; Padoa-Schioppa et al., 2004) or to background intrinsic fluctuations (drift) of tuning functions on a hypothesized solution manifold (Rokni et al. 2007). Others have found little or no significant variability in tuning (Chestek et al., 2007). We attempted a simple assessment of the stability of tuning properties in our data by examining changes in preferred direction across the two experimental sessions. Our results seem to indicate that whereas ∼60% of the examined units showed no significant change in preferred direction, ∼40% of the examined units in both participants showed a statistically significant change. These results should be interpreted with caution, however. First, our identification of unit identity across different sessions was based on visual inspection of spike waveforms and interspike distributions. A more rigorous identification would require, as mentioned above, the tracking of waveforms continuously over the time period in between sessions (Santhanam et al., 2007). We hope to systematically investigate tuning function stability and the effects of learning with more controlled tasks and larger data sets in future studies. Second, it should also be noted that nonstationarity of tuning properties, as well as changes in the recorded neuronal population, can be properly handled in neuroprosthetic applications via intermittent recalibration of decoding filters (e.g., as performed here with filter rebuilding at the beginning of a session) or via continuous tracking of the neuronal population and their tuning properties (Eden et al., 2004a,b; Srinivasan et al., 2007). In addition, as pointed out by Rokni et al. (2007), one can also conjecture that an optimal solution manifold might similarly emerge for the neuronal population recorded at the prosthetic interface. If this conjecture is correct, spontaneous background drift of tuning functions on this optimal solution manifold would not significantly affect decoded motor output.
Implications for neural interface systems
The results from these off-line encoding-decoding analyses suggest that intended velocity could be used as a control signal for direct corticomotor prosthetic devices by persons with paralysis. A preliminary evaluation in one participant showed that closed-loop velocity based control resulted in significantly improved performance compared with position based control (Kim et al., 2007). Additionally, the fact that both intended position and velocity signals are available in M1 activity in these two humans with tetraplegia suggests that the combined use of these two variables in decoding algorithms (Kemere et al., 2004; Srinivasan et al., 2006, 2007; Wang et al., 2007; Yu et al., 2007) may enhance performance.
Footnotes
-
This work was supported in part by the Office of Research and Development, Rehabilitation Research and Development Service, Department of Veterans Affairs, and National Institutes of Health–National Institute of Neurological Disorders and Stroke Grants 1K01NS057389, NS-25074 (Javits Award), R01NS369744, and ONR N0014-06-0185. We thank our trial participants for their dedication to this research. We also thank Abe Caplan, Misha Serruya, Maryam Saleh, and other employees of Cyberkinetics Neurotechnology Systems for data collection, device engineering, manufacturing, and clinical trial design and management; Jon Mukand for his role as clinical investigator for participant 1; and John Simeral, Sung-Phil Kim, and Michael Black for discussions.
- Correspondence should be addressed to Wilson Truccolo, Department of Neuroscience and Brain Science Program, 60 Olive Street, Providence, RI 02912.Wilson_Truccolo{at}Brown.edu