Abstract
Persistent activity generated in the PFC during the delay period of working memory tasks represents information about stimuli held in memory and determines working memory performance. Alternative models of working memory, depending on the rhythmicity of discharges or exclusively on short-term synaptic plasticity, are inconsistent with the neurophysiological data.
Dual Perspectives Companion Paper:Working Memory: Delay Activity, Yes! Persistent Activity? Maybe Not, by Mikael Lundqvist, Pawel Herman, and Earl K. Miller
Introduction
Working memory (WM) is the ability to maintain and manipulate information in mind, over a time span of seconds, and is a core component of many other cognitive functions, including language, problem solving, reasoning, and abstract thought (Baddeley, 2012). Neurophysiological experiments in nonhuman primates identified neurons that not only respond to sensory stimuli but remain active during a period after a stimulus was no longer present; this “persistent activity” therefore provided a neural correlate of WM (Fuster and Alexander, 1971; Funahashi et al., 1989). Here, we define persistent activity as memorandum-selective activity of single neurons that spans the delay interval of WM tasks. Persistent activity is thought to be maintained through recurrent connections in a network of neurons, although a number of architectures can give rise to persistent activity (Chaudhuri and Fiete, 2016; Zylberberg and Strowbridge, 2017).
Persistent activity has not been observed in all vertebrate species, but it appears to be present in species with larger brains and more extensively interconnected neurons, such as primates. Neurophysiological studies of WM in rodents reveal neurons that remain active for short intervals each, “tiling” the delay period between a stimulus and a response (Bolkan et al., 2017; Runyan et al., 2017). In contrast, robust persistent activity that spans the entire delay period of WM tasks has only been described in primate studies (reviewed in detail here) and in human intracranial recordings (Kamiński et al., 2017; Kornblith et al., 2017; Haller et al., 2018). Nonetheless, the role of persistent activity has been reevaluated over the past few years (Sreenivasan et al., 2014; D'Esposito and Postle, 2015; Lara and Wallis, 2015), and some neurophysiological studies have failed to demonstrate persistent activity in the PFC in some short-term memory tasks. Accordingly, other models of WM not relying on persistent discharges, but rather on the rhythmicity of spiking (Lundqvist et al., 2016, 2018), or exclusively on short-term synaptic changes (Fiebig and Lansner, 2017) have been proposed. Still, there is important evidence to support persistent activity as a neural correlate of WM.
Here, we examine the role of persistent activity in WM, with an emphasis on the lateral PFC (referred to as “PFC” henceforth, for simplicity). The role of other areas has been discussed previously (Riley and Constantinidis, 2016; Leavitt et al., 2017a). We review multiple types of information encoded in persistent activity, how it subserves behavior in WM tasks, what evidence exists against this model, and the alternative models that have been proposed.
Information maintained in persistent activity
The task most extensively used to study visuospatial WM is the oculomotor delayed response task (ODR task; Fig. 1), which presents subjects with a brief stimulus and, after a delay period, requires an eye movement to the remembered location of the stimulus (Hikosaka and Wurtz, 1983; Funahashi et al., 1989). Neuronal activity recorded during execution of this task with eight targets presented at 10–15 degree eccentricity reveals that ∼30% of all neurons sampled in the region of the dlPFC in and around the posterior half of the principal sulcus exhibit persistent activity (Funahashi et al., 1989). Larger percentages, as high as 70%, of all prefrontal neurons exhibit delay period activity when tested with denser grids of stimuli, encompassing 16 locations (Leavitt et al., 2018). For many neurons, the persistent firing was not tied to the onset of the stimulus but started 300–400 ms after the visual cue presentation, which is longer than the mean of the visual response latency observed in the dlPFC. And the persistent firing often stopped abruptly around the start of the saccade. Furthermore, the majority of these neurons are selective for the spatial location of the 8 possible targets typically used in the ODR task, so that approximately one-fourth of all dorsolateral prefrontal neurons exhibit spatially tuned elevated persistent firing across the delay period. One can clearly discern persistent firing on single trials in Figure 1C, during the delay for the neuron's preferred direction compared with the low levels of firing before the cue, or other locations. Very similar percentages of spatially tuned neurons with persistent activity have been obtained from the dlPFC when tested with other spatial WM tasks, such as a spatial match/nonmatch task (Meyer et al., 2011). Proportions of active neurons vary, however, between the anterior/posterior and dorsal/ventral subdivisions (Qi and Constantinidis, 2013; Riley et al., 2017). This enrichment of neurons in caudal principal sulcal dlPFC likely results from the anatomical connectivity of this subregion and their relationship to task demands (i.e., this region receives highly processed visuospatial information from the posterior parietal association cortex and projects to the frontal eye fields to regulate the oculomotor response) (Selemon and Goldman-Rakic, 1988). In addition, the persistent firing depends on correct performance; it is diminished or absent in error trials (Fuster, 1973; Funahashi et al., 1989; Zhou et al., 2013). Thus, this pattern of persistent firing is not simply the result of increased excitability of a single neuron but, rather, the product of a circuit in an appropriate state of excitability for achieving some specific purpose.
Some researchers have argued that the persistent firing of PFC neurons is simply a motor preparation signal rather than information held in WM. For example, the location of the preceding stimulus in the ODR task is confounded with the preparation for the motor response, leading to suggestions that persistent activity represents motor preparation (Markowitz et al., 2015). However, more complex tasks reveal that the majority of prefrontal neurons represent the stimulus properties rather than motor preparation. For example, when a task requires monkeys to make an eye movement toward a location other than the location of the visual stimulus, the majority of prefrontal neurons represent the location of the preceding stimulus rather than the location of the impeding saccade (Funahashi et al., 1993; Takeda and Funahashi, 2002).
Persistent activity tuned for the location of a stimulus appears in the PFC even in tasks where the stimulus does not immediately allow planning of a movement. In the spatial delayed-match-to-sample task, subjects are required to release a lever or press a button when a stimulus appears at a previously cued location; in the match/nonmatch task, the monkeys have to saccade to a green or blue response target depending on whether two stimuli presented in sequence appeared at the same location or not. In such tasks, prefrontal neurons generate persistent activity following the presentation of the original stimulus that is tuned for its spatial location, and not the preparation of a motor response, the direction of which is not known until later in the trial (Qi et al., 2011; Goodwin et al., 2012; Masse et al., 2017).
Prefrontal neurons generate persistent discharges that represent other types of information, in addition to spatial location. Generally, smaller populations of prefrontal neurons are tuned for object attributes, such as geometric shape, color, or complex features (e.g., specific faces), than spatial location; a regional specialization is also present, with spatial information more prevalent in the dlPFC than the vlPFC (Meyer et al., 2011). Nonetheless, robust, stimulus-selective persistent activity has been observed during WM tasks requiring subjects to remember the identity and features of stimuli. Examples include stimuli defined by simple, geometric shapes differing in color or luminance (Hoshi et al., 1998; Constantinidis et al., 2001; Sakagami et al., 2001; Inoue and Mikami, 2006; Genovesio et al., 2009); complex images, such as real objects and faces, or abstract pictures (Wilson et al., 1993; Miller et al., 1996; Rao et al., 1997; O Scalaidhe et al., 1997, 1999; Rainer et al., 1998; Rainer and Miller, 2000; Freedman et al., 2001); and the direction of motion of a random-dot stimulus that is always presented at the same location (Zaksas and Pasternak, 2006; Mendoza-Halliday et al., 2014). Persistent activity has been demonstrated not just for visual stimuli. Vibratory stimuli produce robust persistent activity, whose firing rate varies as a function of the stimulus frequency (Romo et al., 1999; Romo and Salinas, 2003). Cross-modal representations between vibratory and auditory stimuli have also been described (Fuster et al., 2000; Vergara et al., 2016).
In recent years, it has been recognized that persistent activity in the PFC also represents information beyond the physical characteristics of stimuli. Activity may represent the abstract rules of the cognitive task subjects are required to perform (White and Wise, 1999; Wallis et al., 2001), categories (Freedman et al., 2001; Shima et al., 2007; Roy et al., 2014), and numerical quantities (Nieder et al., 2002). It may be also related to perceptual decisions (Kim and Shadlen, 1999), past choices (Barraclough et al., 2004), reward expectation (Leon and Shadlen, 1999), and sequences of events or actions (Averbeck et al., 2002; Inoue and Mikami, 2006; Sigala et al., 2008; Berdyyeva and Olson, 2010). Persistent activity of single neurons may represent more information than stimulus features and task variables individually (Rigotti et al., 2013), and also play an important role for reinforcement learning by maintaining signals related to the animal's previous experience (Seo et al., 2007; Curtis and Lee, 2010).
Working memory behavior supported by persistent activity
Studies of lesions in humans and nonhuman primates first provided evidence linking the PFC with performance of even simple tasks requiring WM (Jacobsen, 1936; Milner, 1963; Rossi et al., 2007; Buckley et al., 2009; Pasternak et al., 2015). Reversible inactivation of the PFC (e.g., through cooling) impairs spatial WM and diminishes persistent firing in interconnected parts of the spatial cognition circuit (Chafee and Goldman-Rakic, 2000). Persistent activity is not merely an epiphenomenon of spatial WM but predicts behavior in WM tasks. The most straightforward evidence that persistent activity controls behavior comes from analysis of error trials in the ODR task, which are characterized by lower levels of delay period activity compared with correct trials (Funahashi et al., 1989; Zhou et al., 2013). In other words, trials in which persistent activity is diminished are more likely to result in errors. A near linear relationship between behavioral performance and persistent activity has been revealed in other tasks that modulate parametrically the difficulty of a WM judgment (Constantinidis et al., 2001), and errors in WM tasks are also seen in aged monkeys when persistent firing is insufficient to reach across the entire delay epoch (Wang et al., 2011). Choice probability analysis, comparing the distributions of firing rates in the delay period of correct and error trials, also reveals a strong relationship between prefrontal persistent activity and behavioral outcomes (Mendoza-Halliday et al., 2014).
Computational models provide a detailed picture of the relationship between behavioral performance related to WM and persistent activity. Persistent activity can be sustained in such models by virtue of recurrent connections between neurons with similar tuning for stimulus properties, so that activation outlasting afferent input is maintained in the system (Compte et al., 2000; Murray et al., 2017a). Drifts in neuronal activity across the network of prefrontal neurons have been shown to predict precisely the relationship between firing rate and the endpoint of the saccade (the spatial location being recalled by the monkey) in the ODR task (Wimmer et al., 2014). For example, persistent activity recorded from trials in which monkeys make eye movements deviating clockwise versus counterclockwise relative to the true location of the stimulus yields slightly different tuning curves, as would be expected if the location recalled was determined by the peak of activity at the end of the delay period (Fig. 2). Other, counterintuitive model predictions are confirmed experimentally; for example, negative spike-count correlations for pairs of neurons with different tuning in trials when the stimulus appears between them; and highest variability in trials when the stimulus appears at the flanks of a neuron's receptive field. Importantly, these findings cannot be predicted by alternative models assuming a gradual decay of delay period activity, nor do they hold for neurons that do not exhibit persistent discharges, even though the latter are more numerous in the PFC (Wimmer et al., 2014).
Dynamics of persistent activity
Part of the confusion in the literature about the role of persistent activity can be attributed to some authors equating persistent activity with perfectly stationary activity during the delay interval. Even if a firing rate is temporally modulated during the delay period and is therefore nonstationary, information can be decoded from the neuron activity through maintenance using a stable readout. Indeed, time-varying PFC activity is commonly observed in WM tasks, especially those with fixed delay duration enabling temporal expectation. For instance, the majority of neurons encoding vibrotactile information in WM have been shown to exhibit ramping, increasing or decreasing, activity while maintaining the frequency of the stimulus in a parametric fashion (Romo et al., 1999; Romo and Salinas, 2003). Temporal modulation of delay activity of anticipatory nature has been also reported in some detail by Hussar and Pasternak (2010, 2012, 2013). As we describe below, time-varying delay activity, such as ramping, is not at odds with persistent activity underlying WM maintenance. We contrast nonstationary, persistent activity, however, with transient activity of individual neurons, being active at only a short period of time during the delay period.
Fundamentally, models of persistent activity describe properties of a population code, rather than an individual neuron; that is, the representation of the memorandum is encoded as a pattern of activations across a population of neurons. For instance, in the ODR task, canonical delay activity of PFC neurons exhibits nonmonotonic, bell-shaped tuning curves to the stimulus (visuospatial angle); therefore, stimulus identity cannot be unambiguously read out from a single neuron. The population code of neuronal spiking activity based on persistent activity has the following properties. First, the memorandum is represented at a given time point in the spatial pattern of spiking intensity across the relevant population. This property is in contrast to models of purely synaptic mechanisms, and of transient activity bursts, discussed below. Second, the memorandum is represented in this code stably across time during WM maintenance. This property is in contrast to models of dynamic coding, discussed below. Theoretical and empirical analyses have shown that stable population coding of WM is consistent with time-varying neuronal activity (Machens et al., 2010; Druckmann and Chklovskii, 2012; Murray et al., 2017b). Furthermore, persistent activity does not require an overall increase of activity in the population (Murray et al., 2017b), as the population exhibits both increases and decreases in firing rate across neurons. Finally, simultaneous modulations of firing rate and resulting trial-to-trial correlation between neurons may entrain the information represented by a population of prefrontal neurons (Leavitt et al., 2017b).
Arguments against persistent activity
Some neurophysiological experiments have reported findings that appear at odds with persistent activity: only transient representation of motion information in the delay period of a WM task (Zaksas and Pasternak, 2006), achieving continuous representation of stimulus properties only through averaging of activity through multiple neurons (Hussar and Pasternak, 2012, 2013); very few prefrontal neurons with pure color information (Lara and Wallis, 2014); or encoding of only the last of a sequence of stimuli that the subject successfully recalled in its entirety (Konecky et al., 2017). Failure to detect activity in the PFC when tested with a random stimulus, without characterization of the neuron's tuning function across the dimension being tested, is not a strong argument against the role of persistent activity as the neural correlate of WM. Across the population of prefrontal neurons, only a small minority would be expected to be active during maintenance of any single stimulus in memory (Fig. 1A, vertical white line). Moreover, in contrast to earlier findings of Zaksas and Pasternak (2006), more recent experiments have revealed robust persistent activity representing direction of motion throughout the delay period of a WM task in the PFC (Mendoza-Halliday et al., 2014). Similarly, activation of only a small proportion of prefrontal neurons, in the order of 5%–15% (Lara and Wallis, 2014), may be sufficient for the faithful representation of stimulus color. It is also possible that color-selective neurons are concentrated in specific prefrontal “patches” (Lafer-Sousa and Conway, 2013), and persistent activity representing color information may be concentrated in such modules rather than diffusely distributed across the entire prefrontal surface. Representation of sequences of stimuli, or more complex displays, may also involve some transformation of the individual elements (Parthasarathy et al., 2017), without knowledge of which tuned persistent activity may not be reliably detected.
It has also been noted that persistent activity can be highly variable during the course of a trial, and from trial to trial, leading to the proposal that persistent activity demonstrated in earlier studies is an “artifact of averaging across trials” (Lundqvist et al., 2016). A high level of variability of persistent activity between and within trials is also not inconsistent with the persistent activity model. To the contrary, variability in discharge rate during the course of the trial would be expected, even among neurons that are highly active at some time point, as the activity might drift in the population (Fig. 2A, horizontal white line). Indeed, higher irregularity of spiking activity (quantified by the coefficient of variation of the interspike interval) has been observed in the delay period than during the fixation period of the ODR task (Compte et al., 2003). This otherwise puzzling finding is precisely predicted by the network models of persistent activity. Furthermore, if the observed persistent activity in single-neuron peristimulus time histograms were an artifact of averaging across trials, then this scenario would make strong testable predictions for higher-order statistics of spiking activity. If individual trials of WM activity were characterized by short intermittent bursts of high-rate firing (Lundqvist et al., 2016), then across-trial spike-count variability (e.g., as quantified by the Fano factor) would increase dramatically during mnemonic delays, relative to the task foreperiod with stable fixation, which is inconsistent with empirical measurements (Chang et al., 2012; Qi and Constantinidis, 2012).
As mentioned above, the concentration of neurons with domain-related persistent firing likely reflects the underlying anatomical projections (e.g., neurons with persistent firing to visual features, such as faces, are concentrated more ventrally than those with persistent activity to visual space) (Wilson et al., 1993), whereas those with persistent firing to somatosensory information are found even more ventrally in the inferior prefrontal convexity (Romo et al., 1999), all of which corresponds to the location of visual spatial, feature, and somatosensory inputs to the lateral PFC (Goldman-Rakic, 1987; Carmichael and Price, 1995). Topographical differences in neuronal response may also reflect the efferent projections to differing motor systems (e.g., subregions that project to frontal eye fields for control of eye movements compared with subregions projecting to premotor and motor cortices for a manual response) (Preuss and Goldman-Rakic, 1989). Thus, the relative concentration of neurons with persistent firing to a specific sensory/motor domain likely reflects the underlying anatomy. This topography must be respected, especially when interpreting apparent negative results, where the researcher may not have recorded from the most relevant subregion of PFC.
Alternative models of WM
Recent models have proposed mechanisms other than persistent discharges as the neural correlate of WM. We focus here on two categories of models: nonspiking models dependent exclusively on synaptic mechanisms and rhythmic-spiking models conveying information based on the frequency and phase of discharges without necessarily a persistent increase in spiking activity. We point out the limitations of these models and emphasize that researchers should distinguish between different levels of analysis and different underlying mechanisms.
First, computational models have been proposed to account for information storage via mechanisms that do not depend on spike generation, but instead involve short-term modification of synaptic properties that are not reflected in spiking activity (Mongillo et al., 2008; Sugase-Miyamoto et al., 2008; Mi et al., 2017). These models use synaptic mechanisms, such as short-term facilitation of vesicle release through availability of calcium at the presynaptic terminal, whose kinetics have time constant in the scale of seconds (Mongillo et al., 2008), or short-term potentiation of synaptic strengths (Fiebig and Lansner, 2017), which decays over minutes (Erickson et al., 2010). Beyond imprinting a passive trace of the stimulus activation pattern, it is less straightforward to consider how purely synaptic mechanisms would mediate other WM tasks, such as the ODR, delayed alternation, or free recall tasks, which can be controlled more flexibly. For example, short-term potentiation cannot underlie the updating of WM contents within the timeframe of seconds. We should note that plastic neuromodulation and other synaptic mechanisms have an important role in WM in that they can lower the threshold for the generation of spiking activity or help to shape network inputs (Arnsten et al., 2012), but lack the flexible, precise pattern and timing needed for accurate neural representations.
Second, WM may be maintained through spiking activity in a population code, but a code that is dynamic in time, rather than the stable population coding of persistent activity (Sreenivasan et al., 2014; D'Esposito and Postle, 2015; Lara and Wallis, 2015; Stokes, 2015). In dynamic coding models, population-level mnemonic representations are time-varying and do not generalize across time during maintenance. By contrast, recent population-level analyses of PFC activity have found that stimulus representations during WM delays provide a stable population code that generalizes across time, although they are distinct from transient representations present during stimulus presentation (Murray et al., 2017b; Spaak et al., 2017). As noted above, observation of time-varying neuronal activity does not imply dynamic coding and can be consistent with stable population coding and persistent activity (Machens et al., 2010; Druckmann and Chklovskii, 2012; Murray et al., 2017b).
Third, rhythmic activity has been implicated in WM (Siegel et al., 2009; Buschman et al., 2012; Liebe et al., 2012; Salazar et al., 2012; Brincat and Miller, 2015). The magnitude, frequency, and phase of oscillations within the PFC and between the PFC and other areas have been shown to be modulated depending on stimulus and task information (Buschman et al., 2012; Liebe et al., 2012). The coherence in rhythmic synchronization between neurons in prefrontal and posterior parietal cortex has also been reported to be content-dependent; prefrontal and parietal neurons synchronize their firing at specific frequencies according to stimuli held in memory (Salazar et al., 2012). In addition, spiking activity at different phases of local field potential (LFP) oscillations could also differentiate information representing two sequentially presented stimuli (Siegel et al., 2009).
Recent studies have specifically proposed that the rate of LFP bursting in the gamma frequency range, which correlates negatively with power in the beta frequency, underlies WM (Lundqvist et al., 2016, 2018; Bastos et al., 2018; Stanley et al., 2018; Wutz et al., 2018). A corollary of this model is that gamma-bursting pooled from error trials should be lower than that of correct trials. Unfortunately, no measures of behavior were shown to correlate with the purported neural basis of WM during the delay interval in any of these studies. Differences in gamma-bursting between correct and error trials were reported in one study (Lundqvist et al., 2018). Critically, no differences were reported during the delay periods following the sample presentations in the WM task used. Instead, error and correct trials were differentiated by levels of gamma-bursting only during the period when test stimuli were presented and the monkeys had to judge whether they matched stimuli held in memory, and errors were characterized by generally higher (not lower) levels of gamma-bursting (Lundqvist et al., 2018). In summary, gamma-bursting rate appears to be a poor predictor of whether information is successfully maintained during the delay period of a WM task, unlike persistent activity.
An additional concern in the Lundqvist et al., 2018 study is that LFP recordings were obtained from high impedance electrodes, which also isolated spikes. The existence of gamma power in such recordings is not a strong prediction of a model that seeks to falsify persistent activity. Multiple studies have shown remnants of spikes on LFPs, especially in high frequencies (Ray and Maunsell, 2011), even if one were to remove the spike waveforms from the LFP record (Zanos et al., 2011), which was not attempted in the aforementioned study. It is possible that the LFP measures reported are driven by spiking activity in the delay period of the task, including persistent activity.
In any case, oscillatory activity is not incompatible with persistent activity but, rather, might reflect the underlying persistent firing and its ramifications from a distance. For example, both robust persistent activity and gamma-band rhythmicity have been reported during the delay period of the ODR task (Pesaran et al., 2002; Lundqvist et al., 2016), as well as the two-item sequential WM task (Siegel et al., 2009). Furthermore, concurrent persistent activity and gamma-band rhythmicity are observed when recordings are performed in the cortical site that corresponds to task demands: both increased persistent firing (Warden and Miller, 2007) and gamma-band activity (Lundqvist et al., 2018) were captured from more ventral recording sites during an object feature WM task. Similarly, the classic ODR spatial WM task that generates persistent firing in dlPFC was associated with pronounced gamma bursts from the same region (Lundqvist et al., 2016). Thus, although measures of oscillatory activity allow the researcher to sample a broader range of neuronal activity than can be performed with single- or multiple-unit recordings, persistent firing appears to underlie the oscillatory events captured during WM.
In conclusion, the evidence reviewed here demonstrates that stimulus-tuned, persistent activity is readily observed in PFC during the execution of any type of WM task. Proportions of neurons active during the delay period may vary between prefrontal subdivisions depending on task. Overall levels of task performance and more specific behavioral outcomes that reflect the accuracy of recall are predicted by persistent activity. In the presence of such evidence, negative results of individual studies should be treated with caution. Alternative models not depending on persistent activity have been inspired by functional imaging studies, MEG studies, and the emergence of computer models that are relatively unconstrained. These may appear attractive on theoretical grounds but fall short of accounting for WM behavior. Synaptic mechanisms, including calcium availability, short-term potentiation, and neuromodulation, may play a role in WM because they facilitate the generation of spiking activity or shape its selectivity. Persistent activity appears in brains that have undergone a massive expansion of the devoted prefrontal microcircuits that readily maintain and manipulate information for abstract thought, a process that reached its apex in the human PFC (Haller et al., 2018).
Footnotes
This work was supported by National Institutes of Health, National Eye Institute Awards R01 EY017077 and R01 EY016773 to C.C., R01MH112746 to J.D.M., R01MH108643 and R01MH108629 to D.L., and DP1AG047744 to A.F.T.A.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Christos Constantinidis, Department of Neurobiology and Anatomy, Wake Forest School of Medicine, Medical Center Blvd, Winston-Salem, NC 27157. cconstan{at}wakehealth.edu
References
Response From Dual Perspectives Companion Author–Mikael Lundqvist
We think Constantinidis et al. misunderstand our view. These authors mostly defend the role of “delay activity” spiking in WM. There is no disagreement about this. Our point is that, on single trials (instead of the usual practice of averaging across trials), activity is sparse and complex, not simply persistent. Their arguments that delay activity conveys information, predicts behavioral errors, etc., are not relevant to this issue because those studies averaged across trials. Indeed, the same observations were made when the spiking on single trials was sparse (Lundqvist et al., 2018). They cite us for observing “persistent” activity (Warden and Miller, 2007) when we subsequently demonstrated sparseness using the same data (Lundqvist et al., 2018). The former study averaged across trials; the latter used single-trial analyses. That's precisely our point.
Constantinidis et al. also point to an example of a single neuron that seemed to show persistent spiking on single trials. We do not believe that single-neuron examples are good tests of models. They are cherry-picked to show a given property and are “best of,” not typical.
Constantinidis et al. further claim that random bursts of spiking should increase the Fano factor (spiking variability across trials) instead of the observed decrease. Actually, the decrease in Fano factor makes perfect sense. The bursts are random before stimulus onset. After the stimuli, the bursts are not completely random but task-modulated (e.g., increase over the delay). That decreases trial-to-trial variability, decreasing the Fano factor.
In addition, Constantinidis et al. argue that many neurons do not show persistence because stimuli were not optimized to each neuron's tuning curve. Under real-world conditions, a tiny fraction of neurons are operating under such ideal conditions. Parsimony suggests that whole neuron populations contribute to behavior, not just a tiny, select, fraction.
They argue against a purely synaptic model in which spikes are unimportant. Actually, in our model, spikes play critical roles in setting synaptic weights, for memory retention and reading out WMs. Again, there is no disagreement that spikes play an important role in WM.
They claim our study showed that gamma is a poor predictor of behavioral errors. That is wrong. We reported the balance between gamma and β during delays and decision-making predicted, not just forthcoming errors, but the exact type of error (Lundqvist et al., 2018), better prediction than has been reported for spiking. Another study showed that gamma was superior to spiking for real-time decoding of WM contents (Jia et al., 2017).
Finally, they claim our gamma could be reflecting spiking, but our investigation indicated it was not (Lundqvist et al., 2016, 2018). The narrow-band frequency and temporal profiles indicated true oscillatory LFP dynamics, not an indexing of spike waveforms.
Our observations that delay interval spiking is often sparse on single trials seems to have been interpreted as questioning the role of spiking in WM. It is the opposite. A new way of looking at the data has provided new insights but still points to a central role for spiking. It is just that the story is more complex than previously suspected. Is not that always the case?