Abstract
Rats, using their whiskers, have excellent capabilities in texture discrimination. What is the representation of texture in rat somatosensory cortex? We hypothesize that as rats “whisk” over a surface, the spatial frequency of a grooved or pebbled texture is converted to a temporal frequency of whisker vibration. Surface features such as groove depth or grain size modulate the amplitude of this vibration. Validation of the hypothesis depends on showing that vibration parameters have distinct neuronal representations in cortex. To test this, we delivered sinusoidal vibrations to the whisker shaft and analyzed cortical neuronal activity. Seven amplitudes and seven frequencies were combined to construct 49 stimuli while recording activity through a 10 × 10 microelectrode array inserted into the middle layers of barrel cortex. We find that cortical neurons do not explicitly encode vibration frequency (f) or amplitude (A) by any coding measure (average spike counts over different time windows, spike timing patterns in the peristimulus time histograms or in autocorrelograms). Instead, neurons explicitly encode the product of frequency and amplitude, which is proportional to the mean speed of the vibration. The quantity Af is an invariant because neuronal response encodes this feature independently of the values of the individual terms A and f. This was true across a wide time scale of firing rate measurements, from 5 to 500 msec. We conclude that vibration kinetics are rapidly and reliably encoded in the firing rate of cortical ensembles. Therefore, the cortical representation of vibration speed could underlie texture discrimination.
Introduction
Rats possess superb tactile capacities through use of their large facial whiskers. Indeed, as nocturnal animals albino rats depend on their whisker sensory system for object localization, judgment of the size and shape of objects, and the roughness or texture of surfaces (Guic-Robles et al., 1989; Brecht et al., 1997). It is of particular interest that humans and rats have approximately equivalent capacities in texture discrimination tasks such as comparing two spatial densities of grooves on a surface (Carvell and Simons, 1990). The purpose of the present work is to propose and test one hypothesis for the neuronal representation of texture in rat somatosensory cortex.
The rat somatosensory system is well suited for examining how neuronal activity encodes stimuli, not only because of its excellent functional capacities but also because of its unique anatomic and physiological organization. Layer IV of vibrissal somatosensory cortex is organized as a map of “barrels” (Woolsey and Van der Loos, 1970; Welker, 1971; Jensen and Killackey, 1987). Neurons in a given barrel and its associated column yield the strongest response to one single whisker and a weaker response to several surrounding whiskers (Simons, 1978; Armstrong-James and Fox, 1987; Diamond et al., 1993). The last several years have seen progress in revealing the temporal and spatial unfolding of barrel cortical neural activity in response to single-whisker deflections (Petersen and Diamond, 2000). From this, it has been possible to unravel the functional circuitry underlying cortical activity, but most studies have not provided insights into the issue of precisely which stimulus features are signaled by neuronal activity. One clue, however, comes from measurements of barrel cortex neural activity during ramp whisker deflections that varied in velocity and final resting position (Gibson and Welker, 1983b; Ito, 1985; Pinto et al., 2000). The main finding was that cortical neuronal firing rate increased with whisker velocity. Ramp and hold movements, however, are very different from the whisker movements observed during texture discrimination (Carvell and Simons, 1990). In the present study, we have used stimuli that more closely approximate the whisker movements that occur when behaving rats collect tactile information.
Rather than passively allowing objects to encounter the whiskers, rats generate tactile stimuli by sweeping their whiskers forward and backward (Welker, 1964; Fee et al., 1997; Sachdev et al., 2001, 2002; Berg and Kleinfeld, 2003). We hypothesize that during the forward and backward “whisking” movement, contact with textured surfaces produces vibrations in the whisker shaft. Surface features such as groove depth or grain size and the spacing between them modulate the frequency and amplitude of the vibrations. Whisker vibrations, the “signatures” of surfaces, propagate to the mechanoreceptors at the base of the whisker, which then transduce the motion to produce a neural representation. The hypothesis is viable only if different vibration parameters produce reliable and systematically distinct neuronal responses. Therefore we delivered vibrations as sinusoidal movements of the whisker shaft and analyzed cortical neuronal activity to determine whether, and in what way, vibration parameters are represented in cortex.
Materials and Methods
Subjects, surgery, and data acquisition. For a detailed description of surgical methods, see Rousche et al. (1999). Five adult male Wistar rats, weighing 250-350 gm, were used. All experiments were conducted in accordance with National Institutes of Health, international, and institutional standards for the care and use of animals in research. Surgical anesthesia was induced by urethane (1.5 gm/kg body weight) and maintained at a constant level by monitoring hindpaw withdrawal, corneal reflex, and respiratory rate and administering extra doses (10% of original dose) as necessary. Body temperature was maintained near 37.5°C using a servo-controlled heating blanket (FHC, Brunswick, ME). After placing subjects in a stereotaxic apparatus (Narishige, Tokyo, Japan), left somatosensory cortex was exposed by a 7 × 7 mm craniotomy centered on a point 2 mm posterior and 6 mm lateral to bregma. The dura mater was left intact. The vibrissae region of left somatosensory cortex was then identified according to vascular landmarks and stereotaxic coordinates (Hall and Lindholm, 1974; Chapin and Lin, 1984). A “Utah” array consisting of a 10 × 10 grid of 1.5-mm-long electrodes with 400 μm tip-to-tip spacing (Bionic Technologies, Salt Lake City, UT) was positioned over the selected area and inserted into the barrel (Fig. 1a,b) cortex using a pneumatic impulse inserter (Bionic Technologies) (Rousche and Normann, 1992). Another wire was positioned in the cortex to serve as a reference. From photographs of the inserted array and from response latency to whisker movement (see below) the minimum and maximum depths of electrode penetrations were estimated to be 700-1000 μm in all experiments (Fig. 1a).
Signals from all 100 electrodes were simultaneously amplified and filtered (gain, 5000; bandpass 250-7500 Hz; Bionic Technologies). For each active electrode, a voltage threshold was set to a value 2.5-3.5 times the root mean square voltage. A digital signal processor (Bionic Technologies) extracted 1 msec of data (0.33 msec before each threshold crossing, 0.67 msec after) and saved the waveform to the hard disk of a personal computer at 30,000 samples per second (Guillory and Normann, 1999). The waveforms emitted by a multiunit neural cluster (Fig. 1c) at each channel were selected off-line using spike-sorting programs (Bionic Technologies), and their time stamps were saved for further analysis. In earlier work it has been estimated that each electrode captures the activity of a cluster of ∼3-5 neurons (Rousche et al., 1999). At several electrodes in each experiment, it was possible to separate putative single-unit activity (Fig. 1d), and we exploited this to analyze the temporal firing patterns of single units (see Fig. 7). However, our principal aim was to select multiunit cluster activity rather than isolate single units.
Whisker stimulation paradigm. Using thin, lightweight glass micropipettes, we made a five-rung “ladder” and attached it to a piezoelectric wafer (Morgan Matroc, Bedford, OH). We positioned the ladder on the right side of the animal's snout so that the five rungs lay just below the corresponding five rows of whiskers (A-E), each whisker shaft resting lightly on the ladder ∼5 mm from the skin. Vibration duration was 500 msec with a 1 sec interval between consecutive vibrations. The piezoelectric wafer was driven vertically by sinusoidal voltages of varying amplitude and frequency, as follows. The discrete sinusoidal waveforms (10,000 samples/sec) were generated in MATLAB (Mathworks, Natick, MA). The stimulus signal came from a digital-to-analog converter (National Instruments, Austin, TX) that was split and sent in parallel to an amplifier (gain, 40) that drove the piezoelectric wafer and to a channel of the data acquisition computer, to maintain a record of the exact time of each trial. To explore cortical coding of the vibration, its frequency and amplitude on each trial assumed one of seven values (frequency: 19, 30, 50, 81, 131, 211, and 341 Hz; amplitude: 8, 12, 21, 33, 54, 87, and 140 μm). The resulting 49 different frequency-amplitude combinations (Fig. 2a,b) were presented in pseudorandom order 100-200 times per stimulus. To give an impression of the range of frequencies and amplitudes, short segments of the stimuli as measured by an optic sensor (see below) are shown in Figure 2c. Using a calibrated custom-built optic sensor, we confirmed that the stimulator moved in a smooth sinusoidal manner. We also monitored the displacement of the whisker shaft to verify that whisker movement smoothly “followed” and matched wafer movement across the full range of frequencies and amplitudes (Fig. 2d).
After delivery of the full stimulus set, all whiskers were cut 3 mm from their base and stimulated individually by a piezoelectric wafer with 1 Hz square pulses for 1 min. The resulting data were used off-line to determine for each electrode whether the neuronal cluster had a statistically significant response to any single whisker and, for such clusters, to identify the principal whisker and the response onset latency. Criteria for these measures are given in Petersen and Diamond (2000). Neuronal clusters that gave a statistically significant response and had a clear principal whisker were selected for analysis of responses to vibration stimuli; there were 16, 24, 37, 35, and 18 acceptable electrodes in experiments 1-5, respectively. Response onset latencies were in the range of 5-8 msec, signifying recording sites in a cortical layer receiving direct thalamic input. Identification of the principal whisker input to each neuronal cluster allowed us to construct barrelfield maps like the one shown in Figure 1b.
Results
Before turning to neuronal responses, it is important to take a deeper look at the composition of the stimuli. We have plotted for each stimulus the quantity loge(Af), where A is vibration amplitude and f is vibration frequency (Fig. 2b). The quantity Af is proportional to the mean speed of the stimulus. The stimulus set was organized in diagonal bands (top left to bottom right) within which widely varying frequency-amplitude combinations yielded similar values of Af. For example, for the stimuli f = 81 Hz, A = 87 μm and f = 211 Hz, A = 33 μm, loge(Af) = 8.86 and 8.85, respectively. We refer to these diagonally oriented series of stimuli as iso-Speed, or iso-Af, bands. Af differed sharply between neighboring bands. Note that the stimulus set could equally well be plotted in terms of loge(Af)2, the quantity (Af)2 being proportional to the kinetic energy of the stimulus. In that case, the diagonally oriented bands would consist of iso-energy stimuli. The main result to be illustrated in the course of the report, that neuronal response is systematically related to (Af)n, holds for any positive value of n.
Typical responses of a neuronal cluster to whisker vibrations (211 Hz, 87 μm) are evident in the raster plot and peristimulus time histograms (PSTHs) shown in Figure 3. The neuron cluster showed a sharp initial increase in firing rate 5 msec after stimulus onset, rising to a peak a few milliseconds later (see PSTH inset). After a second, smaller rise in firing rate at 70 msec, the response settled to a level of ∼25 spikes/sec and remained there until stimulus offset. There was a small increase in firing rate when the stimulus ended. Neurons commonly showed the two response phases present in this example: an early, rapidly adapting response (typically the first 50-100 msec of activity) and a subsequent, slowly adapting response (until stimulus offset).
How were vibration features encoded in neuronal activity? Figure 4 shows the responses of a neuronal cluster to the entire set of 49 stimuli. Response was measured as the firing rate across the entire 500 msec stimulus duration averaged across trials. In Figure 4a, responses are plotted for frequency-varying, iso-amplitude lines, highlighting how firing rate increased as vibration frequency increased. In Figure 4b, the same neuronal response values are plotted for amplitude-varying, iso-frequency lines; here, the dependence of firing rate on vibration amplitude is clear.
From these findings we conclude that neuronal firing rate varies with both frequency and amplitude. However, “decoding” these features from the firing rate of a single cluster would be ambiguous: a given response level could, in principle, represent an infinite number of frequency-amplitude combinations. Hence, barrel cortical neurons do not appear to explicitly report either of these two vibration features, frequency or amplitude, in their response magnitudes. Was some other feature of the vibration unambiguously encoded by response magnitude? As shown below, the product of frequency and amplitude (proportional to mean vibration speed) seems to be unambiguously encoded by response magnitude. Figure 5a illustrates firing rate, averaged over the 500 msec stimulus duration, for all 49 frequency-amplitude combinations. Responses within a given iso-Af band, running from top left to bottom right as in Figure 2, a and b, were nearly equal even if the stimuli differed drastically in frequency and amplitude. Moreover, firing rate increased monotonically from one iso-Af band to its neighbor in the top right direction. To test whether the late or “adapted” part of the neural response encodes Af, Figure 5b shows firing rate for the neuronal cluster excluding the first 100 msec of the response. Firing rates during late response (100-500 msec after stimulus onset) were lower than those measured over the whole stimulus duration, but correlated equally well with Af.
Stimuli of very different frequencies evoked nearly equal values of neuronal firing rate, provided that vibration amplitude was shifted in proportion to 1/f: vibration frequency therefore was not encoded by firing rate, at least at the slow time scales used above. However, it is possible that two other coding mechanisms, spike timing on a more precise time scale and correlations between spike times, could carry information to disambiguate frequency from amplitude. We examined these possibilities by comparing the PSTHs and autocorrelograms for different stimuli. Differences in PSTH profiles across stimuli imply coding by the precise timing of individual spikes; differences in autocorrelation structure across stimuli imply coding by spike correlations within neuronal clusters (Panzeri et al., 2001; Petersen et al., 2001).
Figure 6a presents the PSTHs corresponding to the 4 × 4 grid of stimuli framed in the top quadrant of Figure 5a. Although there were clear variations in PSTH profile (the emergence of a second peak at 70 msec, for example), the profiles were dictated by the product Af, not by frequency or amplitude alone. For example, note the similarity of the top left and bottom right PSTHs (asterisks), which were generated by stimuli of equal mean velocity but frequencies of 81 and 341 Hz. Autocorrelation histograms for the same stimuli are shown in Figure 6b; there were no specific autocorrelation “signatures” associated with particular vibration frequencies or particular amplitudes. Rather, the autocorrelograms reflect increasing numbers of events, that is, higher firing rates, as Af increased. These results were characteristic of the multiunit data from all five rats: in no case did we find evidence that the frequency-amplitude ambiguity present in neuronal firing rates (Fig. 4) could be resolved by precise spike timing or by correlations between spikes within these same neuronal clusters.
If stimulus frequency were encoded by the spike timing of individual neurons, the code might be obscured by analyses limited to multiunit clusters. To exclude this possibility, we analyzed the activity of putative single units whenever possible. Figure 7 shows the coding of vibration frequency for the neuron illustrated in Figure 1, d and e. Responses to stimuli from the top row of the 7 × 7 stimulus grid were selected for illustration; here, the entire vibration frequency range was sampled at the highest amplitude so that phase-locked activity, if it existed as a coding mechanism, would be most evident. Autocorrelograms (Fig. 7) reveal that the number of spikes increased as stimulus frequency increased (the height of the “shoulder” of the autocorrelogram), yet no frequency-dependent temporal firing patterns emerged. Even at the lowest frequency (19 Hz), there was no autocorrelogram peak at 53 msec, as would be expected if firing were phase-locked to stimulus cycles. In contrast, the firing rate of the single unit was clearly proportional to vibration frequency, increasing progressively as frequency increased (Fig. 7b).
Because individual neurons do not appear to encode vibration features through time-locked signals, we decided to characterize the neuronal encoding of vibration parameters for large neuronal populations by summating the spikes recorded at all electrodes positioned in the barrel cortex. These data came from at least 16 electrodes in each experiment (see Materials and Methods). For the five experiments, the mean population firing rates for the entire stimulus duration (0-500 msec) are given in Figure 8a. The strong correlation between stimulus speed and response magnitude was highly reproducible in different rats.
In the analyses to this point, neuronal responses were characterized by the activity in a window whose beginning and end time were fixed exactly to the time of stimulus onset. However, “downstream” neurons that integrate barrel cortex spike trains may not have independent signals regarding stimulus onset time. Moreover, the intervals across which downstream neurons integrate spikes may be much shorter than the periods of several hundred milliseconds we have considered; freely moving rats can make texture discriminations in shorter time spans (Carvell and Simons, 1990). It is important to consider whether a fast, “stimulus time-free” code might represent the vibration features. Therefore, we also quantified the neuronal response in a simple way that does not make use of the exact stimulus onset time: we shifted a narrow temporal window in 1 msec steps along the whole-array spike train and registered the maximum number of spikes falling in the window on each trial. The sliding window began 500 msec before stimulus onset and ended 500 msec after stimulus offset. For each stimulus, we computed the average of this peak across trials. The method included no explicit knowledge of stimulus onset time besides the separation of successive trials. Results for a 5 msec time window are shown in Figure 8b. For each of the five experiments, the encoding of stimulus speed by peak firing rate was nearly indistinguishable from the encoding of stimulus speed by a long-time window firing rate (Fig. 8a). This finding was insensitive to the width of the sliding time window for the entire range tested, from 2 to 50 msec.
Discussion
Identifying the encoded feature of the stimulus
In the present study, to our surprise, neither vibration frequency f nor amplitude A was reported in the activity of barrel cortex neurons in any unambiguous manner. Instead, the firing rate of barrel cortex neurons was related to log(Af)n. We illustrated the results with n = 1, such that neuronal activity was considered in relation to the mean speed of deflection. However, the same monotonic relation between stimulus and neuronal activity would have been obtained with, for example, n = 2; the quantity (Af)2 characterizes the kinetic energy of vibration. Because vibration energy and mean speed differ only by the exponent n, it is impossible to know whether neuronal activity encodes one, the other, or some combination. On the other hand, the data exclude the coding of any physical property of the vibration that has an asymmetric dependence on amplitude and frequency. One such property is acceleration, proportional to Af2. If cortical neurons explicitly encoded acceleration, then their activity would have depended more strongly on changes in frequency than on changes in amplitude.
From the present study, certain differences between subcortical and cortical responses to stimulus kinetics can be identified. Primary afferents can “follow” sinusoidal stimuli in a phase-locked manner at high frequencies (Dykes, 1975; Gottschaldt and Vahle-Hinz, 1981), whereas cortical neurons cannot. In contrast, some properties of the subcortical stations are conserved in cortical neurons. One example is the threshold for whisker velocity. In our experiments, the response threshold fell somewhere between the fourth and sixth velocity bands (Fig. 8), corresponding to 4-10 mm/sec at 5 mm distance from the skin. This places them near the typical threshold range for primary afferent fibers (8 mm/sec) (Gibson and Welker, 1983; Shoykhet et al., 2000) and for neurons in the ventral posterior medial nucleus of the thalamus (Ito, 1988) when measured using ramp and hold stimuli.
Converging results with multiple analytical approaches
To demonstrate the robustness of cortical vibration coding, we showed that the relation between Af and firing rate was very strong whether the window over which firing rate was measured included the entire 500 msec stimulus presentation (Figs. 5a,8a) or else a smaller window comprising only the “adapted” part of the response (Fig. 5b). Because these measures depend on the experimenter synchronizing the response measurement on stimulus onset time (information that the rat itself may not have), we examined an alternative coding scheme that is independent of a precise knowledge of stimulus time. On each trial, we slid a narrow time window along the whole-array spike train and registered the peak firing rate. From the mean of this peak across trials an explicit representation of Af nevertheless emerged (Fig. 8b).
Contact with an object during natural whisking usually is maintained for some tens of milliseconds in each whisking cycle (Sachdev et al., 2001; Berg and Kleinfeld, 2003), and a candidate coding mechanism must allow for stimulus information to be integrated on this time scale. In this light, the representation of Af by peak firing rate is particularly appealing not only because it is “stimulus time-free,” but also because it is rapid: the information necessary to distinguish stimuli is present within as little as 5-15 msec after stimulus onset. It is interesting that, when tested for the encoding of an entirely different sort of information (the identity of a single deflected whisker), barrel cortex activity provides maximum discriminability on the same time scale (Petersen and Diamond 2000; Panzeri et al., 2001; Petersen et al., 2002).
In contrast to their rapid and robust neuronal representation of the product Af, none of the firing rate decoding methods differentiated frequency or amplitude independently. We therefore looked to temporal codes. We asked whether two iso-Af stimuli with similar mean firing rates could induce distinct spike timing patterns correlated with frequency or amplitude. Neither the time-varying firing rate profile, as estimated by the PSTH, nor the within-cluster temporal patterns, as estimated by the autocorrelogram, could distinguish vibration amplitude or frequency. Further work with formal informational methods (Panzeri and Schultz, 2001; Panzeri et al., 2001; Pola et al., 2002) will be able to delineate exactly how much information is carried by neuronal activity about different stimulus features, which aspects of neuronal activity (firing rate, correlations) carry the information, and precisely how much stimulus discriminability could be afforded by the coding properties of cortical neurons.
Models for texture discrimination
In primates, the leading model for tactile texture discrimination proposes that type I slowly adapting receptors contain an isomorphic representation of the surface in their discharge patterns (Blake et al., 1997; Yoshioka et al., 2001). The response of these afferent fibers is then spatially filtered by neurons in area 3b of the primary somatosensory cortex to compute local spatial variations or roughness (Hsiao et al., 1993; Johnson, 2001). Although rats have texture discrimination capacities rivaling those of humans, there is, to date, no specific proposal for the neuronal representation of object features. One thing is certain: gross differences at the receptor level exclude the possibility that the same mechanisms could account for tactile processing in primates and rats. Examples of such differences include the specialized structure of hair follicles, the presence of different receptor types, and the discrete nature of the whisker grid, as compared with the continuous structure of the skin. The present study is a first step in addressing how the rat whisker sensory system has achieved capacities similar to those of primates through an independent coding strategy.
The challenge is to discover the essential differences in the neural representations of two different surface textures. In awake, behaving animals this has proved difficult and inconclusive (Prigg et al., 2002). Our expectation is that from experiments in anesthetized animals we can formulate specific hypotheses to be tested subsequently in awake, mobile animals. Sinusoidal vibrations, although inevitably a simplification of the waveforms generated at the follicle as the whiskers scan a surface, provide a controllable model containing some of the critical features associated with textures. The simplest interpretation of our results is that neurons encode textures not in terms of whisker vibration frequencies but rather in terms of the velocity (Af) or kinetic energy (Af)2 of the whisker vibration. More specifically, as the rat's whisking motion sweeps the whiskers across an object, the surface features, ridges or grains, for example, could produce whisker vibrations superimposed on the sweep. Surfaces might be distinguishable by the velocity (or kinetic energy) of the vibrations. The complete understanding of texture coding will need to take into account a number of additional factors. The vibration induced in a whisker as it sweeps over a surface is likely to depend on the properties of active whisking (whisking frequency and sweep amplitude), the structure of the textured surface, the distance between the base of the whisker and the point of surface contact, as well as the intrinsic mechanical properties of the whiskers.
This last factor is of particular interest because the mechanical properties of whiskers affect the transmission of vibrations along the shaft. Whisker length varies systematically across different arcs on the rat's snout (Brecht et al., 1997); as a consequence, the longer, caudal ones resonate at lower frequencies than the shorter rostral ones (Neimark et al., 2003). This suggests that the whisker array might act analogously to the cochlea, differentially distributing across the whiskerpad the principal frequencies associated with different surfaces. Induced frequencies matching the resonance frequency of the whisker would transmit the largest amount of energy to the base and, according to the findings presented here, this energy would be encoded in the sensory pathways. Rats could judge texture according to a comparison of the amount of energy transmitted by each whisker.
A second set of studies suggests an entirely different sort of mechanism. High-frequency (>200 Hz) oscillations in barrel cortex surface field potentials are evoked by whisker deflection (Jones and Barth, 1999; Jones et al., 2000). The oscillations are time-locked to stimulus onset, and the peak amplitude of the oscillation is centered on the column of the stimulated whisker. Most importantly, the oscillations induced by simultaneous stimulation of different whiskers interact depending on submillisecond time differences in the stimulus onsets; slight changes in relative stimulus timing determine whether there is constructive or destructive interference in the field potentials. Taken together, these observations point to a model in which different textures generate distinct timing patterns among the whiskers, and such timing patterns are represented in the high-frequency oscillations in barrel cortex. The proposal we have put forward differs from both ideas outlined above in that multiwhisker patterns do not appear to be a critical component of the cortical representation of texture in our model.
Additional work involving neuronal recordings from awake animals will be required to test whether barrel cortex neurons encode whisker vibrations during natural behavior and whether such coding is in fact an essential mechanism underlying tactile discrimination.
Footnotes
This work was supported by European Community IST-2000-28127, Telethon Foundation GGP02459, J. S. McDonnell Foundation 20002035, Consiglio Nazionale delle Ricerche 02.00536.ST97, Wellcome Trust 066372/Z/01/Z, and Regione Friuli Venezia Giulia. We thank Miguel Maravall, Stefano Panzeri, Adrienne Fairhall, Marcelo Magnasco, and John Nicholls for valuable discussions and Erik Zorzin for construction of the optic sensor.
Correspondence should be addressed to Mathew E. Diamond, Cognitive Neuroscience Sector, International School for Advanced Studies, Via Beirut 2/4, 34013 Trieste, Italy. E-mail: diamond{at}sissa.it.
Copyright © 2003 Society for Neuroscience 0270-6474/03/239146-09$15.00/0