 |
Previous Article | Next Article 
The Journal of Neuroscience, April 1, 1998, 18(7):2626-2645
Structure of Receptive Fields in Area 3b of Primary Somatosensory
Cortex in the Alert Monkey
James J.
DiCarlo,
Kenneth O.
Johnson, and
Steven S.
Hsiao
Krieger Mind/Brain Institute, Department of Neuroscience, and
Department of Biomedical Engineering, Johns Hopkins University,
Baltimore, Maryland 21218
 |
ABSTRACT |
We investigated the two-dimensional structure of area 3b neuronal
receptive fields (RFs) in three alert monkeys. Three hundred thirty
neurons with RFs on the distal fingerpads were studied with scanned,
random dot stimuli. Each neuron was stimulated continuously for 14 min,
yielding 20,000 response data points. Excitatory and inhibitory
components of each RF were determined with a modified linear regression
algorithm. Analyses assessing goodness-of-fit, repeatability, and
generality of the RFs were developed. Two hundred forty-seven neurons
yielded highly repeatable RF estimates, and most RFs accounted for a
large fraction of the explainable response of each neuron. Although the
area 3b RF structures appeared to be continuously distributed, certain
structural generalities were apparent. Most RFs (94%) contained a
single, central region of excitation and one or more regions of
inhibition located on one, two, three, or all four sides of the
excitatory center. The shape, area, and strength of excitatory and
inhibitory RF regions ranged widely. Half the RFs contained almost
evenly balanced excitation and inhibition. The findings indicate that
area 3b neurons act as local spatiotemporal filters that are maximally
excited by the presence of particular stimulus features. We believe
that form and texture perception are based on high-level
representations and that area 3b is an intermediate stage in the
processes leading to these representations. Two possibilities are
considered: (1) that these high-level representations are basically
somatotopic and that area 3b neurons amplify some features and suppress
others, or (2) that these representations are highly transformed and
that area 3b effects a step in the transformation.
Key words:
receptive field; somatosensory; cortex; tactile; form; texture; area 3b; SI; monkey; first-order kernel; reverse
correlation
 |
INTRODUCTION |
Our long-term goal is to elucidate
the neural mechanisms of tactile form and texture perception. In this
study, we sought to understand the representation of tactile stimuli in
area 3b of primary somatosensory (SI) cortex because it is the first
cortical area involved in processing the neural signals implicated in
form and texture perception. Within area 3b regions representing the fingerpad, we described the receptive field (RF) of each neuron as the
two-dimensional pattern of excitation and inhibition that determines
the neuronal response to complex spatial patterns scanned across the
skin.
Several lines of evidence suggest that the neural signals that underlie
the perception of tactile form and texture on the glabrous skin of the
primate fingerpad are conveyed initially by the population of slowly
adapting type I (SAI) mechanoreceptive afferents (for review, see
Johnson and Hsiao, 1992 ). After transmission through the dorsal column
nuclei (Mountcastle, 1984 ) and the thalamus (Poggio and Mountcastle,
1960 ), these neural signals are projected most densely to area 3b of SI
cortex (Jones, 1986 ). Evidence that area 3b is the primary
cortical-receiving area for these signals comes from physiological
studies showing that, among the SI areas (areas 3a, 3b, 1, and 2), area
3b has the largest cortical surface area devoted to digit
representation (Sur et al., 1980 ), the highest proportion of cells
responsive to light cutaneous stimulation (Powell and Mountcastle,
1959b ; Iwamura et al., 1983 ; Kaas et al., 1984 ), cells with the
smallest RFs (Paul et al., 1972 ; Sur et al., 1980 , 1985 ), and the
highest proportion of cells responding to static skin indentation (Paul
et al., 1972 ; Sur et al., 1984 ). Removal of area 3b in the monkey
markedly reduces the responsiveness of area 1 neurons (Garraghty et
al., 1990 ) and secondary somatosensory (SII) neurons (Pons et al.,
1992 ; see also Pons et al., 1987 ) to cutaneous stimuli. It also
produces profound behavioral deficits in all somatosensory tasks
tested, whereas removal of other SI areas produces specific deficits in
the tactile discrimination of textures (area 1) and three-dimensional
forms (area 2) (Randolph and Semmes, 1974 ).
Although most area 3b neurons appear to have homogeneous, excitatory
RFs when probed with punctate stimuli (Mountcastle and Powell, 1959 ;
Sur, 1980 ), response properties that are more complex than those that
would be expected if the RFs were homogeneous have been reported in
some area 3b neurons. These include excitatory summation (Gardner and
Costanzo, 1980a ), surround inhibition (Mountcastle and Powell, 1959 ;
Iwamura et al., 1983 ), directional selectivity (Whitsel et al., 1972 ;
Hyvarinen and Poranen, 1978a ; Warren et al., 1986 ), and orientation
selectivity (Pubols and Leroy, 1977 ; Hyvarinen and Poranen, 1978a ;
Warren et al., 1986 ). Likewise, when neurons with RFs on the fingerpads
are stimulated with scanned, complex spatial stimuli, almost all yield
responses that are more complex than those that can be accounted for by
simple, excitatory RFs (Phillips et al., 1988 ; Bankman et al., 1990 ;
Johnson et al., 1995 ).
In this study, we scanned surfaces of randomly distributed, raised dots
across the RFs of 330 neurons in area 3b of three alert monkeys. This
subjected each neuronal RF to a very large number of different spatial
dot patterns. We determined the excitatory and inhibitory structure of
each neuron's RF by comparing the instantaneous spike rates with the
patterns that evoked them. Most RFs contained a single, well-defined
excitatory region and one or more surrounding or offset inhibitory
regions. The RFs describe how tactile form on the fingerpad is
transformed to a neural representation in area 3b. This transformation
may serve to enhance the presence of spatial features in the stimulus.
Alternatively, it may reflect the early stages of a transformational
sequence whose end result is a totally different, nonsomatotopic
representation that directly underlies form and texture perception.
 |
MATERIALS AND METHODS |
Animals and surgery
Two male and one female rhesus monkey (Macaca
mulatta) weighing 4-5 kg were used in this study. Each animal was
trained to perform a visual detection task during the presentation of
tactile stimuli (see below). Although the task was unrelated to the
tactile stimuli, it served to maintain the animal in a constant, alert state during recording periods. After the animal was performing the
task nearly perfectly, which took a few weeks, surgery was performed to
attach a head-holding device and a recording chamber to the skull.
Surgical anesthesia was induced with ketamine HCl (33 mg/kg, i.m.) and
maintained with pentobarbital (10 mg · kg 1 · hr 1,
i.v.). All surgical procedures were done under sterile conditions and
in accordance with the rules and regulations of the Johns Hopkins
Animal Care and Use Committee and the Society for Neuroscience.
Recording
Electrophysiological recordings were made in the postcentral
gyri of five hemispheres using standard techniques (Phillips et al.,
1988 ; Mountcastle et al., 1991 ). On each recording day, a
multielectrode microdrive (Mountcastle et al., 1991 ) was loaded with
seven quartz-coated platinum/tungsten (90/10) electrodes (diameter, 80 µm; tip diameter, 4 µm; and impedance, 1-5 M at 1000 Hz). Each
electrode was coated with one of three fluorescent dyes (DiI, DiI-C5,
or DiO; Molecular Probes, Eugene, OR) to facilitate later histological
localization of the recording sites (DiCarlo et al., 1996 ). The tip of
the microdrive was then inserted into the recording chamber, which was
filled with physiological saline, and oriented so the electrodes were
normal to the skull. The electrodes emerged from the tip of the
microdrive separated by 400 µm and arranged in a single row. The row
of electrodes was oriented normal to the plane of the central sulcus so
that three to four electrodes could be positioned in a line across the
laminae of area 3b (DiCarlo et al., 1996 ).
The localization of recordings to area 3b was accomplished in two ways:
initially, during the experiments, by physiological evidence and later,
after euthanasia, by histological methods. During the experiments, we
relied on the characteristic progression of RF locations as each
electrode was advanced through area 1 into area 3b (Merzenich et al.,
1978 ; Sur et al., 1984 ). This progression was nearly identical in all
five hemispheres. At the first sign of neural activity, the cutaneous
RFs usually occupied most, or a portion, of a single digit and could be
found on either the glabrous or hairy skin. As the electrode was
advanced, the location of the RF shifted toward the base of the finger
and often spread to the palmer whorls (at ~1500-2000 µm below the
first sign of neural activity). As the electrode was advanced further, the RF location began to move back onto the glabrous surface of the
proximal phalanx and was much more localized. Later histological analysis showed that this reversal corresponded closely to the border
between areas 1 and 3b. Over the next 500-1000 µm, the RF locations
continued to move distally along the glabrous surface of the digit
without jumps or discontinuities. Neurons with RFs located on one of
the distal fingerpads in area 3b were typically located in a region
2000-3000 µm below the first signs of neural activity. All neurons
that met the following criteria were studied using the stimulus
procedures described below: (1) the neuron's action potentials were
well isolated from the noise, (2) the neural RF was located on one of
the distal fingerpads (digits 2-5), and (3) the stimulus drum and the
hand (see below) could be positioned so that the RF was centered on the
portion of the fingerpad in contact with the stimulus. Using manual
probes, we encountered very few neurons (<5%) with RFs that extended
over multiple digits, but we cannot eliminate the possibility that
multidigit RF components might be revealed with quantitative RF-mapping
techniques spanning more than a single fingerpad.
On each successive day of recording, we shifted the position of the
line of microelectrodes ~300 µm medial (or lateral) from the
position of the previous day. Thus, after ~15-20 recording days in
each hemisphere, we sampled from the entire cortical volume devoted to
the glabrous distal digit representation (digits 2-5) in area 3b.
Action potentials from each electrode were discriminated from the
background noise with an amplitude discriminator. A continuous record
of the stimulus location and the times of occurrences of action
potentials, stimulus events, and behavioral events were stored in a
computer with an accuracy of 0.1 msec. At the end of each day, the
electrodes were withdrawn, two drops of dexamethasone phosphate (0.1%)
and gentamycin (0.002%) were applied to the dura, Gelfoam (Upjohn,
Kalamazoo, MI) soaked in sterile saline was inserted into the recording
chamber, and the chamber was sealed.
Stimuli
The stimuli were patterns of raised dots (see Figs. 1, 2) or
letters fabricated from sheets of photosensitive plastic that are
water-soluble until exposed to UV light (Toyoba Printight plastics, EF
series). A raised stimulus pattern was produced by laying a
photographic negative of the pattern over the plastic sheet and
exposing it to UV light, which polymerizes and hardens the exposed
plastic. The portion of the surface layer not exposed to UV light is
scrubbed off lightly in water, leaving a raised pattern whose thickness
equals the thickness of the original water-soluble layer (400 or 500 µm; see below). The etching precision is limited only by the
precision of the photographic negative. The stimulus pattern was
specified in Postscript and printed at 3386 dots per inch. After
construction, sheets containing the stimulus patterns were wrapped
around and glued to cylindrical drums, 320 mm in circumference, which
were mounted on a rotating drum stimulator (Johnson and Phillips, 1988 )
(see Fig. 1).
The same photographic negative with a single random dot pattern, 28 mm
wide and 250 mm long, was used to construct the random dot stimuli for
all three monkeys. In the experiments performed on one monkey, the
pattern was trimmed to 175 mm long to make room for a segment with
oriented bars. Otherwise, the stimuli were identical in the three
monkeys. Each dot was 400 µm high (in relief) and 500 µm in
diameter at its top, with sides that sloped away at 60° relative to
the surface of the drum. The location of the center of each dot was
determined by selecting two random numbers from a uniform random number
generator and scaling them to the height (28 mm) and width (250 mm) of
the pattern. The dot centers were specified to a precision of 1 µm,
and dots were allowed to overlap when the random number generator
placed them closer than the dot diameter. Dots were added until the
average dot density equaled 10 dots per square centimeter (i.e., 700 dots on a surface 28 mm wide × 250 mm long). Random dot patterns
are unbiased in the sense that all possible patterns with the specified
dot density are equally likely and the probability of a repeated
pattern is virtually zero. The dot density (10 dots/cm2) was chosen to satisfy two constraints.
First, it was large enough that, at each position on the pattern, the
RF was often stimulated by several dots. Such stimulation enables the
detection of inhibitory RF regions as a decrement in the response to a
simultaneously stimulated excitatory RF region (see the Discussion).
Second, preliminary studies showed that higher dot densities resulted in lower average firing rates in primary afferent SAI neurons, an
important source of input to area 3b. A second stimulus pattern consisted of embossed Helvetica letters 8 mm high that were composed of
lines 500 µm wide and 500 µm in relief (Phillips et al., 1988 ).
After one or more neurons with overlapping RF locations were isolated
with one or more of the electrodes, a drum with one of the stimulus
patterns was positioned over the fingerpad so that all of the neural
RFs were located in the cutaneous region contacting the drum surface.
During stimulus presentation, the orientation and the angular velocity
of the drum were adjusted to produce proximal-to-distal stimulus
movement at 40 mm/sec across the skin surface. Contact force was
controlled by a torque motor set to deliver 30 gm of force to the
fingerpad. The drum was positioned initially so that the cutaneous
contact region was entirely within the random dot pattern and the
center of the contact region was ~5 mm from the edge of the long side
of the pattern. After each revolution, the drum was stepped 200 µm
along its axis of rotation (Fig. 1). For
most of the neurons presented here, the drum completed 100 revolutions
(i.e., sweeps) and thus stepped a total distance of 20 mm, which took
~14 min. Two hundred marker impulses triggered at fixed, equal
increments around the drum hub were used to determine the position
of the stimulus relative to the occurrence of each action
potential with an accuracy of 0.1 msec (Johnson and Phillips, 1988 ).

View larger version (23K):
[in this window]
[in a new window]
|
Figure 1.
Stimulus. The stimulus pattern was a field
(28 × 250 mm) of randomly distributed, raised dots (see Fig. 2)
mounted on the surface of a drum (Johnson and Phillips, 1988 ). The dots
were 400 µm in relief and 500 µm in diameter and averaged 10 per
cm2. The drum was lowered onto the skin of the
distal fingerpad containing the neural RF with a controlled force of 30 gm. The hand and finger were held fixed from below (data not shown).
The drum rotated at a constant angular velocity to produce
proximal-to-distal motion at 40 mm/sec. After each rotation, the drum
was translated by 200 µm along its axis of rotation. The recording
period yielding the data for a single RF estimate typically involved
100 revolutions and lasted 14 min (see Materials and Methods for
details).
|
|
Histology
After several weeks of recording from each hemisphere, the
animal was deeply anesthetized with sodium pentobarbital (65 mg/kg, i.v.) and perfused transcardially with phosphate buffer followed by
cold, phosphate-buffered 3% paraformaldehyde, pH = 7.4. Frozen tissue sections (50 µm) were mounted but were not stained. Electrode tracks were visualized using fluorescent dye traces left by each electrode (DiCarlo et al., 1996 ). The electrode tracks and borders of
each section were traced into a computer data file (using Neurolucida). These data files were used to construct a three-dimensional rendering of the electrode tracings (using AutoCAD). Neuron locations were then
determined along these tracings using microdrive depth measurements. The tissue sections were then stained with cresyl violet, and the area
3b borders were defined according to the criteria of Powell and
Mountcastle (1959a) . For this study, we simply confirmed that all
neurons included in the final sample were located in area 3b. Detailed
laminar analysis of these results will be discussed in a future
paper.
RF estimation
The goal of the analysis presented here was to use the pattern
and strength of firing to infer the two-dimensional pattern of RF
excitation and inhibition on the skin. We assumed that each small
region of skin had a positive, negative, or zero effect on the firing
rate when stimulated and that the instantaneous firing rate was equal
to the sum of these effects. More specifically, we subdivided a 10 mm
square region of skin containing the RF into a grid of 625 (25 × 25) subregions, each 400 µm square. The method used to estimate the
contribution of each subregion is applicable to a wide range of
stimulus patterns. However, because our stimulus pattern is random, the
estimation method has a simple explanation. Each of the subregions was
stimulated by a passing dot ~1000 times (each region of skin was
stimulated by 50 cm2 of the stimulus pattern, which
contained ~500 dots, and each dot passed over each 400 × 400 µm subregion twice). On each of those occasions, the dot passed over
the subregion for 10 msec (400 µm at 40 mm/sec); thus, each subregion
was stimulated for a total of 10 sec out of the total recording time of
14 min. The average firing rate during the 1000 instances when a
subregion was stimulated is the overall mean firing rate (because all
other sites were stimulated randomly) plus the effect of the specific region under consideration. For example, if the subregion was strongly
inhibitory, then the mean firing rate during stimulation of that
subregion was much lower than the overall mean firing rate. If the
subregion was only mildly inhibitory, the mean firing rate was only
slightly less than the overall mean. In other words, the deviation
(positive or negative) from the overall mean firing rate was graded and
proportional to the strength of the effect (excitatory or inhibitory).
The RF maps presented in this paper are, in effect, maps of the mean
deviations in firing rate produced by each subregion in the RF. The
method just described is related closely to reverse correlation (de
Boer and Kuyper, 1968 ; Jones and Palmer, 1987 ). The method we actually
used was a modified form of linear regression, which includes reverse
correlation as a special case when the stimulus pattern is perfectly
random. Linear regression is more general because it provides a
least-squared-error solution whether the patterns are random or not.
The randomness of the pattern simply makes the estimates more robust
(less susceptible to noise in the response). The details follow.
Aligning the response with the stimulus. Each neuron
responded to the random dot pattern with a stream of up to 60,000 action potentials recorded continuously for the 14 min or so required to present the entire random dot pattern. To bring the neural response
into alignment with the stimulus, we assigned each action potential a
(x, y) position on the surface of the drum
corresponding to the location of the stimulus pattern at the time of
the action potential. The x location (distance in the
scanning direction from the beginning of the random dot pattern) was
determined by a digital shaft encoder. The y location was
determined by the axial position of the drum. The precision of the
method is better than 8 µm (Johnson and Phillips, 1988 ). The
resulting spatial raster (see, e.g., Fig. 2D) is
referred to as a spatial event plot (SEP).
This spatial raster of impulse locations was converted to a
two-dimensional histogram by dividing the space into a rectangular grid
of 400 × 400 µm bins and counting the number of impulses in
each bin. Four hundred micrometers were selected as the unit of spatial
resolution because this distance is less than half the mean spacing of
SAI and rapidly adapting (RA) primary afferents innervating the
fingerpad (Darian-Smith and Kenins, 1980 ) and is thus able to
approximate the spatial details of the afferent inputs even when they
are spaced irregularly. Along the scanning axis of the stimulus
(proximal-distal finger axis), 400 µm is equivalent to 10 msec (40 mm/sec scanning velocity). Along the axis orthogonal to the scanning
direction, 400 µm captures data from two sweeps because each sweep is
displaced 200 µm from the previous one (see above). A typical
stimulus run comprised 100 sweeps of the stimulus, which yielded a
response histogram with 20,000 bins.
A comparable histogram describing the random dot stimulus (stimulus
histogram) was created in two ways. In the simpler approach, each
400 × 400 µm bin containing one or more dot centers was
assigned the value 0.4 (mm in relief), and all other bins were filled
with zeros. In the more complicated approach, each histogram bin was assigned a number proportional to the raised dot area within its boundaries. The results were unaffected by the choice of method, so the
simpler approach was used.
To determine the stimulus interactions that produced the neural
response, we aligned the stimulus and the response histograms. Even
though position signals from the drum stimulator were accurate to the
nearest 8 µm (Johnson and Phillips, 1988 ) and spike times were
recorded with an accuracy of 0.1 msec, lack of knowledge of the exact
spatial relationship between the stimulus drum and the RF as well as an
unknown delay between the stimulus and the cortical action potentials
made the initial alignment of the stimulus and the response histograms
approximate. However, exact alignment is not critical. If the stimulus
is shifted relative to the response, the whole RF excitatory and
inhibitory structure is simply shifted within the 10 × 10 mm RF
grid. Nevertheless, we standardized the alignment by shifting the
response histogram to produce the maximum absolute cross-correlation
(usually positive but occasionally negative) between the stimulus and
the response histograms. Because of this, the RF peak value (usually
excitatory) was centered in the RF grid.
Estimating the RF map. We assumed that the impulse rate in
each response bin [r(n); n = current response bin] was the summed effect of several factors
that included a possible constant background discharge rate
(b0 in the equation below), the summed
excitatory and inhibitory effects of all of the stimulus elements
within the RF, possible nonlinear interactions between the
stimulus elements (rnl), and a random
component ( ):
|
(1)
|
The product
bixi(n)
represents the contribution of the stimulus
xi(n) in the ith RF
bin at time step n. The stimulus
xi in the ith bin was assigned
a value equal to the height of the stimulus in the bin at that moment
(0.4 mm if a dot was present; 0 if it was not). The coefficients
b1 to b625 represent the
weights (positive, excitatory; negative, inhibitory) given to the
stimuli x1 to x625 in the
25 × 25 subregions of the RF. The weight matrix specified by
b1 to b625 specifies the
pattern of excitation and inhibition within the RF; for brevity, it
will be referred to as the RF.
This formulation resulted in one equation for each bin (i.e., time
step) in the response histogram except for those near the beginning and
end of each sweep where the random dot pattern did not cover the RF
completely. The actual number of equations depended on the number of
sweeps and the alignment of the stimulus and response but was typically
~20,000 equations.
The unknown weights that specify the RF
(b1 ... , b625)
can be estimated in a variety of ways, including neural net (Johnson et
al., 1995 ), kernel (Marmarelis and Marmarelis, 1978 ), and regression (Draper and Smith, 1981 ) methods. These methods all yield essentially the same result, the set of weights that minimizes the mean-squared difference between the predicted and observed responses. We
accomplished this minimization by adhering as closely as possible to
the standard methods of multiple linear regression (Draper and Smith,
1981 ). The main complication relates to the nonlinearity inherent in an
inability of a neuron to produce negative impulse rates (threshold effect). The details are presented in . The nonlinear response component rnl(n) is, by definition,
the repeatable (nonrandom) part of the response that cannot be
explained by the linear RF. The magnitude of this response component is
estimated by averaging repeated sweeps (see ).
After the RF is estimated, issues related to the quality of the
estimate arise. We address these under the headings of repeatability, goodness-of-fit, and generality. The methods used to address the questions of repeatability and generality are presented in the Results.
The question of goodness-of-fit is what fraction of the explainable
variation in the data (i.e., the repeatable variation) is actually
explained by the linear interactions described by the RF
(b0 ... , b625).
The computation of goodness-of-fit is presented in .
 |
RESULTS |
Random dot patterns were scanned across the RFs of 330 neurons in
area 3b of five hemispheres in three alert monkeys. A neuron with an RF
located on one of the distal fingerpads was excluded from the study
only if the drum stimulator could not be positioned to bring the RF,
mapped with a manual probe, well within the contact region between the
skin and the stimulus surface. Even neurons that were marginally
responsive to manual probing were studied with the idea that the random
dot pattern might uncover responsiveness that was not evident with
simpler probing. Of these 330 neurons, 298 (90%) were sufficiently
modulated by the random dot pattern so that their responses could be
confidently aligned with the stimulus pattern and their RFs determined
(see Materials and Methods). The mean firing rate evoked by the random
dot patterns was 31.9 impulses/sec.
We first present a description of the RF structures observed in area 3b
followed by several key measures of those RFs, such as RF areas,
magnitudes, orientations, and aspect ratios. Then, we address issues
related to the quality of these RF estimates under the headings of
repeatability, goodness-of-fit, and generality.
A typical RF
An example of a typical area 3b RF is shown in Figure
2, which illustrates the conventions
adhered to throughout this paper. The square, gray
scale image illustrated in Figure 2A shows
the excitatory (dark) and inhibitory (light) weights that best describe the response patterns illustrated in Figure 2, D and
E. The RF weights are displayed as though they were viewed
through the back of the finger with the tip of the finger pointing to
the left of the figure. The stimulus pattern (Fig.
2B) is moving from right to
left beneath the fingerpad at 40 mm/sec. The relative motion is that that would occur if the RF (Fig. 2A) was
scanned from left to right across the stimulus pattern (Fig.
2B).

View larger version (50K):
[in this window]
[in a new window]
|
Figure 2.
A typical neural response and the resulting RF
estimate. A, RF estimate. The gray scale
represents the grid of weights (25 × 25 bins = 10 × 10 mm) that best described the response of the neuron to the random dot
stimulus pattern (see Materials and Methods). The RF diagram is meant
to represent excitatory and inhibitory skin regions viewed through the
back of the finger as the finger points to the left and
the stimulus pattern moves from right to left under the finger. The background
gray level (50% black) represents the
region where dots had no (linear) effect on the neural response, with
darker levels representing excitatory regions where dots increased the
probability of firing and lighter levels representing regions where
dots decreased the probability of firing. B, A portion of the random dot stimulus pattern with the RF superimposed at three
locations. The intensity of the RF gray scale has been
reduced so the stimulus dots can be seen.
C, Neural impulse rates predicted by convolving the RF
(A) with the random dot stimulus (B) and by
clipping negative values to zero. Darker regions correspond to higher
predicted rates. The arrows extending from
B to C point to the predicted impulse
rates for each of the three RF positions in B.
D, Observed response of this neuron. Each tick
mark indicates the occurrence of a single spike. The plotted
position of each spike was determined by the location of the stimulus
pattern at the instant the spike occurred (SEP). The three
vertical arrows indicate the responses at the stimulus
locations corresponding to the three predicted responses in
C. E, Predicted (black
line) and observed (gray histogram)
impulse rates in a single scan are indicated by the
arrows at the sides of C
and D. Predicted rates <0 correspond to periods in
which the summed inhibitory effects exceed the summed excitatory
effects.
|
|
The gray scale RF plot illustrated in Figure
2A is a 25 × 25 grid of weights, each
representing the influence of a stimulus dot within a single 400 × 400 µm patch of skin. Each RF weight value can be interpreted as
the instantaneous change in firing rate that occurs when its 400 × 400 µm skin region is depressed by a stimulus feature 400 µm
high (the dot relief) moving proximally to distally at 40 mm/sec. For
purposes of visualization, each RF was normalized by dividing all
weights by the absolute value of the largest weight, thereby
restricting the values to the range from 1 to +1. The
"background" gray level (the gray level
around the border, which is 50% black) represents the
region where dots had no average effect (zero RF weight), with darker
regions representing excitatory effects and lighter regions
representing inhibitory effects. Black regions (100%
black) correspond to normalized RF values of +1.
White regions (0% black) correspond to a
normalized RF value of 1 and occur only when the absolute maximum RF
value is inhibitory, which happened infrequently.
The RF map in Figure 2A shows that this neuron had a
region of intense excitation followed by a slightly larger region of inhibition and that both regions were oriented at ~45° relative to
the proximal-distal axis of the finger. The weights and their spatial
organization were determined by multiple regression of the neural
response on the stimulus as described in Materials and Methods and
. The relationship between the weight pattern and the responses
can be seen by inspecting Figure 2B-D.
Whenever one or more dots occurred anywhere within the darkened region of the RF, the probability of firing increased. Occurrence of dots only
within the white, inhibitory region of the RF had no effect because
this neuron, like most neurons in the study, had no background
discharge to be inhibited. However, whenever one or more dots occurred
in the inhibitory region at the same time that dots occurred in the
excitatory region, the probability of firing was reduced relative to
that expected from stimulation of the excitatory region alone. Three
instances are shown in Figure 2B-D (the
RF weight pattern in Figure 2B is lightened so the
stimulus dots within the excitatory region can be seen). The
left example shows an instant in the ongoing interaction
between the RF and the stimulus pattern in which three dots happen to
lie within the excitatory region of the RF. This alignment predicts an
intense response, which is displayed at the tip of the left
vertical arrow in Figure 2C. The actual response is
displayed at the comparable location in Figure 2D.
The second (middle) example illustrates an alignment in
which only a single dot lies within the excitatory region. The
predicted (Fig. 2C) and actual (Fig. 2D)
responses are much less intense than are those in the first example.
The third (right) example shows an alignment with a stimulus
dot at exactly the same place within the excitatory subfield but also with two dots within the inhibitory subfield. The predicted (Fig. 2C) and actual (Fig. 2D) result is a
cessation of firing. A typical experiment produced ~20,000
stimulus-response combinations of the kind illustrated in these three
examples, which provided the basis for the precise estimation of the RF
weights. Figure 2E shows a continuous
trace of the predicted and observed firing rates across the
single scan indicated by the arrows to the left and right of Figure 2, C and D.
The RF illustrated in Figure 2, like 36% of the neurons in this study
(see Fig. 3), has a single region of
inhibition displaced distally relative to the center of excitation.
Because the random dot patterns were scanned across the RF of the
neuron at one velocity and in the proximal-to-distal direction, space
and time are confounded in that direction. For example, the inhibitory
region in the RF illustrated in Figure 2, whose center is displaced 2.4 mm distal to the center of excitation, could have resulted from
inhibition displaced distally from the center of excitation by 2.4 mm,
from inhibition (or suppression) delayed by 60 msec relative to the excitation (2.4 mm at 40 mm/sec), or from some combination of the two
possibilities.

View larger version (102K):
[in this window]
[in a new window]
|
Figure 3.
RF structures observed in area 3b. Each
panel gives a typical example of the type, the total
number of RFs fitting the description, and their percent of the total
RF sample (n = 247). The types are shown in
decreasing order of frequency. A, A single inhibitory region located on the trailing (distal) side of the excitatory region.
B, A region of inhibition located on one of the three nontrailing sides of the excitatory region. C, Two
regions of inhibition on opposite sides of the excitatory region.
D, Inhibition on three sides of the excitatory region.
E, Inhibition on two contiguous sides of the excitatory
region. F, A complete inhibitory surround.
G, An excitatory region only. H, RF
dominated by inhibition. I, RFs not easily assigned to
one of the preceding categories.
|
|
Characterization of the RFs
Because some neurons were not well driven by the random dot
pattern, the RFs derived from such neurons were flat, variable, and not
suitable for quantitative analysis. To identify those neurons, we
adopted a criterion based on estimates of the noise and signal in the
estimated RF (see below). Using this criterion, we removed 51 of the
298 neurons (15%), reducing the sample on which more extensive
analyses were done to 247 neurons.
Although the RF structures ranged widely, as can be seen by examining
the many examples presented in this paper, they typically consisted of
a central region of excitation combined with surrounding, flanking, or
offset regions of inhibition. To give a general picture of the RFs, we
assigned each RF to one of the nine types illustrated in Figure 3.
Thirty-six percent (89 of 247) of the RFs had a single inhibitory
region located on the trailing side of the excitation (i.e., along the
direction in which the pattern was scanned); 16% (39 of 247) had a
single region of inhibition that was located on one of the three
nontrailing sides of the excitatory region; 15% (38 of 247) had two
regions of inhibition on opposite sides of the excitation; 14% (35 of
247) had inhibition on three sides of the excitatory region; 9% (22 of
247) had inhibition on two contiguous sides of the excitatory region;
3% (8 of 247) were clear examples of an excitatory center with an
inhibitory surround; 1% (2 of 247) had only an excitatory region; and
1% (2 of 247) were dominated by inhibition. Only 5% (12 of 247) could
not be assigned to any of these categories. Although we have grouped the RFs to illustrate the proportions of RF types found in area 3b, it
should be emphasized that the RF examples shown in Figure 3 do not
represent distinct, separate groups; rather, they are samples from an
apparently continuous distribution of RF structures.
Neuronal responses are affected not only by the relative positions of
the excitatory and inhibitory regions in the RF but also by their
areas, amplitudes, shapes, and orientations. In the following sections,
we describe these and other properties. In each graph, we chose RFs to
illustrate the points in that graph and also to illustrate the
distribution of RFs we encountered in area 3b. Overall, a sixth of the
total sample (43 of 247 RFs) is illustrated.
RF area
Convergence within the nervous system results in a loss of
specificity for location and, it is presumed, a gain in specificity of
some other kind. The area of each RF that is sensitive to dot stimulation provides an index of this convergence and therefore an
index of the degree to which the neural representation of tactile stimuli has changed.
Excitatory (inhibitory) RF area was computed as the number of positive
(negative) RF bins exceeding a threshold multiplied by the area of skin
represented by each bin (400 µm × 400 µm = 0.16 mm2). If the RF excitatory and inhibitory values
dropped off steeply at their boundaries and there was no noise, no
special considerations would be required to estimate the RF excitatory
and inhibitory areas. However, that is not the case. The RF estimates
do contain noise (see below), and their excitatory and inhibitory
profiles often decline gradually near their borders (see Fig. 6),
making the exact location of their boundaries a matter of definition. The definition used in this study is described in . Briefly, each RF was first smoothed with a Gaussian filter (SD = 300 µm) and then thresholded at a value equal to 10% of the absolute maximum RF value, which was usually the peak excitatory value. A final stage
eliminated small isolated, noise-induced islands of positive and
negative pixels. All RF parameters (except RF noise, discussed below)
were computed after these filtering and thresholding steps.
Distributions of excitatory area, inhibitory area, excitatory to
inhibitory area ratio, and total RF area are shown in Figures 4 and 5.
Excitatory and inhibitory areas were distributed without any obvious
clustering and without strong correlation (Pearson correlation = 0.262). Both excitatory and inhibitory areas varied widely. The
minimum, mean, and maximum excitatory areas were 3, 14, and 43 mm2, respectively. The comparable inhibitory areas
were 1, 18, and 47 mm2, respectively. When plotted
in logarithmic coordinates, neither the excitatory nor the inhibitory
distribution was significantly different from a normal distribution
(p > 0.05; Kolmogorov-Smirnoff one-sample
test; SPSS). The bivariate (excitatory and inhibitory area) log plot
shown in Figure 4 has means of 1.100 and 1.189 log10
units), SDs of 0.224 and 0.262 log10 units, and a
correlation of 0.327. The geometric mean inhibitory area was 23%
larger than was the geometric mean excitatory area, and 67% (164 of
247) of area 3b neurons had larger inhibitory than excitatory areas.
The total RF areas ranged from 7.4 to 64 mm2 (mean
of 32.3 mm2). This mean receptive field area is many
times greater than the primary SAI and RA afferent receptive field
areas measured with single, scanned dots (Johnson and Lamb, 1981 ),
which indicates a large cumulative divergence between the periphery and
area 3b. Larger inhibitory than excitatory RF areas suggest greater
inhibitory than excitatory divergence.

View larger version (41K):
[in this window]
[in a new window]
|
Figure 4.
Scatter plot of excitatory and inhibitory RF
areas. Excitatory area was measured as the total positive area in the
thresholded RF (positive RF regions with values >10% of the peak
absolute RF value, see Results). Inhibitory area was measured as the
total negative-thresholded RF area (negative RF regions with absolute values >10% of the peak absolute RF value). The sample RFs on the
left (A-P) are meant to
illustrate a wide range of combinations of excitatory and inhibitory
areas. The letter above each RF is keyed to a
point in the scatter plot (right).
|
|

View larger version (15K):
[in this window]
[in a new window]
|
Figure 5.
Distributions of RF areas. The
y-axis values represent numbers of neurons
(n = 247 neurons in all graphs). From
left to right, the x-axis
values represent excitatory RF area, inhibitory RF area, total RF area
(sum of excitatory and inhibitory areas), and the ratio of excitatory
to inhibitory area, all on logarithmic scales.
|
|
RF mass
The excitatory and inhibitory effects are graded, and therefore a
single RF region may have a small or large overall effect on firing
rate, depending on the magnitudes of the excitatory or inhibitory
weights within its boundaries. Examples of variation in RF values
across the skin surface are shown in Figure
6. To summarize this aspect of the RF
structure, we integrated these intensities over the excitatory and
inhibitory areas and termed the resulting measures excitatory and
inhibitory "mass." More precisely, the excitatory (inhibitory) mass
was defined as the sum of the positive (negative) RF bin values and has
units of impulse per second per millimeter of stimulus relief. The
thresholded RF was used for this calculation for the same reasons given
for the calculation of RF area.

View larger version (29K):
[in this window]
[in a new window]
|
Figure 6.
Scatter plot of excitatory and inhibitory RF
masses. Excitatory (inhibitory) mass was measured as the integral of
the absolute value of excitatory (inhibitory) RF values over the
excitatory (inhibitory) area of the RF (see Fig. 4). Mass units are
impulses per second per millimeter of stimulus relief (see Results).
The dashed line in the scatter plot
(right) represents balanced excitatory and inhibitory
mass. The sample RFs on the top left
(A-H) illustrate a wide range of
excitatory and inhibitory mass combinations. The line
through each RF passes through the excitatory and inhibitory peaks. The
plots below the two-dimensional RFs (bottom left), which
display the RF values (y-axis) along these
lines, are meant to display the relative intensities of
the excitatory and inhibitory components. The letter
above each RF is keyed to a point in the scatter
plot.
|
|
The distributions of excitatory mass, inhibitory mass, excitatory to
inhibitory mass ratio, and total RF mass are shown in Figures 6 and
7. Like the excitatory and inhibitory
areas, the area 3b excitatory and inhibitory masses were distributed
widely and without any obvious clustering. Both excitatory and
inhibitory mass varied by 50:1 across the population, and they were
more highly correlated with each other than were the excitatory and inhibitory areas. The minimum, mean, and maximum excitatory masses were
210, 2140, and 10,300 mass units, respectively. The comparable values
for inhibitory mass were 125, 1620, and 6830 mass units. As seen for
area, neither distribution was significantly different from a normal
distribution when plotted in logarithmic coordinates (p > 0.20; SPSS). The bivariate (excitatory and
inhibitory mass) log plot shown in Figure 6 has means of 3.176 and
3.080 log10 units, SDs of 0.367 and 0.344 log10
units, and a correlation of 0.56. The geometric mean excitatory mass
was 25% larger than was the mean inhibitory mass. The total RF mass
(excitatory plus inhibitory) ranged from 590 to 14,200 mass units (mean
of 3760 mass units).

View larger version (15K):
[in this window]
[in a new window]
|
Figure 7.
Distributions of RF masses. Axes are as described
in Figure 5 except that the x-axis represents mass
rather than area. Mass units are impulses per second per millimeter of
stimulus relief (see Results).
|
|
Examples of RFs with a range of excitatory and inhibitory masses are
shown on the left of Figure 6. Mass differences are
illustrated using histograms that show the relative excitatory and
inhibitory values in each RF along the lines shown on the RF
plots. In general, the peak inhibitory values were less than were the
peak excitatory values. As a result, even though the inhibitory areas
were typically greater than were the excitatory areas (Fig. 5), the
excitatory masses were, on average, greater than were the inhibitory
masses (Fig. 7). Sixty-one percent (150 of 247) of area 3b neurons had larger excitatory than inhibitory masses.
There was no significant correlation between excitatory mass and
excitatory area (correlation, 0.11; p > 0.05);
however, inhibitory mass and inhibitory area were correlated
(correlation, 0.39; p < 0.01). Excitatory mass was
correlated, whereas inhibitory mass was anticorrelated, with the
average, evoked firing rate (partial correlation coefficients of 0.85 and 0.17, respectively). Excitatory and inhibitory areas, in
contrast, had little if any relationship to the average, evoked firing
rate (partial correlations of 0.00 and 0.08, respectively). An
interesting correlation was observed between the excitatory and
inhibitory masses and the responses of the cortical neurons to
sustained indentation. A subset of the 247 neurons reported here (87 neurons) were studied with sustained indentation (1000 µm) using a
servo-controlled linear motor and a punctate probe (Chubbuck, 1966 ).
Half of those neurons (44 of 87) responded to the sustained indentation
with a sustained impulse rate significantly above the background rate
(p < 0.05; t test; paired difference
between rates in 0.5 sec intervals, one just before and one beginning
1.0 sec after the onset of indentation; 1.5 sec duration; 20 repetitions). Neurons with a sustained response to steady indentation
had excitatory and inhibitory RF masses that were, on average, twice
those of neurons without a sustained response (p < 0.001; t test). Regression analysis showed that this
sustained response was related three to four times more strongly to
excitatory than to inhibitory mass.
Relative locations of excitatory and inhibitory RF regions
Because the relative locations of excitation and inhibition are
critical for stimulus selectivity, we determined the spatiotemporal relationship of the centers of excitatory and inhibitory RF mass. The
center of excitatory (inhibitory) mass was computed from the thresholded RFs as the weighted mean location of all bins within all
excitatory (inhibitory) subregions, with the location of each bin
weighted by its absolute excitatory (inhibitory) value. The result of
this analysis is shown in Figure 8. The
position of each circle in this plot represents the position
of the center of all inhibitory RF mass relative to the position of the
center of all excitatory RF mass for a single neuron. The
area of each circle represents the ratio of
inhibitory to excitatory mass.

View larger version (33K):
[in this window]
[in a new window]
|
Figure 8.
Scatter plot of locations of centers of inhibitory
mass relative to centers of excitatory mass. The origin of the graph on the right represents the center of excitatory mass for
each neuron. The axes, like the RF images shown throughout this paper
(e.g., Fig. 2), are oriented as though the finger was pointing to the left with the glabrous surface down and the RF was
viewed through the back of the finger. The x-axis
represents proximal displacement of the center of inhibitory mass
relative to the center of excitatory mass. The y-axis
represents (anatomical) leftward displacement to the
centers of mass. The size of each circle
is proportional to the ratio of inhibitory to excitatory mass (see
key at top right). The sample RFs on the
left (A-I)
illustrate examples in which the center of inhibitory mass is displaced
distal and leftward (A), leftward
(B), etc. The letter above each RF
is keyed to a point in the scatter plot.
|
|
The dispersion of points around the center of Figure 8 indicates that,
across the population, the inhibitory center of mass of the RF was
found at all positions around the excitatory center of mass. This is
also illustrated in the examples on the left of Figure 8.
However, it is evident that the center of inhibitory mass was displaced
toward the fingertip relative to the excitatory center of mass in most
RFs. Seventy-two percent of all points in Figure 8 fell in a wedge
60° wide to the left of the origin (between the
dashed lines). The most intense concentration was 2.5-3.0 mm distal to the excitatory center of mass. The fact that 72%
of the centers of mass fell within this wedge does not mean that most
of the inhibition fell within this region; only 36% (89 of 247) of
the RFs had most of their inhibitory mass located in this wedge
(see Fig. 3).
Because the stimulus was scanned in the proximal-to-distal direction at
40 mm/sec, a 2.5-3.0 mm distal offset also corresponds to inhibition
with a time lag of 65-75 msec relative to the peak of excitation.
Although a temporal lag seems like the most likely explanation for this
inhibitory offset, a fixed time lag of 65-75 msec would result in a
spatial offset that was proportional to scanning velocity. However,
other data from these same neurons show that this offset is unaffected
by changes in scanning velocity from 20 to 80 mm/sec, so the
explanation must be more complex (J. DiCarlo and K. Johnson,
unpublished observations; see Discussion).
Shape of excitatory and inhibitory RF regions
The shapes of RF subregions were examined because they may
indicate neuronal selectivity for certain tactile forms. Excitatory and
inhibitory RF subregions varied in shape from nearly round to highly
elongated, as can be seen from the many examples displayed here. To
characterize the elongation and orientation of the excitatory and
inhibitory subregions, we segmented each RF into distinct lobes (or
islands). A lobe was defined as a continuous region of excitation or
inhibition in the thresholded RF. It was classified as a dominant
excitatory or inhibitory lobe if it contained at least 80% of the
total excitatory or inhibitory mass. Eighty percent (197 of 247) of the
area 3b RFs had a dominant excitatory lobe defined in this way, and
74% (182 of 247) had a dominant inhibitory lobe. The elongation and
orientation of each dominant lobe were measured by fitting its
distribution of excitatory (or inhibitory) weights with a bivariate
Gaussian density function. Elongation was defined as the ratio of major
to minor SDs along the principal axes of the Gaussian function (i.e.,
aspect ratio). Orientation was defined as the angle of the major axis
counterclockwise from the proximal-distal finger axis. The results are
displayed in Figure 9. The majority of
dominant excitatory (60%, 118 of 197) and inhibitory (73%, 132 of
182) lobes had aspect ratios >1.5. The excitatory lobes were oriented
in all directions (i.e., the orientations were not significantly
different from a uniform distribution; p = 0.641;
SPSS). However, the inhibitory lobes were preferentially bunched around
90° counterclockwise from the proximal-distal finger axis, as can be
seen in Figure 9 (p = 0.005; SPSS).

View larger version (39K):
[in this window]
[in a new window]
|
Figure 9.
Elongation and orientation of dominant excitatory
and inhibitory RF lobes. Dominant lobes [defined as continuous regions
of RF excitation (inhibition) that contained at least 80% of the total
excitatory (inhibitory) mass] were fitted with a two-dimensional Gaussian function to obtain ellipsoidal descriptors of orientation and
elongation (see Results). Top row, The
x-axis of each graph represents the angle of the lobe's
major axis relative to the finger axis. The y-axis
represents the ratio of the lengths of the best-fitting ellipsoidal
major and minor axes. Ellipses at the
right illustrate aspect ratios of 1.0, 2.0, and 3.0. Bottom row, Distributions of excitatory and inhibitory
aspect ratios are shown. The x-axis represents the
aspect ratios of the excitatory and inhibitory lobes. The
y-axis values represent the number of neurons with each
aspect ratio.
|
|
The shape of the excitatory and inhibitory RF regions of a neuron might
confer some stimulus selectivity (e.g., to elongated bars), but the
relationship between excitatory and inhibitory RF regions could confer
even greater selectivity. We examined the relationship between
excitatory and inhibitory lobes in the 78 neurons whose RFs had
dominant excitatory and inhibitory lobes with aspect ratios > 1.5. A
majority of these RFs (65%, 51 of 78) contained dominant excitatory
and inhibitory lobes that were aligned to within 20° of one another
(within the dashed lines of Fig.
10). The number expected by chance is
17 of 78 (22%), and the difference is highly significant
(p < 0.001). In those cases in which the lobes
were misaligned, the inhibitory lobe tended to be oriented at or near
90° counterclockwise from the proximal-distal finger axis relative
to the scanning direction. The points labeled G and
H in Figure 10 are examples of such fields.

View larger version (26K):
[in this window]
[in a new window]
|
Figure 10.
Relationship between excitatory and inhibitory RF
lobe orientations. Only neurons with dominant excitatory and inhibitory lobes, both with aspect ratios > 1.5, were included
(n = 78, see Fig. 9). Excitatory and inhibitory RF
lobe orientations tended to be similar (correlation coefficient = 0.586). Sixty-five percent of the points lie within
±20° of the diagonal (within the dashed lines on the
right). Examples are shown on the left
(A-H).
|
|
Reliability, goodness-of-fit, and generality
Although the RFs presented here provided the best linear
approximations of the neural responses to the random dot patterns, there are several reasons why the estimated RF for a single neuron might be a poor approximation of the true response properties of the
neuron. First, variability in the estimated RF because of variability
in the response may be large enough to render the RF estimate
unreliable. Second, few relationships in biology are truly linear. A
first-order (linear) RF approximation is justified only by the degree
to which it captures the repeatable and therefore potentially
explainable variation in the responses. Third, even if the RF provides
a good description of the neural responses to the random dot patterns,
it may not account well for neural responses to other kinds of stimuli.
Each of these issues is addressed below.
Reliability
The reliability of each RF estimate was assessed in two ways. The
first method was based on a measure of the noise in each RF estimate.
This provided an index of the random pixel-to-pixel variation in the RF
estimate and therefore an indication of the variability in each RF
relative to other RFs. The second method was based on the correlation
between repeated, independent estimates of the same receptive
field.
Standard methods of error analysis (Draper and Smith, 1981 ) provide a
basis for estimating the variance of the noise in the weight values
(the 625 excitatory and inhibitory values that compose a single RF
estimate). The application of these methods to our data suggest that
the correlations between errors in adjacent RF bins are low; however, a
precise estimate of the RF noise requires precise, bin-by-bin knowledge
of the response variance that is difficult to obtain. Instead, we
devised a signal-to-noise index based on direct measurement of the
variation in RF bin values. Once the RF was estimated, it was filtered
with a two-dimensional Gaussian filter with a SD (300 µm) that was
small relative to the spatial dimensions of interest (see ). We
proceeded with the assumption that the spatial frequency spectrum of
the RF noise was similar across neurons and therefore that this
filtering removed the same fraction of the total RF noise variance in
each neuron. In that case, the variance of the removed noise
(difference between the raw and filtered RFs) is a measure of the total
noise in the estimated RF. The magnitude of the RF noise relative to the RF signal is the relevant measure of RF reliability. Thus we
adopted the ratio of the SD of the noise removed by the Gaussian filter
to the peak filtered RF value as an index of the reliability of
the RF estimate. This measure is referred to as the RF noise index.
Figure 11 shows RFs from seven area 3b
neurons with similarly structured RFs and noise indices ranging from 5 to 35%. This figure shows that as the noise index increases, the raw
(unfiltered) RF becomes more variable, and the thresholded RF starts to
include regions that may be attributable to noise in the RF estimate. The noise index was, on average, related inversely to the total number
of action potentials entering the RF estimate. For example, RF
estimates with noise indices between 30 and 35% were based on
recordings whose impulse counts averaged 2500, which corresponds to an
average rate of 4 impulses/sec. Larger numbers of impulses generally
produced lower noise indices and less RF variability. For example, all
neurons that responded with >10,000 impulses to the random dot
stimulation (103 of 247 neurons) had noise indices < 30% (mean
of 8.4%).

View larger version (89K):
[in this window]
[in a new window]
|
Figure 11.
RF noise. RFs of seven area 3b neurons with
similar RF structures illustrating a range of RF noise indices. The
gray-scale plots show the raw estimated RFs and the
effects of filtering (two-dimensional Gaussian filter; SD = 300 µm) and thresholding (see ). The thresholded version was used
to estimate all of the reported RF measures. Histograms, taken along a
horizontal line through the center of each RF, are shown at the
bottom (scaled independently for each neuron). The
dashed vertical line indicates the RF noise index (30%)
above which RF estimates were considered to be too variable for
reliable measurements and were not included in the final
analyses.
|
|
An alternative measure of the reliability of the RF estimates was
obtained by dividing the data into two independent sets, obtaining
independent RF estimates from the two data sets, and comparing the
estimates. The data were divided three different ways. The action
potentials were assigned to separate data sets from (1) alternate (odd-
and even-numbered) sweeps of the random dot pattern, which were
separated by only 200 µm; (2) the first and last halves of each
sweep (duration, 6.25 sec); and (3) the first (~50) and last (~50)
sweeps across the pattern. The three methods of dividing the data
assess different possible reasons for a lack of RF repeatability.
Division by the first method assesses the effect of noise alone, for
there is no other reason that RF estimates obtained from interleaved
data should be different. The second and third methods introduce other
potential reasons for differences. Both involve estimates from
responses evoked by nonoverlapping stimuli. Division by the second
method also reveals differences that might be caused by short-term
adaptation between the first and second halves of each 6.25 sec (250 mm
at 40 mm/sec) sweep across the random dot pattern. Division by the third method reveals differences that might be caused by longer-term changes, because the neuron is subjected to continual stimulation over
a period of almost 14 min.
An estimate of the RF and its noise index, as defined above, was
obtained from each of the six sets of action potentials (data division
in three different ways). Repeatability was assessed by Pearson's
correlation between the two independent RF estimates obtained from each
of the three divisions. The correlation calculation was effected by
plotting the pairs of weights from matching RF bins in a
two-dimensional plot (data not shown) and by calculating the
correlation of this bivariate scatter. Results from the first data
division method are displayed in Figure
12, which shows that the correlation
was, as expected, a declining function of the RF noise index and that
RFs with noise indices < 30% were highly reproducible (mean
correlation = 0.893; SD = 0.078). This high degree of
correlation is not attributable to similarities in the RFs among the
population of area 3b neurons, because the mean correlation of the RFs
of different neurons with RF noise indices < 30% was only 0.465. These results and our subjective assessment of data like those
illustrated in Figure 11 caused us to adopt an RF noise index of 30%
as the threshold for inclusion in the study. Examples of RFs with 11, 20, and 45% noise indices in Figure 12 show that this is a
conservative threshold. The examples at 11 and 20% illustrate the less
repeatable cases (lower correlations) at each of those noise indices.
In contrast, the example at 45% was chosen to illustrate how reliable
RF estimates with high noise indices can be.

View larger version (28K):
[in this window]
[in a new window]
|
Figure 12.
RF repeatability. Examples of three pairs of
independent RF estimates from three cortical neurons are shown on the
right. Two independent RF estimates for each neuron were
computed from spike data collected on even- and odd-numbered drum
scans, respectively (see the Results). The correlation coefficients
plotted on the y-axis of the graph on the
left and displayed to the right of each
pair of RFs are correlations of the 625 paired RF weights of the two
independent RF estimates from each neuron. The x-axis represents the RF noise index displayed in Figure 11. Note that the
noise index for each RF in this figure is based on half the data and is
thus 2 greater than the noise index of the RF obtained from all the
data. The labels A, B, and
C above the example RFs are keys to
points in the graph on the left.
A and B were chosen to illustrate the
less repeatable cases among neuronal RFs with noise indices near 10 and
20%, respectively. C was chosen to show how good the
repeatability can be even when the noise index is over 40%, which is
well above the cutoff (dashed vertical line) used in
this study for measuring RF parameters.
|
|
When examining the RF pairs in Figure 12, it should be noted that they
are each based on only half the data that went into the RF estimates
for each neuron. The noise indices for each point in the scatter plot
are the average of the noise indices obtained from the two RFs computed
from the interleaved data sets and are thus larger than the noise
indices of the RFs computed using the full data set (by approximately a
factor of  ). The plot in Figure 12 is intended
mainly to show the relationship between the measure that we used as an
index of the variability of the RF estimate (noise index) and a direct
measure of RF variability (the correlation between repeated
measures).
The second and third methods of segregating the data resulted in two
more correlation coefficients for each neuron. These correlation
coefficients were almost identical to the correlation coefficients
based on the interleaved data division (Fig. 12), and plots of these
correlation coefficients versus the noise index were indistinguishable
from Figure 12. Pairwise analysis of correlation coefficients from
single neurons showed a slight drop relative to the correlations based
on the interleaved data. The correlations based on the first and last
50 sweeps (division by method three) were, on average, 0.041 correlation units lower than were the correlations based on interleaved
sweeps. The correlations based on the first and last halves of each
6.25 sec sweep (division by method two) were, on average, 0.082 units
lower than were the correlations based on interleaved sweeps. We
attribute these small differences to the fact that the first
correlation coefficient was based on independent responses to the same
dot pattern, whereas the latter two correlations were based on RFs
obtained from responses to different random dot patterns.
Goodness of fit
A brief description of our approach to measuring goodness-of-fit
follows; a complete description is given in . The goal of the
analysis is to determine how well the actual neural response is
described by the response predicted by the linear RF (see Fig. 2C-E). The neural response to any stimulus
consists of two parts, a random part (noise) and a repeatable part,
which is, in principle, explainable and is the part we aim to
understand. The repeatable or explainable part is, in turn, composed of
two parts, a part that can be explained as the sum of the excitatory
and inhibitory contributions described by the RF (the linear part) and
the remainder, which is by definition the nonlinear part. The
goodness-of-fit measure presented here is the fraction of the
repeatable or explainable variance that is, in fact, explained by the
summation effects specified by the RF. The regression method provides a
measure of the variance explained by the RF (variance of the linear
part). All that remains to measure goodness-of-fit is an estimate of the repeatable (linear plus nonlinear) variance.
The repeatable part of the neural response can be estimated as the mean
of many repeated trials, but this is impractical when a single trial
requires almost 14 min of stimulation. However, the noise variance can
be estimated accurately from only two trials, which provides an
effective way of estimating the variance of the repeatable part. By
definition, the noise and repeatable variations are uncorrelated, so
the variance of the repeatable part is the difference between the total
response variance and the noise variance. The goodness-of-fit measure
is the percent of this "explainable" variance that was actually
explained by the estimated RF.
The distribution of goodness-of-fit values is shown in Figure
13. A goodness-of-fit value of 100%
indicates that the RF described the repeatable component of the neural
response completely. A value of 0% indicates that the RF explained
none of the repeatable part. Figure 13 shows that the distribution of
goodness-of-fit values for area 3b neurons was approximately normal and
unimodal, indicating no clear distinction between neurons that have
predominantly linear and those that have predominantly nonlinear
responses. The mean and SD of the distribution were 40.3 and 13.3%,
respectively. For comparison, note that the goodness-of-fit of the RF
shown in Figure 2 was 53%. The mean of the distribution in Figure |