Abstract
Efficient processing of incoming sensory input is essential for an organism's survival. A growing body of evidence suggests that sensory systems have developed coding strategies that are constrained by the statistics of the natural environment. Consequently, it is necessary to first characterize neural responses to natural stimuli to uncover the coding strategies used by a given sensory system. Here we report for the first time the statistics of vestibular rotational and translational stimuli experienced by rhesus monkeys during natural (e.g., walking, grooming) behaviors. We find that these stimuli can reach intensities as high as 1500 deg/s and 8 G. Recordings from afferents during naturalistic rotational and linear motion further revealed strongly nonlinear responses in the form of rectification and saturation, which could not be accurately predicted by traditional linear models of vestibular processing. Accordingly, we used linear–nonlinear cascade models and found that these could accurately predict responses to naturalistic stimuli. Finally, we tested whether the statistics of natural vestibular signals constrain the neural coding strategies used by peripheral afferents. We found that both irregular otolith and semicircular canal afferents, because of their higher sensitivities, were more optimized for processing natural vestibular stimuli as compared with their regular counterparts. Our results therefore provide the first evidence supporting the hypothesis that the neural coding strategies used by the vestibular system are matched to the statistics of natural stimuli.
Introduction
Efficient processing of sensory input is critical for an organism's survival. Because some stimuli are more likely to be encountered than others, it is commonly assumed that the statistics of the natural sensory environment influence the brain's coding strategies (Attneave, 1954; Laughlin, 1981; Barlow, 2001; Simoncelli and Olshausen, 2001). However, this hypothesis remains untested for the vestibular system, which processes head motion information and thus plays a vital role in the stabilization of gaze and control of balance and posture, as well as perception of spatial orientation and self-motion. Notably, addressing whether the coding strategies used by the vestibular system are optimized to process natural stimuli first requires knowledge of the head motion experienced during natural behaviors.
The vestibular system is phylogenetically the oldest part of the inner ear and is highly conserved throughout evolution (Wallman et al., 1982). Vestibular end organ sensors detect head motion across six dimensions (three axes of translation detected by the otoliths and three axes of rotation detected by the semicircular canals; Angelaki and Cullen, 2008; Cullen, 2011). In the absence of motion, afferents display a wide range of resting discharge variability and are characterized as regular or irregular—a classification that correlates with differences in morphological features and response dynamics (Baird et al., 1988; Fernández et al., 1988; Goldberg, 2000; Straka et al., 2005). Afferent response dynamics have been traditionally measured using artificial self-motion stimuli such as sinusoids or noise with low intensity to ensure that neurons are constrained to their linear ranges (Goldberg, 2000; Sadeghi et al., 2007b; Jamali et al., 2013). To date only a few studies have reported nonlinear responses to high-intensity vestibular stimuli (Fernández and Goldberg, 1976; Sadeghi et al., 2007a). However, a recent study performed in humans has revealed that vestibular stimuli can reach large intensities during everyday activities that could elicit nonlinear responses from afferents (Carriot et al., 2014).
Thus, it currently remains unknown whether peripheral vestibular neural responses to natural input can be determined from those to artificial stimuli and whether coding strategies are constrained by the statistics of the natural environment. Here we characterized for the first time the structure of self-motion signals that are experienced by rhesus monkeys during typical natural behaviors. Existing linear models of peripheral processing incorrectly predicted that such stimuli elicit neural responses outside of the physiological range. Accordingly, we then recorded from afferents and developed new models incorporating static nonlinearities that accurately described responses to naturalistic stimuli. Finally, using these models, we tested whether coding by peripheral afferents is constrained by natural stimulus statistics. We found that irregular afferents, because of their higher sensitivities, were more optimized to process natural stimuli than their regular counterparts. Our results therefore have important implications for understanding the contributions of different classes of peripheral sensory neurons to the encoding of natural stimuli, and further provide evidence supporting the hypothesis that the neural coding strategies used by the vestibular system are matched to the statistics of natural stimuli.
Materials and Methods
Experimental preparation.
All experimental protocols were approved by the McGill University Animal Care Committee and were in compliance with the guidelines of the Canadian Council on Animal Care. Two male macaque monkeys (Macaca fascicularis) were prepared for chronic extracellular recording using aseptic surgical techniques. The surgical preparation was similar to that previously described (Dale and Cullen, 2013). Briefly, using aseptic surgical techniques and under isoflurane anesthesia (0.8–1.5%), a stainless steel post was secured to the animal's skull with stainless steel screws and dental acrylic resin, allowing complete immobilization of the head during the experiments. The implant also held in place a recording chamber oriented stereotaxically toward the vestibular nerve where it emerges from the internal auditory meatus. Finally, an 18–19 mm diameter eye coil (three loops of Teflon-coated stainless steel wire) was implanted in the right eye behind the conjunctiva. After the surgery, buprenorphine (0.01 mg/kg, i.m.) was administered as postoperative analgesia every 12 h for 2–5 d depending on the animal's pain level, and Anafen (2 mg/kg, then 1 mg/kg subsequent days) was used as an anti-inflammatory. In addition, cefazolin (25 mg/kg, i.m.) was injected twice daily for 10 d. Animals were given at least 2 weeks to recuperate from the surgery before any experiments began.
Head movement recording.
Head movements were recorded using a microelectromechanical systems (MEMS) module (iNEMO platform, STEVAL-MKI062V2; STMicroelectronics), as done recently for humans (Carriot et al., 2014). The module combined three linear accelerometers (recording linear accelerations along the fore-aft, lateral, and vertical axes) and three gyroscopes (recording angular velocity about pitch, roll, and yaw). To extend the velocity range to ±2000 deg/s, the MEMS module was augmented with a STEVAL-MKI107V2 three-axis gyroscope. The MEMS module, a battery, and an microSD card were encased in an extremely light (64 g) and small (35 × 35 × 15 mm) enclosure, which was firmly secured to the head posts of two macaque monkeys. Specifically, the plane spanned by the fore-aft and lateral axes of the MEMS module was set parallel to the horizontal stereotaxic plane (i.e., the plane passing through the inferior margin of the orbit to the external auditory meatus; Carriot et al., 2014). Data from each of the six sensors were sampled at 100 Hz and recorded wirelessly on an microSD card. Since we were interested in afferent responses to linear motion, we recorded the total gravito-inertial acceleration (GIA; i.e., the sum of gravity and linear motion), which is henceforth referred to as linear acceleration. Note that the data were reported in sensor coordinates in Figures 1 and 2.
Each monkey was released separately into a large familiar play cage (9.5 m3, with a multilevel wooden platform) where it was able to freely move and interact with another monkey from our colony for 160 min while a camera was recording its behavior. We segregated activities into three groups: (1) behaviors associated with low levels of activity that included monkey sitting observing its environment, holding and playing with objects, or grooming; (2) behaviors associated with medium levels of activity that included walking around and foraging; and (3) behaviors associated with high levels of activity that included running, jumping, and climbing, as well as rapid head shaking.
Single-unit recording.
During the experiment, the head-restrained monkey was seated in a primate chair mounted on top of a vestibular table in a dimly lit room. The vestibular nerve was approached through the floccular lobe of the cerebellum, as identified by its eye-movement-related activity (Lisberger and Pavelko, 1986; Sadeghi et al., 2007b; Jamali et al., 2013); entry to the nerve was preceded by a silence, indicating that the electrode had left the cerebellum. Extracellular single-unit activity of semicircular canal and otolith afferents was recorded using glass microelectrodes (24–27 MΩ), the depth of which was controlled using a precision hydraulic microdrive (Narishige). Rotational head velocity and linear acceleration were measured using an angular velocity sensor (Watson) and a linear accelerometer (ADXL330Z; Analog Devices) sensor firmly secured to the animal's head post. Note that vestibular afferents could not be recorded during natural movements due to the technical difficulty of maintaining isolation from single eighth nerve afferents using high-impedance glass micropipettes while the animal underwent such complex and high-intensity dynamic stimuli. Therefore, we used naturalistic stimuli whose time course mimicked that of the natural stimuli. These were generated as done previously (Jamali et al., 2009; Brooks and Cullen, 2013; Carriot et al., 2013). As both naturalistic and natural stimuli tended to consist of excursions whose time course was approximately bell shaped, we quantified them using the following measures: intensity I (maximum value), the full width at half maximum (FWHM), and area under the curve (AuC), which were all computed from the absolute value of the stimulus.
Natural rotation stimuli displayed large heterogeneities as all three quantities were distributed over large intervals (I: 100–1500 deg/s, FWHM: 84–582 ms, AuC: 13–173 deg). The characteristics of our naturalistic rotation stimuli were all within these ranges (I = 356 ± 41 deg/s, FWHM = 277 ± 19 ms, and AuC = 100 ± 8 deg). Natural translation stimuli also displayed large heterogeneities (I: 0.8–8 G, FWHM: 24–228 ms, and AuC: 0.9–5.8 m/s). The characteristics of our naturalistic translational stimuli were also all within these ranges (I = 0.87 ± 0.35 G, FWHM = 146 ± 30 ms, and AuC = 1.2 ± 0.3 m/s).
These stimuli were applied manually since the required velocities and accelerations were beyond the range of our controller. We focused our analysis on horizontal canal afferents (N = 11 regular and N = 25 irregular) and utricle otolith afferents (N = 26 regular and N = 27 irregular), since our motion platform was designed to apply stimuli along these axes of motion. For each afferent, stimuli were applied along the preferred axis of rotation (i.e., yaw) or translation (horizontal) as previously described (Sadeghi et al., 2007b; Jamali et al., 2013). For otolith afferents, this was achieved by rotating the monkey's head such that the neuron's preferred direction was aligned with the apparatus' axis of translation. During experimental sessions, unit activity, horizontal and vertical eye positions, and head-velocity signals were recorded on digital audiotape for later playback. During playback, action potentials from extracellular recordings were discriminated using a windowing circuit (BAK Electronics). Eye position and head-velocity signals were low-pass filtered at 250 Hz (eight-pole Bessel filter) and sampled at 1 kHz.
We generated binary spike trains, with 1 kHz sampling rate for each afferent. Periods of spontaneous activity (i.e., no head movement) were used to classify afferents as regular or irregular according to the variability in their baseline spiking activity as quantified by the coefficient of variation, CV = σISI/μISI, where μISI is the mean interspike interval (ISI) and σISI is the SD of the ISIs. Because CV varies with mean, the mean ISI, a normalized coefficient of variation (CV*), was computed using the ISI distribution to quantify resting discharge variability as described previously (Goldberg et al., 1984). Afferents with CV* < 0.1 were classified as regular, while those with CV* ≥ 0.1 were classified as irregular as done previously (Sadeghi et al., 2007b; Jamali et al., 2013). The firing rate r(t) was obtained by convolving the binary spike train with a Gaussian centered on the spike time with 10 ms SD. For the purposes of illustration only, we computed the cross-correlation function between the firing rate r(t) and the stimulus s(t) and noted the lag at which it was maximal. This lag was then used to align the response with the stimulus.
Statistical analysis of natural head movements.
To test whether natural head movement signals were characterized by probability distributions that differed significantly from Normal, surrogate datasets of the same length with the same mean and variance but drawn from a Gaussian distribution were generated. Probability distributions for both the data and the surrogate datasets were generated with bin widths of 20 deg/s for angular velocities and 0.1 G for linear accelerations. The deviation from normality was quantified using the kurtosis k =
Linear models of vestibular afferent responses.
We first used previously established linear models to predict afferent responses to the experimentally recorded natural stimuli. Specifically, we assumed that the output firing rate r(t) in response to stimulus s(t) is given by the following: r(t) = (H * s)(t) + r0, where the asterisk denotes a convolution with a filter H(t) and r0 is the baseline (i.e., in the absence of stimulation) firing rate. We used r0 = 104 Hz for canal afferents and 79 Hz for otolith afferents, which corresponds to average baseline firing rates observed experimentally (Sadeghi et al., 2007a; Jamali et al., 2013). Before filtering with these transfer functions, natural angular velocity recorded was projected onto the horizontal semicircular canal plane as done previously (Carriot et al., 2014) using an angle of 22° nose down (Sadeghi et al., 2009).
For canal afferents, the Fourier transform of H(t) (i.e., the transfer function) is given by the following: with S = i2πf. For regular afferents, parameter values were as follows: k = 2.83 (spikes/s)/(deg/s), T1 = 0.0175 s, T2 = 0.0027 s, and Tc = 5.7 s. For irregular afferents, parameter values were as follows: k = 27.09 (spikes/s)/(deg/s), T1 = 0.03 s, T2 = 0.0006 s, and Tc = 5.7 s. Overall, these parameter values are similar to those used previously (Hullar et al., 2005) and were chosen to best match available experimental data (Ramachandran and Lisberger, 2006; Sadeghi et al., 2007a).
For otoliths, we used the following transfer function (Angelaki and Dickman, 2000): with k = 59.0106 (spikes/s)/G; k1 = 0.0643, k2 = 2.208, a = 0.0138 s, and b = 0.0255 s for regular afferents; and k = 112.7417 (spikes/s)/G, k1 = 0.3084, k2 = 2.6834, a = 0.0136 s, and b = 0.0318 s for irregular afferents. As for canal afferents, parameter values were chosen to best match available experimental data (Jamali et al., 2013). These expressions were used to generate the linear predictions of firing rate responses to natural stimuli shown in Figure 3.
Linear–nonlinear cascade models.
To characterize our experimentally observed nonlinear responses of vestibular afferents to naturalistic stimuli, we used linear–nonlinear (LN) cascade models (Chichilnisky, 2001; Massot et al., 2012) in which the firing rate response r(t) is given by the following: where T is a nonlinear function. We note that, unlike previously used one-stage nonlinear models in which the firing rate at a given time is a nonlinear function of the current stimulus value (Fernández and Goldberg, 1976; Sadeghi et al., 2007a), LN models explicitly take into account neuronal temporal filtering properties (Chichilnisky, 2001) that are known to be prevalent in vestibular afferents (for review, see Goldberg, 2000). The LN model was fit to each individual recorded afferent and was used in Figures 4 and 5.
We first estimated the transfer function H(f) using: where Psr(f) is the cross-spectrum between the stimulus s(t) and binary spike train response, and Pss(f) is the power spectrum of the stimulus s(t). Note that we only used low-amplitude portions of the stimulus to ensure that the afferent was constrained to its linear regime. The baseline firing rate was then added to this to form the linear prediction rL(t), which was then plotted as a function of the experimentally observed firing rate r(t) to reveal the shape of the nonlinear function T as done previously (Massot et al., 2012). We found that the experimental data could be well fit by a sigmoidal function: in which c1, c2, and c3, are fit parameters, and erf(·) is the error function. Based on published results (Fernández and Goldberg, 1976; Sadeghi et al., 2007a) the parameter c3, which determines the maximum firing rate, was always >270 spikes/s. We also found that the variance of the firing rate distribution could be well fit by a combination of error functions: In practice, the first half of the experimental data was used to obtain the transfer function and the shape of the nonlinear function T. The LN model was then used to generate a prediction of the response to the second half of the stimulus that was compared with the experimental data. The model's performance was quantified using R2. The linear model was generated by assuming that T(x) = x instead and was used to generate a linear prediction of the response that was then also compared with the experimental data and its performance was also quantified using R2. Note that the functions plotted in Figure 5 are population averages.
Optimal stimulus distribution that maximizes information.
We used the LN model described above for which the response is given by the following:
with Tsig as a nonlinear sigmoidal function. Assuming a stimulus with probability distribution p(s) and a response with probability distribution p(r), the mutual information between stimulus and response is given as follows (Cover and Thomas, 1991; Rieke et al., 1996):
where H(r) = −
In the noise-free case, there is a deterministic relationship between the linear prediction and the response and we have p(r | rL) = Tsig(rL)δ(r − rL). The mutual information is then given by the following: I(rL, r) = H(r) and is maximum when (Laughlin, 1981; Wark et al., 2007): where T′sig is the derivative of Tsig.
If we consider that there is noise, it is not possible to find an analytical expression for the probability distribution popt(rL) for which the mutual information is maximum. However, previous studies have found an approximate solution if we assume that r = Tsig(rL) +
Finally, to compute the optimal stimulus distribution popt(s), we inverse filtered a signal with probability distribution popt(rL) whose power spectrum was equal to that of the filtered natural stimulus.
Quantifying the distance between probability distributions.
To quantify the distance between two probability distributions p(x) and q(x), we used the Jensen–Shannon (JS) divergence defined by the following: where m(x) = (p(x) + q(x))/2 and DKL is the Kullback–Leibler divergence defined by the following:
Results
Statistics of natural vestibular stimuli
We recorded the vestibular input experienced by freely moving rhesus monkeys during natural behaviors using a MEMS module that was attached to their heads (Fig. 1A; see Materials and Methods). The MEMS module consisted of three linear accelerometers measuring linear acceleration along the fore-aft, lateral, and vertical axes and three gyroscopes measuring angular velocity of rotations along these axes (i.e., pitch, roll, and yaw). We note that the linear acceleration signals that we recorded correspond to the total GIA (i.e., the sum of gravity and linear motion). We found that natural vestibular stimuli reached large intensities in all six axes of motion (Fig. 1B, left), and were described by probability densities that were not Gaussian as they displayed characteristically long tails (Fig. 1B, right). This was quantified by large kurtosis values that were all significantly greater than that obtained for a Gaussian distribution (Fig. 1C, compare black and red bars), as seen in other sensory modalities (Simoncelli and Olshausen, 2001).
To test whether specific behaviors influence stimulus statistics, we next grouped behaviors into three categories associated with low, medium, and high levels of activity and quantified the statistics of the vestibular input corresponding to each category (Fig. 2A; see Materials and Methods). Specifically, periods where the monkey was sedentary and observing its environment, holding and playing with objects, or grooming behaviors were associated with low levels of activity, while those where the monkey was walking around and/or foraging were associated with medium levels of activity. Periods where the monkey was running, jumping, climbing, and was rapidly shaking its head were associated with high levels of activity. Overall, low, medium, and high levels accounted for 63, 28, and 9% of all activities, respectively. We found that the vestibular input experienced during behaviors associated with all three categories displayed probability densities that were for the most part not Gaussian (Fig. 2B) as quantified by large kurtosis values (Fig. 2C). As expected, vestibular signals associated with low, medium, and high levels of activity were distributed over progressively larger ranges across all six motion dimensions (Fig. 2B). The mean, SD, and kurtosis values obtained across low, medium, and high levels of activity for all six motion dimensions are given in Table 1.
Traditional linear models of vestibular processing cannot accurately predict responses to natural stimuli
We next tested whether previously published models of early vestibular processing that have been shown to accurately reproduce the responses of peripheral otolith and semicircular canal afferents to artificial stimuli (Goldberg, 2000; Cullen, 2004, 2011; Sadeghi et al., 2007b; Massot et al., 2011, 2012; Jamali et al., 2013) could accurately predict responses to natural stimuli. Linear systems identification techniques were used to find the optimal transfer function that best captures the input–output relationship between head motion and firing rate and predict the firing rate response, henceforth referred to as the linear prediction (Fig. 3A). We used established expressions for the transfer functions of regular and irregular semicircular canal and otolith afferents (Angelaki and Dickman, 2000; Massot et al., 2012) and fitted these to available data (Sadeghi et al., 2007a; Jamali et al., 2013). Specifically, prior studies have shown striking differences between the neural coding strategies used by semicircular canal and otolith afferents (Sadeghi et al., 2007b; Jamali et al., 2013). Most notably, while the neural sensitivities for irregular afferents are larger than those of their regular counterparts, this difference is much more pronounced for otolith than for semicircular canal afferents (Fig. 3B).
To test whether these linear models could give rise to physiologically realistic responses during natural behaviors, we used all stimulus waveforms measured during low, medium, and high levels of activity for both rotations and translations. Example stimulus waveforms used are shown in the top of Figure 3C for yaw rotations (left) and fore-aft translations (right). We found negative linear firing rate predictions for both otolith and semicircular canal afferents, which is of course outside of the physiological range. The middle and bottom of Figure 3C show the responses of regular and irregular semicircular canal (left) and otolith (right) afferents to the example waveforms, respectively. Moreover, the models' tendency to incorrectly predict negative firing rates was greater for stimuli encountered during high levels of activity (e.g., 1500 deg/s and 8 G). For the same stimulation, linear models also showed a greater tendency to incorrectly predict negative firing rates for irregular afferents due to their larger sensitivities. Quantification of the percentage of time during which the firing rate prediction is negative revealed low values (<5%) across all six motion dimensions for regular afferents. However, for irregular afferents, these values exceeded 20% for high levels of activity (Fig. 3D). We conclude that existing linear models of vestibular afferent responses (for review, see Goldberg, 2000) cannot reliably predict responses to natural stimuli, and that this is especially the case for irregular afferents during the high-amplitude stimuli associated with high levels of activity.
Building LN models of peripheral vestibular afferent responses to naturalistic stimuli
It follows that if our conclusion above is correct, then natural stimuli will elicit nonlinear responses from vestibular afferents. To explicitly test this, we recorded afferent responses to high-intensity naturalistic stimuli consisting of either rotation or linear motion whose time course mimicked that seen during natural movements (Fig. 1B, insets). We then used LN models (Chichilnisky, 2001) to characterize the nature of the nonlinearity in the input–output relationship of vestibular afferents (Fig. 4A). These models assume that the output firing rate is calculated by first linearly filtering the input stimulus via a transfer function and then passing the resulting linear prediction through a static nonlinear function (Fig. 4A, nonlinear prediction; see Materials and Methods). In practice, for each canal and otolith afferent (Fig. 4B), we estimated the transfer function from the response to the low-amplitude portions of the stimulus (see Materials and Methods). We then estimated the shape of the nonlinearity by plotting the actual firing rate as a function of the linear prediction from the whole dataset. Figure 4C compares the performance of both linear and LN models when predicting the firing rate response of an example irregular otolith afferent to naturalistic stimuli. We found that both models provided comparable predictions for low-amplitude stimulus segments (Fig. 4C, left). In contrast, we found that the LN model gave a better prediction of the response to high-amplitude stimulus segments (Fig. 4C, right) as quantified by more than a twofold larger R2 value (Fig. 4D, inset). This is because the afferent displays cutoff (i.e., ceases to fire action potentials) and saturation during the large negative and positive portions of the stimulus, respectively, which cannot be predicted from the linear model alone (Fig. 4C). Consequently, the firing rate response as a function of the linear prediction showed strong deviations from the identity line that were well fit by a sigmoidal function (Fig. 4D). Qualitatively similar results were seen for an example irregular canal afferent (Fig. 4E,F). Thus, LN cascade models can accurately predict afferent responses to high-amplitude naturalistic stimuli mimicking natural signals. Figure 5 shows the results of this same analysis for our populations of canal and otolith afferents where the population-averaged actual firing rate is plotted as a function of the linear prediction. On average, the goodness of fit of LN models was always significantly larger than that of linear models for both regular (Fig. 5A,C) and irregular (Fig. 5B,D) afferents. However, this difference was most striking for irregular afferents (Fig. 5, compare A, B and C, D). Accordingly, consistent with our original hypothesis, we conclude that vestibular afferents do indeed display nonlinear responses to high-amplitude naturalistic stimuli that can be accurately described by LN models.
Neural heterogeneities in tuning and in trial-to-trial variability influence optimal coding
Our above analyses of canal and otolith afferents demonstrated that those with more irregular resting discharges are likely to be driven into a nonlinear regime during naturalistic stimulation. Thus, peripheral vestibular neural responses to natural input cannot be predicted from those to low-intensity artificial stimuli. We next asked whether, and if so, how the observed differences in coding between regular and irregular afferents (i.e., likelihood of entering the nonlinear regime) as well as those seen between canal and otolith afferents (i.e., differences in sensitivity) are constrained by the statistics of natural sensory input to optimize information transmission. Specifically, are differences in resting discharge variability optimized to code for different features of natural input? Moreover, is such optimization different for canal and otolith afferents?
Theoretically, one potential optimal coding strategy is to devote the most neural resources to code for stimuli that will occur most frequently in the natural environment, which maximizes information transmission (Laughlin, 1981; Simoncelli and Olshausen, 2001; Wark et al., 2007). If we neglect trial-to-trial variability in the neural response (i.e., deterministic), then information is maximized when the firing rate response is uniformly distributed (see Materials and Methods). In this case, the stimulus distribution that maximizes information transmission (i.e., the optimal stimulus distribution; see Eq. 9) is simply the derivative of the neuron's input–output relationship (Fig. 6A). Figure 6B shows two different hypothetical sigmoidal input–output relationships characterized by different levels of steepness. It is thus expected that regular and irregular afferents will have different optimal stimulus distributions due to their differences in sensitivity (Fig. 3B).
Additionally, it is well known that regular and irregular afferents display different amounts of trial-to-trial variability. It is thus important to consider the influence of such variability on the optimal stimulus distribution. Theoretical studies have derived an approximate expression for the optimal stimulus distribution taking into account trial-to-trial variability in the form of variance in the firing rate response to a given stimulus (Brunel and Nadal, 1998; McDonnell and Stocks, 2008; see Materials and Methods). Figure 6C shows different distributions for the firing rate variance and how they influence the optimal stimulus distribution for the input–output relationship shown in Figure 6A. In particular, if the firing rate variance is normally distributed, then the optimal stimulus distribution is wider than that obtained in the deterministic case (Fig. 6C, compare black and gray curves). In contrast, a uniform firing rate variance distribution does not alter the optimal stimulus distribution (Fig. 6C, compare black and dashed curves).
Thus, to compute the optimal stimulus distributions for different afferent classes, it is first necessary to characterize—from experimentally recorded afferent responses—the firing rate variance as a function of the stimulus to naturalistic motion. Figure 7A shows the firing rate variance as a function of the stimulus for regular canal (top left), irregular canal (bottom left), regular otolith (top right), and irregular otolith (bottom right). Notably, as expected, irregular afferents displayed larger firing rate variances than regular afferents. Importantly, however, the firing rate variance decreased sharply for stimulus intensities that elicit either rectification or saturation (Fig. 5). This effect was most pronounced for irregular afferents. Figure 7B shows the optimal stimulus distribution neglecting trial-to-trial variability (black curves) and taking into account the variance distributions shown in Figure 7A (dashed curves). While some differences are apparent in that the distributions taking into account trial-to-trial variability had slightly larger extent than those obtained by neglecting trial-to-trial variability, both distributions had the same shape and largely overlapped for all afferent classes. To quantify the similarity between distributions, we computed the JS divergence (see Materials and Methods). We found that the JS divergence between regular and irregular afferents with variability was significantly greater than that obtained for either regular or irregular afferents with and without variability (p ≪ 10−3 in all cases, Wilcoxon rank sum tests with Bonferroni correction; Fig. 7B, insets). Thus, we conclude that the different levels of trial-to-trial variability displayed by both regular and irregular canal and otolith afferents have minimal influence on the stimulus distributions that maximize information transmission.
Irregular afferents are better optimized to code for natural stimuli than regular afferents
We next explicitly tested whether and how the neural coding strategies used by the peripheral vestibular system are constrained by the statistics of stimuli encountered in the natural environment by comparing the optimal stimulus distributions obtained for each afferent class to the natural distributions. While differences in the trial-to-trial variability do not have much impact on the shape of the optimal distribution (Fig. 7B), we hypothesized that the large differences in sensitivity observed for regular and irregular afferents should strongly impact their optimal stimulus distributions. Specifically, as illustrated above in Figure 6B, we expected that the larger sensitivities of irregular afferents should lead to narrower optimal stimulus distributions. Figure 8A shows the population-averaged optimal stimulus distributions with variability for regular (red) and irregular canal afferents (green), as well as when both populations are combined (cyan), with the natural stimulus distribution (black) superimposed. To facilitate comparison with Figure 7, these distributions are also plotted on a linear scale in the inset of Figure 8A. Consistent with our hypothesis, the optimal stimulus distribution for irregular afferents was indeed narrower than that obtained for regular afferents. Importantly, we further found that the optimal stimulus distribution of irregular afferents was closer to natural stimulus distribution. Overall, the JS divergence between the optimal stimulus distribution with variability and the natural distribution was always significantly lower for irregular afferents than regular afferents. This was true when we considered either the stimuli arising from all levels of activity or the stimuli arising from either low, medium, or high levels of activity alone (Fig. 8B). We performed a comparable analysis on our population of otolith afferents and obtained qualitatively similar results. Notably, the optimal stimulus distribution (with variability) of irregular otolith afferents was significantly narrower and better matched to the natural distribution in the fore-aft direction when compared with that obtained for regular afferents (Fig. 8C) as quantified by significantly lower JS values (Fig. 8D). Qualitatively similar results were obtained when comparing the optimal stimulus distributions of irregular and regular otolith afferents to the natural distribution in the lateral direction (population-averaged JS divergence values for irregular vs regular afferents; low: 0.39 ± 0.08 vs 0.73 ± 0.05, medium: 0.33 ± 0.07 vs 0.70 ± 0.04, high: 0.15 ± 0.06 vs 0.56 ± 0.07, all: 0.30 ± 0.07 vs 0.67 ± 0.05, p < 0.05 in all cases). Further, the optimal stimulus distributions obtained by pooling across all afferents (i.e., including both regular and irregular subgroups) were not better matched to the natural stimulus distribution than those obtained when only considering irregular afferents (Fig. 8A,C, compare green and cyan curves). Finally, we note that irregular otolith afferents were more optimized than irregular canal afferents (Fig. 8, compare A, C) as quantified by significantly lower JS values (p ≪ 10−3 in all cases, Wilcoxon rank sum tests).
Discussion
Summary of results
Here we studied the statistics of natural vestibular signals experienced by monkeys to determine whether coding is optimized for such stimuli. We found that natural vestibular inputs reached large intensities as evidenced by probability distributions with long tails across all six motion dimensions, similar to those described by Carriot et al. (2014) for human. We further found that well established linear models of early vestibular processing could not predict semicircular canal or otolith afferent responses to natural vestibular stimuli. Instead they incorrectly predicted physiologically impossible negative firing rates in response to large-amplitude “off-direction” movements. This was particularly true for the coding of high-intensity activities (e.g., running, jumping, climbing, etc.) by irregular afferents. Accordingly, to develop accurate models, we recorded from afferents during naturalistic rotational and linear motion. We found that linear–nonlinear cascade models could accurately describe neural responses. Thus, we used these models to determine whether afferent coding strategies are constrained by natural stimulus statistics by computing the optimal stimulus distribution that maximizes information. We found that irregular otolith and semicircular canal afferents, due to their higher sensitivities, were better optimized to process natural stimuli. It is therefore likely that the neural coding strategies used by the vestibular system have developed to match the statistics of natural stimuli.
Irregular afferents are better optimized to process natural stimuli
There is growing evidence that sensory systems have evolved coding strategies that are adapted to optimally process natural sensory input (Laughlin, 1981; for review, see Wark et al., 2007). For example, in the fly visual system, Laughlin (1981) compared the experimentally measured neuronal input–output relationship to that which maximizes information transmission about the natural luminance distribution and found excellent agreement between the two. Here we used a similar approach to compare the natural stimulus distribution to that which maximizes information transmission given the experimentally measured neuronal input–output relationship (Fig. 8) and found that irregular afferents better optimized to code for natural stimuli.
Interestingly, the evolution from stem tetrapods to amniotes was accompanied by the appearance of type I vestibular hair cells and a novel afferent terminal with a calyceal ending (for review, see Eatock and Songer, 2011). Previous studies have shown that afferents supplied by type I hair cells tend to be more irregular in their resting discharges (Goldberg and Fernández, 1977; Si et al., 2003; Xue and Peterson, 2006). It has been hypothesized that type I hair cells evolved in amniotes as an adaptation to changes in natural stimulus statistics resulting from (1) the transition from water (i.e., characterized by resistive hydrodynamic forces) to a land-based environment (Eatock and Songer, 2011) and (2) neck elongation in amniotes (Steinacker, 2004). While it has been suggested these two factors lead to higher amplitude motion in amniotes, at least some anamniote species experience vestibular stimuli with extremely large intensities during natural self-motion (e.g., >1000 deg/s in the spiny dogfish (Domenici et al., 2004) and 15 G in the swimming pike (for review, see Domenici and Blake, 1997). Thus, it is possible that other factors also contributed to the evolution of the vestibular periphery. For example, aquatic species might experience more constant vestibular stimulation due to water currents. Further studies comparing the statistics of natural motion across amniotes versus anamniotes are needed to address this important issue.
Nature of the neural code used by the vestibular system to transmit information about natural stimuli
It is generally thought that neurons within early vestibular pathways transmit information about head-motion stimuli in a linear fashion through their time-varying firing rates (i.e., a rate code) rather than through precise timing of action potentials (i.e., a temporal code) (Goldberg, 2000; Sadeghi et al., 2007b; Massot et al., 2011, 2012). However, previous studies have mostly focused on characterizing neuronal responses to low-amplitude artificial stimuli. As a result modulations in the firing rate were largely constrained to the neuron's linear regime and did not elicit static nonlinearities such as saturation or rectification. Interestingly, our results show that this is also often applicable to the afferent coding of natural vestibular stimuli because monkeys spend much of their time engaged in activities characterized by relatively low-amplitude head motion (i.e., sitting, grooming). However, we found that this was not true for the vestibular stimuli experienced during intermittent high-activity behaviors, such as running, because of their large amplitudes. For such behaviors, the likelihood of nonlinearities (e.g., rectification, phase locking, saturation), open the interesting possibility that the neural code used by the vestibular system to process high-amplitude stimuli is inherently different than that used for low-amplitude stimuli. Further studies will also be needed to test this interesting hypothesis.
Role of variability in neural coding
Neural variability is seen ubiquitously in the CNS but its role in neural coding is highly debated (Stein et al., 2005; Faisal et al., 2008). The peripheral vestibular system is particularly well suited for studying the effects of variability on neural coding because regular and irregular afferents display low and high amounts of variability, respectively. While previous studies have found that variability plays an important role in vestibular coding (Goldberg, 2000; Sadeghi et al., 2007b; Massot et al., 2011, 2012; Jamali et al., 2013), our results show that sensitivity largely determines the corresponding optimal stimulus distribution (Figs. 7, 8). Thus, from this point of view, the increased variability of irregular afferents might just be a consequence of their increased gain: the detrimental effects of such increased variability on coding and feature detection can then be reduced by averaging the activities of large afferent populations (Sadeghi et al., 2007b; Jamali et al., 2013). The fact that the increased variability and sensitivity are strongly correlated in the vestibular system (Jamali et al., 2013), as they most likely originate from intrinsic properties (Smith and Goldberg, 1986; Kalluri et al., 2010), support this hypothesis. Alternatively, it is theoretically possible that there is a source of sensory noise that is common to all afferents and that is greater for irregular afferents because of their larger sensitivity. Further studies will be needed to test these possibilities.
Coding natural stimuli by otolith versus semicircular canal afferent populations
Previous results have shown that the coding strategies used by otolith afferents to encode linear motion differ markedly from those used by semicircular canal afferents to encode rotational motion (Sadeghi et al., 2007b; Jamali et al., 2013). Notably, strong correlations between trial-to-trial variability and sensitivity in otolith afferents are such that their ratio remains constant. As a consequence, neural detection thresholds (i.e., the minimum stimulus intensity that elicits a detectable change in neural activity) are independent of either resting discharge variability or frequency (Jamali et al., 2013). In contrast, although a strong positive correlation between variability and sensitivity is also observed for semicircular canal afferents (Goldberg, 2000), the increased sensitivity displayed by irregular semicircular canal afferents is not sufficient to compensate for their substantially higher trial-to-trial variability. As a result, irregular semicircular canal afferents display higher detection thresholds than regular ones (Sadeghi et al., 2007b; Massot et al., 2011). Interestingly, supporting these differences, we found that irregular otoliths are more optimized than irregular canal afferents, which was primarily due to the fact that the differences in sensitivity are far more pronounced in the otolith system. Nevertheless, despite these differences, we further found that both irregular semicircular canal and otolith afferents are more optimized to process natural stimuli when compared with regular afferents, thereby suggesting that both the semicircular and otolith systems have adapted to natural stimulus statistics through common principles.
Implications for higher order processing of natural vestibular stimuli
Our results have important implications for downstream processing and behavior. We have recently shown that vestibular sensory information encoded by eighth nerve afferents is nonlinearly integrated by postsynaptic neurons at the first central stage of vestibular processing (i.e., in the vestibular nuclei; Massot et al., 2012). Notably, this nonlinearity generates an intensity-dependent bias in the output firing rate, when low- and high-amplitude (or frequency) stimuli are presented concurrently. We speculate that this nonlinear effect will be particularly significant for high-intensity natural behaviors.
Finally, we note that although irregular afferents are better optimized to encode natural stimuli, the combined activities of both regular and irregular afferents most likely contribute to vestibular perception. Previous results obtained using artificial stimuli have led to the hypothesis that regular afferents are better suited to estimate the detailed time course of the stimulus while irregular afferents are instead better suited to detect high-frequency features (Goldberg, 2000; Sadeghi et al., 2007b; Massot et al., 2011). Such parallel processing of sensory information is a common strategy used across modalities including auditory (Takahashi et al., 1984; Oertel, 1999; Gelfand, 2004), visual (Marr, 1982; Livingstone and Hubel, 1987; Merigan and Maunsell, 1993), and electrosensory (Carr and Maler, 1986; Bell and Maler, 2005; Kawasaki, 2005), to code for different stimulus attributes. Further studies are needed to test the interesting hypothesis that the vestibular system uses distinct channels of peripheral input to encode different features of natural vestibular stimuli.
Footnotes
This study was supported by Canadian Institutes of Health Research, National Institutes of Health, and FRQNT. We thank S. Nuara and W. Kucharski for excellent technical assistance.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Kathleen E. Cullen, McIntyre Medical Research Building, Room 1219, 3655 Promenade Sir William Osler, Montreal, PQ, H3G 1Y6, Canada. kathleen.cullen{at}mcgill.ca