Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Collections
    • Podcast
  • ALERTS
  • FOR AUTHORS
    • Information for Authors
    • Fees
    • Journal Clubs
    • eLetters
    • Submit
  • EDITORIAL BOARD
  • ABOUT
    • Overview
    • Advertise
    • For the Media
    • Rights and Permissions
    • Privacy Policy
    • Feedback
  • SUBSCRIBE

User menu

  • Log in
  • My Cart

Search

  • Advanced search
Journal of Neuroscience
  • Log in
  • My Cart
Journal of Neuroscience

Advanced Search

Submit a Manuscript
  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Collections
    • Podcast
  • ALERTS
  • FOR AUTHORS
    • Information for Authors
    • Fees
    • Journal Clubs
    • eLetters
    • Submit
  • EDITORIAL BOARD
  • ABOUT
    • Overview
    • Advertise
    • For the Media
    • Rights and Permissions
    • Privacy Policy
    • Feedback
  • SUBSCRIBE
PreviousNext
Articles, Behavioral/Cognitive

The Role of Harmonic Resolvability in Pitch Perception in a Vocal Nonhuman Primate, the Common Marmoset (Callithrix jacchus)

Michael S. Osmanski, Xindong Song (宋欣东) and Xiaoqin Wang
Journal of Neuroscience 22 May 2013, 33 (21) 9161-9168; DOI: https://doi.org/10.1523/JNEUROSCI.0066-13.2013
Michael S. Osmanski
Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xindong Song (宋欣东)
Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiaoqin Wang
Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF
Loading

Abstract

Pitch is one of the most fundamental percepts in the auditory system and can be extracted using either spectral or temporal information in an acoustic signal. Although pitch perception has been extensively studied in human subjects, it is far less clear how nonhuman primates perceive pitch. We have addressed this question in a series of behavioral studies in which marmosets, a vocal nonhuman primate species, were trained to discriminate complex harmonic tones differing in either spectral (fundamental frequency [f0]) or temporal envelope (repetition rate) cues. We found that marmosets used temporal envelope information to discriminate pitch for acoustic stimuli with higher-order harmonics and lower f0 values and spectral information for acoustic stimuli with lower-order harmonics and higher f0 values. We further measured frequency resolution in marmosets using a psychophysical task in which pure tone thresholds were measured as a function of notched noise masker bandwidth. Results show that only the first four harmonics are resolved at low f0 values and up to 16 harmonics are resolved at higher f0 values. Resolvability in marmosets is different from that in humans, where the first five to nine harmonics are consistently resolved across most f0 values, and is likely the result of a smaller marmoset cochlea. In sum, these results show that marmosets use two mechanisms to extract pitch (harmonic templates [spectral] for resolved harmonics, and envelope extraction [temporal] for unresolved harmonics) and that species differences in stimulus resolvability need to be taken into account when investigating and comparing mechanisms of pitch perception across animals.

Introduction

Pitch is one of the most important percepts in audition and a principle attribute of spectrally and temporally periodic sounds, including music and speech (Plack et al., 2005; Moore, 2012). Many vertebrate species are sensitive to pitch, including songbirds (Cynx and Shapiro, 1986; Lohr and Dooling, 1998; Dooling et al., 2002), chinchillas (Shofner, 2011), cats (Heffner and Whitfield, 1976), ferrets (Kalluri et al., 2008; Walker et al., 2009), and monkeys (Tomlinson and Schwarz, 1988). Pitch is not explicitly represented in the cochlea; instead, it must be computed centrally. For example, pitch can be extracted by calculating the specific frequencies of, and harmonic spacing between, the individually identified (resolved) components of a complex sound (Plomp, 1964; Plomp and Mimpen, 1968).

Resolvability is estimated by measuring sensory thresholds for pure tones as a function of notch width in a notched-noise masker (Patterson, 1976; Glasberg and Moore, 1990). These estimates assume that the basilar membrane contains a bank of auditory filters, and different frequencies are resolved if they are processed by separate filters (Fletcher, 1940). As the width of the notch increases, the power of the noise passing through the filter, and thus the sensory threshold, decreases. These threshold data are used to estimate the equivalent rectangular bandwidth (ERB), which is the bandwidth of a rectangular filter passing the same total energy as the purported auditory filter.

If harmonics cannot be separated out on the cochlea (unresolved), pitch must be derived from the time-varying pattern of vibrations produced by the interaction of those harmonics on the basilar membrane. A compelling case has been made for temporal coding mechanisms in humans (de Cheveigné, 1998; Meddis and Hewitt, 1991a, 1991b; Meddis and O'Mard, 1997; Yost et al., 1996; but see Oxenham et al., 2011) and animals (Cariani and Delgutte, 1996a, 1996b; Simmons and Buxbaum, 1996). However, combining spectral and temporal mechanisms provides a more parsimonious explanation of human psychophysical data (Flanagan and Guttman, 1960a, 1960b; Guttman and Flanagan, 1964; Carlyon and Shackleton, 1994; Shackleton and Carlyon, 1994), suggesting that pitch extraction is dependent on frequency resolution in the auditory periphery.

Recent work has described a putative area specialized for pitch processing in auditory cortex of marmosets (Bendor and Wang, 2005, 2010), similar to that in humans (Patterson et al., 2002; Penagos et al., 2004; Bendor and Wang, 2006; Wang and Walker, 2012). In marmosets, this area contains neurons that encode pitch using spectral or temporal envelope cues (Bendor et al., 2012). To evaluate the influence of harmonic resolvability on pitch perception, we trained marmosets to discriminate complex tones in which temporal cues (envelope repetition rate) could be varied independently from spectral cues (f0). We then measured frequency resolution using a psychophysical task in which tone thresholds were measured as a function of notched noise bandwidth. Results show that marmosets are sensitive to phase in harmonic complex sounds and can extract pitch using both spectral and temporal envelope cues and that the particular cue used depends on the resolvability of the stimulus spectrum.

Materials and Methods

Subjects.

Six common marmosets (five male, one female) were used in these experiments: four in Experiment 1 and five in Experiment 2 (three were subjects in both experiments). Animals were housed in individual cages in a large colony at the Johns Hopkins University School of Medicine and were maintained at ∼90% of their free-feeding weight on a diet consisting of monkey chow, fruit, and yogurt and had ad libitum access to water. Subjects were tested five days per week between the hours of 0900 and 1800. All experimental procedures were approved by the Johns Hopkins University Animal Care and Use Committee and were in compliance with the guidelines of the National Institutes of Health.

Operant testing apparatus.

Marmosets were tested while seated in a Plexiglas restraint chair mounted in the center of a single-walled sound isolation chamber (Industrial Acoustic, model 400A) lined with 3-inch acoustic absorption foam (Pinta Acoustics). A food delivery tube was positioned at the top of the restraint chair in front of the animal; and a custom bracket, which housed an infrared photobeam, was mounted to the end of the feeding tube. Responses were recorded whenever the animal broke the photobeam by licking at the tube. Reward food consisted of a mixture of Similac baby formula, Gerber single-grain rice cereal, and strawberry-flavored Nesquik, delivered via a syringe pump (New Era Pump Systems, model NE-500) mounted to the base of the chair.

All sound stimuli were generated using MATLAB software (MathWorks) and delivered at a nominal sampling rate of 100 kHz through a digital signal processor and programmable attenuator (Tucker-Davis Technologies, RX6 and PA5) followed by an audio amplifier (Crown Instruments, model D-75). Stimuli were played from a speaker (Tannoy) mounted 40 cm directly in front of the animal and were calibrated before each experiment using a 1/2” free-field microphone (Brüel and Kjaer, type 4191) positioned at the same location as the animal's head. Test sessions were monitored via portable webcam (Logitech, model B905).

Operant methods.

The basic operant task has been described in detail previously (Osmanski and Wang, 2011; Remington et al., 2012). Briefly, each session was comprised of 100 trials, each of which was composed of the following: (1) a 3–10 s variable duration waiting period in which a “background” sound was repeatedly presented to the animal, and (2) a 5 s response window in which the background sound was alternated with a “target” sound. Animals had to detect the presentation of the target sound and respond by licking the feeding tube (“hit”) to receive food reward. Response latency was measured as the time from the first target onset to the time of lick onset (i.e., when the animal first broke the infrared photobeam). Thirty percent of all trials were catch trials in which the target was the same as the background. Licks during a catch trial resulted in a “false alarm” and the chamber lights were extinguished (a “blackout”) for 5 s. A failure to respond during a target or catch trial was recorded as a “miss” or “correct reject,” respectively.

Experiment 1.

Marmosets were trained to discriminate harmonic complex tones differing in their phase relationships. Each stimulus consisted of up to nine harmonics and was constructed so that the components either all began in cosine phase (COS) or were alternated such that odd harmonics began in sine phase and even harmonics began in cosine phase (ALT). The f0 of a COS stimulus is the same whether it is measured using that sound's envelope repetition rate (temporal) or using the spacing between the sound's harmonics (spectral). An ALT stimulus, however, has the same harmonic spacing (and thus the same pitch derived from spectral cues) as a corresponding COS stimulus, but double the envelope repetition rate (i.e., the pitch derived from temporal envelope cues is one octave higher) (Fig. 1).

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Harmonic complex tone stimuli. Harmonics were either in same (COS) or alternating (ALT) phase (odd harmonics in sine phase, even harmonics in cosine phase). Acoustic waveforms (left column) and frequency spectra (right column) shown for three example stimuli: 150 Hz f0 COS (top), 300 Hz f0 COS (middle), and 150 Hz f0 ALT (bottom). ALT stimuli have an identical spectrum, but twice the repetition rate, of a COS stimulus with the same f0.

Stimuli were 500 ms (20 ms rise/fall time, 200 ms interstimulus interval) and presented at an average level of 70 ± 4 dB SPL. Intensity was roved for each stimulus presentation to remove potential amplitude cues between stimuli with differing phase relationships. Stimuli were presented in blocks of 10 trials. Each block contained five COS targets, two ALT targets, and three catch trials. ALT stimuli essentially functioned as low-probability probe sounds to determine whether they were perceived in the same way as the higher-probability COS stimuli. We quantified responses in terms of d′ [d′ = z(hit rate) − z(false alarm rate)] (higher d′ values indicate better discriminability), as well as a corrected hit rate [hit ratecorr = (hit rate − false alarm rate)/(1 − false alarm rate)] (Gescheider, 1985). Final d′ and hit rate values are based on a minimum of 300 trials. Sessions with a false alarm rate >25% were discarded (<10% of all sessions). Significance tests were conducted with a Bonferroni correction.

Experiment 2.

Marmosets were trained to detect a 500 ms pure tone target (20 ms rise/fall time, 200 ms interstimulus interval) against a silent background. Targets were alternated with equal-duration periods of silence during the response window. We tested frequencies of 500, 1000, 7000, and 16,000 Hz. Seven different amplitudes (in 5 dB SPL steps) of a given tone frequency were presented per session and were chosen based on previously measured thresholds (Osmanski and Wang, 2011). Threshold was defined as the target amplitude corresponding to a hit rate of 50% correct detection, after a correction for false alarm rate. If a threshold could not be determined, tone amplitudes were adjusted in the subsequent session to bracket the presumed threshold. Animals were tested until threshold values stabilized (i.e., three successive sessions in which the obtained threshold was within ±1/3 step size). Final threshold estimates are based on a minimum of 300 trials. Sessions with a false alarm rate >25% were discarded (<12% of all sessions).

Once thresholds were determined in quiet, the detection task was modified to include a continuous broadband noise played across the effective bandwidth of the speaker (0.8–54 kHz). Noise level was increased over several sessions until thresholds were elevated ∼35 ± 5 dB relative to thresholds in quiet. The level of the noise was the same for all animals at a given frequency but varied across frequencies. The average noise spectrum level required to attain the criterion threshold shift across all frequencies tested was 34.8 ± 2.3 dB. Subsequent thresholds were measured by adding spectral notches to the noise background. Notched noise filter coefficients were generated in MATLAB and input to the digital signal processor. Notches were logarithmically centered on the tone frequency with bandwidths of 0.125, 0.25, 0.5, 1, and 2 octaves. Only one bandwidth was presented per session, and the same bandwidth was used until threshold values stabilized.

Derivation of auditory filter shapes and ERB estimation.

Auditory filter shapes were derived using a rounded exponential (roex) filter fitting function (using custom MATLAB software adapted from Glasberg and Moore, 1990). This procedure has been described in detail previously (Lina and Lauer, 2013). Briefly, the technique involves a minimization algorithm for calculating fitting parameters against the interpolated masked threshold data, which is determined by minimizing the root-mean-square error between the recorded threshold values and the filter-generated predicted values.

The power spectrum model of masking (Fletcher, 1940; Glasberg and Moore, 1990) is mathematically summarized as follows: Embedded Image where the masked threshold of a signal (Ps) is defined by the power spectrum of the noise (N(f)) and the weighted function of the signal (W(f)) in the frequency domain. The constant K is a scalar adjustment factor equal to the signal-to-masker ratio at the output of the filter required to achieve threshold. Previous psychoacoustic studies (Patterson, 1976) using noise with spectral notches have found that the shape of the auditory filter can be approximated by a roex(p,r) fit as follows: Embedded Image where p describes the filter skirt slopes and r flattens the filter at remote frequencies. The parameter g is the frequency deviation of the edge of the notch normalized by its center frequency (Δf/fc).

Equations 1 and 2 can be combined to yield Equation 3, which is analogous to Equation 1 converted into the power spectrum domain: Embedded Image Equation 3 is then used to compare masked thresholds to the optimized roex (p,r) filter output values. Masked thresholds and filter magnitudes are plotted in terms of the deviation at the near (lower) edge. Finally, using the resultant roex(p,r) filter parameters, the ERB of the auditory filter shape can be calculated according to the following: Embedded Image

Harmonic resolvability model.

The lowest f0 an auditory channel centered at fc can resolve is assumed equal to 1 ERB (Moore and Ohgushi, 1993; Moore, 2012). Thus, the highest resolved harmonic for this f0 can be estimated by the following equation: Embedded Image where fc is center frequency. Resolvability curves were cubic spline interpolated from the measured ERB (f0) points. Additionally, we used previously published neural frequency tuning data measured at each neuron's best sound level (Bartlett et al., 2011) to estimate the resolvability of A1 neurons in marmoset. For these neural data, we substituted the ERB term in Equation 5 with the frequency response bandwidth (calculated by Bartlett et al., 2011) to derive the highest resolved harmonic of marmoset A1 neurons.

Results

Experiment 1a: pitch discrimination

Animals were trained to discriminate one of two “targets” (2f0 COS or f0 ALT) from a repeating “background” (f0 COS) (see Materials and Methods). We assumed that, if marmosets rely on spectral cues to extract pitch, they would respond to the 2f0 COS target but not the f0 ALT target. However, if they rely on temporal envelope cues, they would respond to both targets. Animals were tested using three stimulus sets: first nine harmonics present (H1-H9), a missing f0 set containing the fourth through 12th harmonics of a common f0 (H4-H12), and first four harmonics present (H1-H4). Set H4-H12 was presented with a 3-octave-wide band of noise (10 dB signal-to-noise ratio) centered at f0 to minimize potential harmonic distortion effects at the level of the cochlea. In all three stimulus sets, background sounds had f0 values of 150, 300, 450, 600, and 900 Hz (2f0 COS targets had f0 values ranging from 300 to 1800 Hz).

Performance for ALT targets was highly dependent on f0 in both H1-H9 and H4-H12 conditions, but not so for 2f0 COS targets (Fig. 2A,B). We quantified performance in terms of d′ (see Materials and Methods) and found a significant decrease in discriminability for ALT stimuli as f0 increased (H1-H9, F(4,12) = 12.59, p < 0.001; H4-H12, F(4,12) = 33.29, p < 0.001) (Fig. 2A,B, top). Performance for ALT and COS targets was indistinguishable at 150 Hz (H1-H9, t(3) = −1.09, p = 0.36, corrected; H4-H12, t(3) = −2.74, p = 0.071, corrected), but significantly different at 900 Hz (H1-H9, t(3) = −16.86, p < 0.001, corrected; H4-H12, 900 Hz: t(3) = −17.25, p < 0.001, corrected). Subjects required significantly more time to respond to ALT targets as f0 increased (H1-H9, F(4,12) = 6.82, p < 0.01; H4-H12, F(4,12) = 10.42, p = 0.001) (Fig. 2A,B, bottom). However, discrimination was impaired across all five f0 values in the H1-H4 condition, suggesting that subjects could not make discriminations based on temporal envelope cues in these stimuli (150 Hz, t(3) = −7.97, p < 0.01, corrected; 300 Hz, t(3) = −6.67, p < 0.01, corrected; 450 Hz, t(3) = −18.18, p < 0.01, corrected; 600 Hz, t(3) = −42.63, p < 0.01, corrected; 900 Hz, t(3) = −5.34, p < 0.05, corrected) (Fig. 2C). Overall, these results suggest that marmosets are sensitive to temporal envelope cues at f0 values <450 Hz but rely more strongly on spectral cues for f0 values >450 Hz, and that the first four harmonics of these complex tones are probably resolved.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

Evidence for two mechanisms of pitch processing. A, B, Marmosets were trained to discriminate f0 ALT (red lines) and 2f0 COS (blue lines) tone targets from an f0 COS background. Stimuli were 9-component harmonic complex tones in which either the first nine harmonics were present (A) or the first three harmonics were removed (B). Top panels, Results plotted as both proportion correct (dashed lines) and d′ (solid lines). Envelope repetition rate was used to correctly identify ALT phase stimuli at low, but not high, f0 values. Bottom panels, Animals require more time to respond to ALT stimuli at higher f0 values. C, Temporal envelope cues could not be used to reliably discriminate 4-component ALT stimuli (H1-H4) at any f0 tested, as measured by both proportion correct and d′ (top). Response latencies were also increased for ALT compared with COS stimuli (bottom). Error bars indicate SEM in all plots.

Experiment 1b: the role of harmonic order on temporal pitch processing

We further examined the influence of stimulus resolvability on temporal envelope processing by testing discrimination of complex tones with harmonics confined to one of three spectral regions: low (H1-H4), medium (H5-H8), or high (H9-H12). Stimuli in the medium and high conditions were presented with a 3 octave noise band centered at f0. Only f0 values of 150 Hz and 900 Hz were used. If all harmonics are resolved, performance should be dependent on spectral cues alone and marmosets should respond only to COS (not ALT) stimuli. If all harmonics are unresolved, or if there is a mixture of resolved and unresolved harmonics (which produces an ambiguous pitch in human listeners) (Shackleton and Carlyon, 1994), temporal envelope cues should cause marmosets to respond to both targets.

Discriminability of ALT targets was highly dependent on spectral region at 150 Hz (F(2,6) = 24.23, p < 0.01), but not at 900 Hz (F(2,6) = 1.01, p = 0.42) (Fig. 3A,B, top). At 150 Hz, d′ values for ALT targets were significantly lower than COS targets in both low (t(3) = −7.79, p < 0.01, corrected) and medium (t(3) = −5.65, p < 0.05, corrected), but not high (t(3) = −2.08, p = 0.129, corrected), conditions, and response latency decreased as spectral region increased (F(2,6) = 0.777, p < 0.05) (Fig. 3A, bottom). d′ values for ALT stimuli at 900 Hz were significantly different from COS stimuli at all f0 values (low, t(3) = −6.24, p < 0.01, corrected; medium, t(3) = −8.56, p < 0.01, corrected; high, t(3) = −4.95, p < 0.05, corrected) (Fig. 3B, top). These data suggest that at 150 Hz, but not 900 Hz, harmonics are probably unresolved above ∼fifth.

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

Role of resolvability in pitch discrimination. A, B, The role of resolved versus unresolved harmonics was tested using 4-component harmonic complexes with f0 values of 150 Hz (A) or 900 Hz (B) at three different spectral regions. Performance was assessed using proportion correct and d′ (top panels) and response latency (bottom panels). Discrimination performance improved, and latency decreased, for ALT phase stimuli as spectral region increased for 150 Hz, but not 900 Hz, stimuli. C, Stimulus reversal task requiring discrimination of f0 ALT and f0 COS targets from a 2 f0 COS background. Stimuli were the same as in Figure 2A. Proportion correct (top), d′ (center), and response latency (bottom) were the same for both ALT and COS phase complexes for both H1-H9 and H4-H12 stimulus sets, suggesting that marmosets hear these stimuli as distinct percepts. Error bars indicate SEM in all plots.

Experiment 1c: background-target stimulus reversal

Experiment 1a showed that discrimination performance for f0 ALT and 2f0 COS targets is the same at 150 Hz for both H1-H9 and H4-H12 conditions (Fig. 2A,B, top), but it is unclear whether these stimuli are perceptually equivalent. We changed the background-target relationships so that a correct response required discriminating f0 ALT and f0 COS targets from a 2f0 COS background. Animals thus had to report on an octave decrease. If f0 ALT and 2f0 COS stimuli are perceptually indistinguishable, then subjects should not respond to f0 ALT targets. Results show that, for both H1-H9 and H4-H12 stimuli, marmosets were able to easily discriminate both target types with equal latency (hit rate: H1-H9, t(3) = 0.54, p = 0.628; H4-H12, t(3) = 0.88, p = 0.442; d′: H1-H9, t(3) = 0.74, p = 0.516; H4-H12, t(3) = 0.88, p = 0.444; response latency: H1-H9, t(3) = 0.059, p = 0.957; H4-H12, t(3) = −0.721, p = 0.523) (Fig. 3C).

Experiment 2a: psychophysical estimates of frequency selectivity

Results of Experiment 1 suggest that frequency resolution in the auditory periphery primarily determines whether a spectral or temporal mechanism is used to extract pitch. We estimated frequency resolution by measuring ERBs using notched noise procedures (see Materials and Methods). Previous work has shown that auditory filter shapes can be approximated by applying roex filter fits to notched noise data (Glasberg and Moore, 1990). The ERB transforms this filter into a rectangular shape with the same peak and total transmission. We tested four tone frequencies (500, 1000, 7000, and 16,000 Hz) drawn from across the marmoset hearing range (Osmanski and Wang, 2011).

Psychometric functions were measured in seven conditions (quiet, broadband noise, and five different spectral notch widths). Examples of individual psychometric functions are shown in Figure 4A. Thresholds compared favorably with previously measured audiograms for this species (Osmanski and Wang, 2011). Masking effects are indicated by a rightward shift in the psychometric functions measured in noise relative to quiet. The largest shift is seen for broadband noise, and adding spectral notches reduced the magnitude of this shift. Individual thresholds for each notch width were averaged to derive the input to the roex function (see Materials and Methods). These thresholds, along with the best fitting function to the masked threshold data, are shown in Figure 4B. Filter parameters obtained from the fitting algorithm were used to derive the filter shapes, which were then converted into ERBs (Fig. 4C).

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

ERB estimation. A, Summary psychometric functions for each of the four tone frequencies in seven conditions (quiet, broadband noise, and five bandwidths of notched noise). Representative performance from example subjects is plotted as the proportion of correct responses as a function of tone amplitude. Threshold = 50% correct. B, Effects of notched noise bandwidth on detection thresholds for all five subjects (colored symbols) for each tone frequency. The best fitting function is shown (solid black line) fitted to the average of each deviation (black open circles). C, Estimated roex(p,r) filter functions plotted against the interpolated average notched-noise thresholds across all five subjects.

Psychophysical measures of frequency selectivity using notched noise data have been described for only a few species, and these data are shown in Figure 5 (mouse, May et al., 2006; chinchilla, Niemiec et al., 1992; guinea pig, Evans et al., 1992). ERBs are typically between 10% and 20% of center frequency across species. Those measured in the current experiment vary from 6.6% (7 kHz) - 25.4% (0.5 kHz) (Fig. 5). Marmoset ERBs tend to be wider than those measured in humans, except at 7000 Hz, where bandwidths were narrower than those of the other four species.

Figure 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 5.

Comparison of ERBs across species. Marmoset ERBs were generally similar to other mammalian ERBs measured using notched-noise procedures. Marmosets show wider ERBs compared with humans, except at 7000 Hz, which is the frequency of best hearing in these animals. Error bars indicate SD.

Experiment 2b: modeling harmonic resolvability

Our ERB estimates were used to model harmonic resolvability in marmosets. The highest resolved harmonic of a complex tone is approximately equal to the center frequency of the auditory filter divided by its ERB (see Materials and Methods). We compare resolvability for marmosets and humans in Figure 6A. The first five to nine harmonics of a complex tone are consistently resolved in humans (Plomp, 1964; Glasberg and Moore, 1990). In marmosets, however, the first four or five harmonics are resolved at an f0 of 150 Hz, and this number increases to the first 14–16 harmonics between 600 and 800 Hz (Fig. 6A) before decreasing again at higher f0 values. These values show a remarkable correspondence to our pitch discrimination data, where temporal cues could be reliably used to discriminate complex tones at 150 Hz f0 (but only when stimuli contained harmonics > fifth), and this ability decreased as f0 increased (up to 900 Hz, presumably as the stimuli became entirely composed of fully resolved harmonics).

Figure 6.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 6.

Model of harmonic resolvability. A, Resolvability boundaries for both marmosets (solid black line) and humans (dashed black line) derived from psychophysical ERB estimates. Harmonics falling to the left of each boundary are fully resolved, whereas harmonics falling to the right are unresolved or partially resolved. Compared with humans, marmosets have poorer spectral resolution ∼<300 Hz. Marmosets can fully resolve up to 12 harmonics at 400 Hz, a result that matches results from Experiment 1 where temporal envelope cues were no longer salient for H1–H9 or H4–H12 stimuli ∼>450 Hz. B, Comparison of harmonic resolvability for marmosets using both behavioral data and tuning bandwidths of neurons recorded from A1. Psychophysical curves form a lower envelope for resolvability estimates derived from neural tuning bandwidths.

Previous work has shown agreement between ERBs measured psychophysically and those derived from the tuning bandwidths of cortical neurons (Evans et al., 1992; Fishman and Steinschneider, 2006; Bitterman et al., 2008). We estimated the harmonic resolvability of marmoset A1 neurons based on previously measured cortical tuning bandwidths in this species (Bartlett et al., 2011) and show that these data show a qualitative agreement with our psychophysical estimates (Fig. 6B). Behavioral thresholds, which presumably reflect spectral resolvability in the auditory periphery, generally describe a lower resolvability boundary for these cortical neurons. This relationship is consistent with the idea that a sensory threshold is determined by the most sensitive neurons contributing to a given behavior (Parker and Newsome, 1998). The overall poorer spectral resolvability measured across neurons in cortex compared with psychophysical thresholds is likely the result of information being increasingly integrated at successive processing stages of the auditory system.

Discussion

We trained common marmosets to discriminate complex harmonic tones differing in their phase relationships to disentangle the relative use of temporal envelope and spectral cues in extracting pitch from complex sounds. Our results suggest that marmosets are sensitive to both cues and that the primary determinants of which mechanism is used for pitch extraction are f0 and harmonic order. Specifically, marmosets were able to use changes in the envelope repetition rate of these sounds to extract pitch when stimuli contained low f0 values and high harmonic orders, whereas spectral information was used for higher f0 values with low harmonic orders. In a second experiment, we estimated auditory filter ERBs using notched noise techniques. Results show that marmoset ERBs are generally comparable to other mammalian species. Marmosets had wider ERBs compared with humans, except at 7000 Hz, which is approximately the frequency of best hearing in this species and also the frequency where most of the spectral energy in marmoset vocalizations is concentrated (Osmanski and Wang, 2011).

These behavioral results can be summarized by our model of harmonic resolvability (Fig. 6A), which shows that frequency resolution in marmosets is strongly dependent on both f0 and harmonic number. Only the first four harmonics are resolved at low f0 values (∼150 Hz), whereas this number grows to include the first 12–16 harmonics at higher f0 values (∼400–1000 Hz). These model data match our pitch results from Experiment 1, where temporal envelope cues dominate performance at f0 values ∼<450 Hz when stimuli contain harmonic numbers ∼>5.

The influence of f0 on pitch perception in marmosets is qualitatively similar to humans, where spectral cues dominate at f0 values ∼>100 Hz and sensitivity to temporal envelope cues becomes important only at very low f0 values <100 Hz (Flanagan and Guttman, 1960a,b). Quantitatively, however, our results suggest that marmosets and humans will perceive different pitches across a range of acoustic stimuli in which temporal envelope and spectral cues indicate different fundamental frequencies. For example, our data suggest that, for an ALT phase stimulus with an f0 = 150 Hz and containing harmonics 5–8, marmosets will hear either an ambiguous pitch or a pitch one octave above the f0 (extracted from temporal envelope cues), whereas humans will hear a pitch equal to the f0 (extracted from spectral cues) (Shackleton and Carlyon, 1994).

Marmoset sensitivity to envelope repetition rate at f0 values higher that those reported in humans (450 Hz vs 100 Hz) likely results from the fact that the marmoset cochlea is less than half that of the human (Johnson et al., 2012), even though the two species have similar ranges of hearing (Osmanski and Wang, 2011). This anatomical difference likely puts large restrictions on how sounds are represented centrally. In other words, marmoset auditory filter bandwidths might be expected to be wider, at least at certain frequencies, to reflect the fact that there is limited peripheral space on which to encode those frequencies. Heightened sensitivity to temporal envelope cues (as seen in Experiment 1) may then be the result of poorer spectral resolving power due to wider filter bandwidths compared with humans (Shera et al., 2002). Differences in the size of human and marmoset auditory filters, and thus differences in spectral resolution, may reflect a series of tradeoffs between basilar membrane length, hearing range, and the need to process species-specific communication signals centered in a particular range of frequencies (∼7000 Hz). Smaller cochleae may also limit spectral resolvability in other animals relative to humans, measured as increased pitch discrimination thresholds (Shofner, 2002; Kalluri et al., 2008; see, for example, Walker et al., 2009; Klinge et al., 2010; Shofner and Chaney, 2012), and wider ERBs (Evans et al., 1992; Niemiec et al., 1992; May et al., 2006).

Our ERB estimates compare favorably with previous human data using simultaneous masking paradigms (e.g., Patterson, 1976). However, much sharper estimates of auditory filter tuning widths have been described in humans using a variant of the notched noise technique in which brief, near-threshold signals were presented in a forward masking paradigm (Shera et al., 2002; Oxenham and Shera, 2003). These results show that ERBs are substantially narrower in forward, compared with simultaneous, masking conditions and are thought to provide more accurate estimates of tuning at low levels. Auditory filter bandwidths are known to widen with increasing level (Moore and Glasberg, 1987), and our choice of a simultaneous masking paradigm at relatively high levels (∼35 dB) was based on the fact that our pitch task was designed to examine mechanisms of pitch perception, previously studied in humans, which operate well above threshold levels (average stimulus level was 70 dB SPL). Calculating ERBs at lower levels would potentially limit meaningful comparisons between our tuning estimates and our pitch results.

In human pitch matching experiments, subjects report a pitch intermediate between an f0 and 2f0 COS stimulus when both resolved and unresolved harmonics are present in an f0 ALT stimulus (Shackleton and Carlyon, 1994). Our results suggest that the same holds true for marmosets because they easily discriminate f0 ALT from both f0 (Experiment 1a) and 2f0 (Experiment 1c) COS stimuli. It is important to keep in mind, however, that subjects were trained to respond to any acoustic change in these experiments, and presentation of ALT stimuli is accompanied by a change in spectrum in Experiment 1c. Thus, timbre, and not pitch, could have formed the basis for discrimination. Ultimately, questions about stimulus similarity and perceptual grouping can only be properly answered using a more complex categorization or matching task in which stimuli are judged based on perceived similarity.

Finally, although we argue that marmosets are sensitive to spectral cues in these experiments, it is important to note that we cannot separate a spectral processing scheme in our experiments from a complementary scheme using temporal fine structure information (see, for example, Oxenham et al., 2004; Shofner and Campbell, 2012). For instance, similar predictions have been obtained whether one uses a harmonic template-based mechanism or one based on the waveform autocorrelation function (Cariani and Delgutte, 1996a,b; Meddis and O'Mard, 1997). In addition to temporal fine structure cues, marmosets could potentially be using other kinds of spectral cues to make pitch judgments for COS stimuli, including changes in the position of the highest harmonic. Further work will be required to tease apart these possible alternatives to determine whether marmosets are truly sensitive to spectral mechanisms of pitch extraction.

In conclusion, these behavioral data support a dual pitch processing mechanism whereby both harmonic templates (spectral) and envelope extraction (temporal) are used to compute pitch depending on the resolvability of the stimulus spectrum. Although previous work has shown that primates are able to use pitch cues to make judgments about sound (May et al., 1989; Wright et al., 2000; Izumi, 2001; Brosch et al., 2004), we think that these data represent the first rigorous psychophysical tests of pitch perception in nonhuman primates that begin to address questions of underlying mechanism. Marmosets, an emerging model system in auditory neuroscience with a large vocal repertoire and an easily accessible auditory cortex (Wang, 2000, 2007), are an ideal species for questions of this kind. Teasing apart the various mechanisms that give rise to pitch perception in marmosets will provide valuable comparative data, which will advance our understanding of how the auditory system works to create a unified pitch percept in different animal species, including humans.

Footnotes

  • This work was supported by the National Institutes of Health Grant DC 03180 to X.W. and Grant T32 DC000023. We thank J. Estes and N. Sotuyo for assistance with animal care; C. Cummings, R. Desideri, J. Howard, M. Maguire, and E. Nagourney for help running the behavior experiments; I. Lina for work on the analysis software for roex filter fits and ERB derivation; V. Sadagopan for calculating neural frequency tuning bandwidths to fit our resolvability model; and D. Bendor and I. Lina for providing valuable comments on an earlier draft of this manuscript.

  • The authors declare no competing financial interests.

  • Correspondence should be addressed to either Dr. Michael S. Osmanski or Dr. Xiaoqin Wang, Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, 720 Rutland Avenue, Traylor 410, Baltimore, MD 21205. michael.osmanski{at}jhu.edu or xiaoqin.wang{at}jhu.edu

References

  1. ↵
    1. Bartlett EL,
    2. Sadagopan S,
    3. Wang X
    (2011) Fine frequency tuning in monkey auditory cortex and thalamus. J Neurophysiol 106:849–859, doi:10.1152/jn.00559.2010, pmid:21613589.
    OpenUrlAbstract/FREE Full Text
  2. ↵
    1. Bendor D,
    2. Wang X
    (2005) The neuronal representation of pitch in primate auditory cortex. Nature 436:1161–1165, doi:10.1038/nature03867, pmid:16121182.
    OpenUrlCrossRefPubMed
  3. ↵
    1. Bendor D,
    2. Wang X
    (2006) Cortical representations of pitch in monkeys and humans. Curr Opin Neurobiol 16:391–399, doi:10.1016/j.conb.2006.07.001, pmid:16842992.
    OpenUrlCrossRefPubMed
  4. ↵
    1. Bendor D,
    2. Wang X
    (2010) Neural coding of periodicity in marmoset auditory cortex. J Neurophysiol 103:1809–1822, doi:10.1152/jn.00281.2009, pmid:20147419.
    OpenUrlAbstract/FREE Full Text
  5. ↵
    1. Bendor D,
    2. Osmanski MS,
    3. Wang X
    (2012) Dual pitch processing mechanisms in primate auditory cortex. J Neurosci 32:16149–16161, doi:10.1523/JNEUROSCI.2563-12.2012, pmid:23152599.
    OpenUrlAbstract/FREE Full Text
  6. ↵
    1. Bitterman Y,
    2. Mukamel R,
    3. Malach R,
    4. Fried I,
    5. Nelken I
    (2008) Ultra-fine frequency tuning revealed in single neurons of human auditory cortex. Nature 451:197–201, doi:10.1038/nature06476, pmid:18185589.
    OpenUrlCrossRefPubMed
  7. ↵
    1. Brosch M,
    2. Selezneva E,
    3. Bucks C,
    4. Scheich H
    (2004) Macaque monkeys discriminate pitch relationships. Cognition 91:259–272, doi:10.1016/j.cognition.2003.09.005, pmid:15168897.
    OpenUrlCrossRefPubMed
  8. ↵
    1. Cariani PA,
    2. Delgutte B
    (1996a) Neural correlates of the pitch of complex tones: I. Pitch and pitch salience. J Neurophysiol 76:1698–1716, pmid:8890286.
    OpenUrlAbstract/FREE Full Text
  9. ↵
    1. Cariani PA,
    2. Delgutte B
    (1996b) Neural correlates of the pitch of complex tones: II. Pitch shift, pitch ambiguity, phase invariance, pitch circularity, rate pitch, and the dominance region for pitch. J Neurophysiol 76:1717–1734, pmid:8890287.
    OpenUrlAbstract/FREE Full Text
  10. ↵
    1. Carlyon RP,
    2. Shackleton TM
    (1994) Comparing the fundamental frequencies of resolved and unresolved harmonics: evidence for two pitch mechanisms. J Acoust Soc Am 95:3541–3554, doi:10.1121/1.409971.
    OpenUrlCrossRef
  11. ↵
    1. Cynx J,
    2. Shapiro M
    (1986) Perception of missing fundamental by a species of songbird (Sturnus vulgaris) J Comp Psychol 100:356–360, doi:10.1037/0735-7036.100.4.356, pmid:3802780.
    OpenUrlCrossRefPubMed
  12. ↵
    1. de Cheveigné A
    (1998) Cancellation model of pitch perception. J Acoust Soc Am 103:1261–1271, doi:10.1121/1.423232, pmid:9514016.
    OpenUrlCrossRefPubMed
  13. ↵
    1. Dooling RJ,
    2. Leek MR,
    3. Gleich O,
    4. Dent ML
    (2002) Auditory temporal resolution in birds: discrimination of harmonic complexes. J Acoust Soc Am 112:999–1008, doi:10.1121/1.1494807, pmid:12243189.
    OpenUrlCrossRefPubMed
  14. ↵
    1. Evans EF,
    2. Pratt SR,
    3. Spenner H,
    4. Cooper NP
    (1992) in Auditory physiology and perception, Comparisons of physiological and behavioural properties: auditory frequency selectivity, eds Cazals Y, Demany L, Horne K (Pergamon, Oxford), pp 159–170.
  15. ↵
    1. Fishman YI,
    2. Steinschneider M
    (2006) Spectral resolution of monkey primary auditory cortex (A1) revealed with two-noise masking. J Neurophysiol 96:1105–1115, doi:10.1152/jn.00124.2006, pmid:16738218.
    OpenUrlAbstract/FREE Full Text
  16. ↵
    1. Flanagan JL,
    2. Guttman N
    (1960a) On the pitch of periodic pulses. J Acoust Soc Am 32:1308–1319, doi:10.1121/1.1907900.
    OpenUrlCrossRef
  17. ↵
    1. Flanagan JL,
    2. Guttman N
    (1960b) Pitch of periodic pulses without fundamental component. J Acoust Soc Am 32:1319–1328, doi:10.1121/1.1907901.
    OpenUrlCrossRef
  18. ↵
    1. Fletcher H
    (1940) Auditory patterns. Rev Mod Phys 12:47–61, doi:10.1103/RevModPhys.12.47.
    OpenUrlCrossRef
  19. ↵
    1. Gescheider GA
    (1985) Psychophysics: method, theory, and application (Lawrence Erlbaum, New York).
  20. ↵
    1. Glasberg BR,
    2. Moore BC
    (1990) Derivation of auditory filter shapes from notched-noise data. Hear Res 47:103–138, doi:10.1016/0378-5955(90)90170-T, pmid:2228789.
    OpenUrlCrossRefPubMed
  21. ↵
    1. Guttman N,
    2. Flanagan JL
    (1964) Pitch of high-pass-filtered pulse trains. J Acoust Soc Am 36:757–765, doi:10.1121/1.1919063.
    OpenUrlCrossRef
  22. ↵
    1. Heffner H,
    2. Whitfield IC
    (1976) Perception of the missing fundamental by cats. J Acoust Soc Am 59:915–919, doi:10.1121/1.380951, pmid:1262593.
    OpenUrlCrossRefPubMed
  23. ↵
    1. Izumi A
    (2001) Relative pitch perception in Japanese monkeys (Macaca fuscata) J Comp Psychol 115:127–131, doi:10.1037/0735-7036.115.2.127, pmid:11459159.
    OpenUrlCrossRefPubMed
  24. ↵
    1. Johnson LA,
    2. Della Santina CC,
    3. Wang X
    (2012) Temporal bone characterization and cochlear implant feasibility in the common marmoset (Callithrix jacchus) Hear Res 290:37–44, doi:10.1016/j.heares.2012.05.002, pmid:22583919.
    OpenUrlCrossRefPubMed
  25. ↵
    1. Kalluri S,
    2. Depireux DA,
    3. Shamma SA
    (2008) Perception and cortical neural coding of harmonic fusion in ferrets. J Acoust Soc Am 123:2701–2716, doi:10.1121/1.2902178, pmid:18529189.
    OpenUrlCrossRefPubMed
  26. ↵
    1. Klinge A,
    2. Itatani N,
    3. Klump GM
    (2010) in The neurophysiological basis of auditory perception, A comparative view on the perception of mistuning: constraints of the auditory periphery, eds Lopez-Poveda EA, Palmer AR, Meddis R (Springer, New York), pp 465–475.
  27. ↵
    1. Lina IA,
    2. Lauer AM
    (2013) Rapid measurement of auditory filter shape in mice using the auditory brainstem response and notched-noise. Hear Res 298:73–79, doi:10.1016/j.heares.2013.01.002, pmid:23347916.
    OpenUrlCrossRefPubMed
  28. ↵
    1. Lohr B,
    2. Dooling RJ
    (1998) Detection of changes in timbre and harmonicity in complex sounds by zebra finches (Taeniopygia guttata) and budgerigars Melopsittacus undulatus) J Comp Psychol 112:36–47, doi:10.1037/0735-7036.112.1.36, pmid:9528113.
    OpenUrlCrossRefPubMed
  29. ↵
    1. May BJ,
    2. Kimar S,
    3. Prosen CA
    (2006) Auditory filter shapes of CBA/CaJ mice: behavioral assessments. J Acoust Soc Am 120:321–330, doi:10.1121/1.2203593, pmid:16875229.
    OpenUrlCrossRefPubMed
  30. ↵
    1. May B,
    2. Moody DB,
    3. Stebbins WC
    (1989) Categorical perception of conspecific communication sounds by Japanes macaques, Macaca fuscata. J Acoust Soc Am 85:837–847, doi:10.1121/1.397555, pmid:2925998.
    OpenUrlCrossRefPubMed
  31. ↵
    1. Meddis R,
    2. Hewitt MJ
    (1991a) Virtual pitch and phase sensitivity of a computer model of the auditory periphery: I. Pitch identification. J Acoust Soc Am 89:2866–2882, doi:10.1121/1.400725.
    OpenUrlCrossRef
  32. ↵
    1. Meddis R,
    2. Hewitt MJ
    (1991b) Virtual pitch and phase sensitivity of a computer model of the auditory periphery: II. Phase sensitivity. J Acoust Soc Am 89:2883–2894, doi:10.1121/1.400726.
    OpenUrlCrossRef
  33. ↵
    1. Meddis R,
    2. O'Mard L
    (1997) A unitary model of pitch perception. J Acoust Soc Am 102:1811–1820, doi:10.1121/1.420088, pmid:9301058.
    OpenUrlCrossRefPubMed
  34. ↵
    1. Moore BCJ
    (2012) An introduction to the psychology of hearing (Emerald Group, United Kingdom).
  35. ↵
    1. Moore BC,
    2. Glasberg BR
    (1987) Formulae describing frequency selectivity as a function of frequency and level, and their use in calculating excitation patterns. Hear Res 28:209–225, doi:10.1016/0378-5955(87)90050-5, pmid:3654390.
    OpenUrlCrossRefPubMed
  36. ↵
    1. Moore BCJ,
    2. Ohgushi K
    (1993) Audibility of partials in inharmonic complex tones. J Acoust Soc Am 93:452–461, doi:10.1121/1.405625, pmid:8423261.
    OpenUrlCrossRefPubMed
  37. ↵
    1. Niemiec AJ,
    2. Yost WA,
    3. Shofner WP
    (1992) Behavioral measures of frequency selectivity in the chinchilla. J Acoust Soc Am 92:2636–2649, doi:10.1121/1.404380, pmid:1479127.
    OpenUrlCrossRefPubMed
  38. ↵
    1. Osmanski MS,
    2. Wang X
    (2011) Measurement of absolute auditory thresholds in the common marmoset (Callithrix jacchus) Hear Res 277:127–133, doi:10.1016/j.heares.2011.02.001, pmid:21303689.
    OpenUrlCrossRefPubMed
  39. ↵
    1. Oxenham AJ,
    2. Shera CA
    (2003) Estimates of human cochlear tuning at low levels using forward and simultaneous masking. J Assoc Res Otolaryngol 4:541–554, doi:10.1007/s10162-002-3058-y, pmid:14716510.
    OpenUrlCrossRefPubMed
  40. ↵
    1. Oxenham AJ,
    2. Benrstein JGW,
    3. Penagos H
    (2004) Correct tonotopic representation is necessary for complex pitch perception. Proc Natl Acad Sci U S A 101:1421–1425, doi:10.1073/pnas.0306958101, pmid:14718671.
    OpenUrlAbstract/FREE Full Text
  41. ↵
    1. Oxenham AJ,
    2. Micheyl C,
    3. Keebler MV,
    4. Loper A,
    5. Santurette S
    (2011) Pitch perception beyond the traditional existence region of pitch. Proc Natl Acad Sci U S A 108:7629–7634, doi:10.1073/pnas.1015291108, pmid:21502495.
    OpenUrlAbstract/FREE Full Text
  42. ↵
    1. Parker AJ,
    2. Newsome WT
    (1998) Sense and the single neuron: probing the physiology of perception. Annu Rev Neurosci 21:227–277, doi:10.1146/annurev.neuro.21.1.227, pmid:9530497.
    OpenUrlCrossRefPubMed
  43. ↵
    1. Patterson RD
    (1976) Auditory filter shapes derived with noise stimuli. J Acoust Soc Am 59:640–654, doi:10.1121/1.380914, pmid:1254791.
    OpenUrlCrossRefPubMed
  44. ↵
    1. Patterson RD,
    2. Uppenkamp S,
    3. Johnsrude IS,
    4. Griffiths TD
    (2002) The processing of temporal pitch and melody information in auditory cortex. Neuron 36:767–776, doi:10.1016/S0896-6273(02)01060-7, pmid:12441063.
    OpenUrlCrossRefPubMed
  45. ↵
    1. Penagos H,
    2. Melcher JR,
    3. Oxenham AJ
    (2004) A neural representation of pitch salience in nonprimary human auditory cortex revealed with functional magnetic resonance imaging. J Neurosci 24:6810–6815, doi:10.1523/JNEUROSCI.0383-04.2004, pmid:15282286.
    OpenUrlAbstract/FREE Full Text
  46. ↵
    1. Plack CJ,
    2. Oxenham AJ,
    3. Fay RR,
    4. Popper AN
    , eds (2005) Pitch: neural coding and perception (Springer, New York).
  47. ↵
    1. Plomp R
    (1964) The ear as a frequency analyzer. J Acoust Soc Am 98:1355–1364, pmid:5645825.
    OpenUrlPubMed
  48. ↵
    1. Plomp R,
    2. Mimpen AM
    (1968) The ear as a frequency analyzer II. J Acoust Soc Am 43:764–767, doi:10.1121/1.1910894.
    OpenUrlCrossRefPubMed
  49. ↵
    1. Remington ED,
    2. Osmanski MS,
    3. Wang X
    (2012) An operant conditioning method for studying auditory behaviors in marmoset monkeys. PLoS One 7:e47895, doi:10.1371/journal.pone.0047895, pmid:23110123.
    OpenUrlCrossRefPubMed
  50. ↵
    1. Shackleton TM,
    2. Carlyon RP
    (1994) The role of resolved and unresolved harmonics in pitch perception and frequency modulation discrimination. J Acoust Soc Am 95:3529–3540, doi:10.1121/1.409970, pmid:8046144.
    OpenUrlCrossRefPubMed
  51. ↵
    1. Shera CA,
    2. Guinan JJ Jr.,
    3. Oxenham AJ
    (2002) Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements. Proc Natl Acad Sci U S A 99:3318–3323, doi:10.1073/pnas.032675099, pmid:11867706.
    OpenUrlAbstract/FREE Full Text
  52. ↵
    1. Shofner WP
    (2002) Perception of the periodicity strength of complex sounds by the chinchilla. Hear Res 173:69–81, doi:10.1016/S0378-5955(02)00612-3, pmid:12372636.
    OpenUrlCrossRefPubMed
  53. ↵
    1. Shofner WP
    (2011) Perception of the missing fundamental by chinchillas in the presence of low-pass masking noise. J Assoc Res Otolaryngol 11:343–365, doi:10.1007/s10162-010-0237-0, pmid:20440634.
    OpenUrlCrossRefPubMed
  54. ↵
    1. Shofner WP,
    2. Campbell J
    (2012) Pitch strength of noise-vocoded harmonic tone complexes in normal hearing listeners. J Acoust Soc Am 132:EL398–EL404, doi:10.1121/1.4757697, pmid:23145701.
    OpenUrlCrossRefPubMed
  55. ↵
    1. Shofner WP,
    2. Chaney M
    (2012) Processing pitch in a nonhuman mammal (Chinchilla laniger) J Comp Psychol doi:10.1037/a0029734, doi:10.1037/a0029734, pmid:22985274, Advance online publication. Retrieved Sept. 17, 2012.
    OpenUrlCrossRefPubMed
  56. ↵
    1. Simmons AM,
    2. Buxbaum RC
    (1996) Neuroethological studies of cognitive and perceptual processes, eds Moss CF, Shettleworth SJ (Westview, Boulder, CO), pp 185–228.
  57. ↵
    1. Tomlinson RW,
    2. Schwarz DW
    (1988) Perception of the missing fundamental in nonhuman primates. J Acoust Soc Am 84:560–565, doi:10.1121/1.396833, pmid:3170947.
    OpenUrlCrossRefPubMed
  58. ↵
    1. Walker KM,
    2. Schnupp JW,
    3. Hart-Schnupp SM,
    4. King AJ,
    5. Bizley JK
    (2009) Pitch discrimination by ferrets for simple and complex sounds. J Acoust Soc Am 126:1321–1335, doi:10.1121/1.3179676, pmid:19739746.
    OpenUrlCrossRefPubMed
  59. ↵
    1. Wang X
    (2000) On cortical coding of vocal communication sounds in primates. Proc Natl Acad Sci U S A 97:11843–11849, doi:10.1073/pnas.97.22.11843, pmid:11050218.
    OpenUrlAbstract/FREE Full Text
  60. ↵
    1. Wang X
    (2007) Neural coding strategies in auditory cortex. Hear Res 229:81–93, doi:10.1016/j.heares.2007.01.019, pmid:17346911.
    OpenUrlCrossRefPubMed
  61. ↵
    1. Wang X,
    2. Walker KM
    (2012) Neural mechanisms for the abstraction and use of pitch information in auditory cortex. J Neurosci 32:13339–13342, doi:10.1523/JNEUROSCI.3814-12.2012, pmid:23015423.
    OpenUrlAbstract/FREE Full Text
  62. ↵
    1. Wright AA,
    2. Rivera JJ,
    3. Hulse SH,
    4. Shyan M,
    5. Neiworth JJ
    (2000) Music perception and octave generalization in rhesus monkeys. J Exp Psychol Gen 129:291–307, doi:10.1037/0096-3445.129.3.291, pmid:11006902.
    OpenUrlCrossRefPubMed
  63. ↵
    1. Yost WA,
    2. Patterson R,
    3. Sheft S
    (1996) A time domain description for the pitch strength of iterated rippled noise. J Acoust Soc Am 99:1066–1078, doi:10.1121/1.414593, pmid:8609290.
    OpenUrlCrossRefPubMed
Back to top

In this issue

The Journal of Neuroscience: 33 (21)
Journal of Neuroscience
Vol. 33, Issue 21
22 May 2013
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by author
  • Advertising (PDF)
  • Ed Board (PDF)
Email

Thank you for sharing this Journal of Neuroscience article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
The Role of Harmonic Resolvability in Pitch Perception in a Vocal Nonhuman Primate, the Common Marmoset (Callithrix jacchus)
(Your Name) has forwarded a page to you from Journal of Neuroscience
(Your Name) thought you would be interested in this article in Journal of Neuroscience.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
The Role of Harmonic Resolvability in Pitch Perception in a Vocal Nonhuman Primate, the Common Marmoset (Callithrix jacchus)
Michael S. Osmanski, Xindong Song (宋欣东), Xiaoqin Wang
Journal of Neuroscience 22 May 2013, 33 (21) 9161-9168; DOI: 10.1523/JNEUROSCI.0066-13.2013

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Request Permissions
Share
The Role of Harmonic Resolvability in Pitch Perception in a Vocal Nonhuman Primate, the Common Marmoset (Callithrix jacchus)
Michael S. Osmanski, Xindong Song (宋欣东), Xiaoqin Wang
Journal of Neuroscience 22 May 2013, 33 (21) 9161-9168; DOI: 10.1523/JNEUROSCI.0066-13.2013
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

Articles

  • Choice Behavior Guided by Learned, But Not Innate, Taste Aversion Recruits the Orbitofrontal Cortex
  • Maturation of Spontaneous Firing Properties after Hearing Onset in Rat Auditory Nerve Fibers: Spontaneous Rates, Refractoriness, and Interfiber Correlations
  • Insulin Treatment Prevents Neuroinflammation and Neuronal Injury with Restored Neurobehavioral Function in Models of HIV/AIDS Neurodegeneration
Show more Articles

Behavioral/Cognitive

  • Animal-to-Animal Variability in Partial Hippocampal Remapping in Repeated Environments
  • Brain System for Social Categorization by Narrative Roles
  • Frequency-Dependent Plasticity in the Temporal Association Cortex Originates from the Primary Auditory Cortex, and Is Modified by the Secondary Auditory Cortex and the Medial Geniculate Body
Show more Behavioral/Cognitive
  • Home
  • Alerts
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Issue Archive
  • Collections

Information

  • For Authors
  • For Advertisers
  • For the Media
  • For Subscribers

About

  • About the Journal
  • Editorial Board
  • Privacy Policy
  • Contact
(JNeurosci logo)
(SfN logo)

Copyright © 2022 by the Society for Neuroscience.
JNeurosci Online ISSN: 1529-2401

The ideas and opinions expressed in JNeurosci do not necessarily reflect those of SfN or the JNeurosci Editorial Board. Publication of an advertisement or other product mention in JNeurosci should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in JNeurosci.