Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Collections
    • Podcast
  • ALERTS
  • FOR AUTHORS
    • Information for Authors
    • Fees
    • Journal Clubs
    • eLetters
    • Submit
  • EDITORIAL BOARD
  • ABOUT
    • Overview
    • Advertise
    • For the Media
    • Rights and Permissions
    • Privacy Policy
    • Feedback
  • SUBSCRIBE

User menu

  • Log in
  • My Cart

Search

  • Advanced search
Journal of Neuroscience
  • Log in
  • My Cart
Journal of Neuroscience

Advanced Search

Submit a Manuscript
  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Collections
    • Podcast
  • ALERTS
  • FOR AUTHORS
    • Information for Authors
    • Fees
    • Journal Clubs
    • eLetters
    • Submit
  • EDITORIAL BOARD
  • ABOUT
    • Overview
    • Advertise
    • For the Media
    • Rights and Permissions
    • Privacy Policy
    • Feedback
  • SUBSCRIBE
PreviousNext
Journal Club

Understanding How Neural Circuits Measure Pitch

Daniel Bendor
Journal of Neuroscience 2 March 2011, 31 (9) 3141-3142; DOI: https://doi.org/10.1523/JNEUROSCI.6077-10.2011
Daniel Bendor
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Info & Metrics
  • eLetters
  • PDF
Loading

Pitch perception plays an important role in audition, especially in the context of music and speech. The brain could theoretically compute pitch from either the spatial or temporal pattern of excitation in the auditory periphery. However, computational pitch models relying on only a single information source from the auditory periphery (spatial or temporal) have been unable to account for the entire set of psychophysical phenomenon associated with pitch perception. One attractive alternate model uses a synergistic approach, relying on the spatial pattern of temporal information (Shamma, 1985). Recently, Cedolin and Delgutte (2010) provided the first physiological evidence that spatiotemporal information from auditory nerve responses can be used to extract pitch.

We hear pitch when a sound is harmonic (spectrally periodic) or has a regularly repeating acoustic structure (temporally periodic). The frequency of this periodicity (fundamental frequency) determines what pitch we perceive, with a higher fundamental frequency generating a higher perceived pitch. Harmonic sounds like those generated by a musical instrument or the human vocal tract produce a strong pitch sensation, while white noise (which is aperiodic) does not generate a pitch percept.

No explicit representation for pitch exists in the cochlea for harmonic sounds; however, if the frequency of each harmonic component can be measured, then later stages of the auditory system can theoretically use this information to calculate the fundamental frequency. The frequencies of the harmonics can be detected by the spatial pattern of excitation along the cochlea's basilar membrane. This spatial pattern is then encoded by the firing rates of auditory nerve fibers, which are determined by the proximity of a fiber's best frequency to a harmonic. Although this works well at low sound levels, at higher sound levels the responses of auditory nerve fibers saturate, minimizing the difference in firing rate between fibers with best frequencies close to and far away from a given harmonic. Because we can still hear a sound's pitch at these sound levels, something other than an auditory nerve fiber's firing rate must be used to detect harmonics.

One way to detect harmonics of louder sounds is to synergistically use temporal information to enhance the contrast in the spatial pattern of excitation. This is possible because the cochlear traveling wave generated by the sound does not have a constant speed, but instead slows down (causing a phase transition) at locations along the basilar membrane with the highest resonance. In other words, the point of maximal excitation along the basilar membrane for a spectral peak will also have the largest phase transition in the cochlear traveling wave. This would translate into a larger delay in the responses of two neighboring auditory nerve fibers when their best frequency is closer to a harmonic's frequency. Thus harmonics could be detected by measuring where the largest differences are in the responses of neighboring auditory nerve fibers, which theoretically could be computed using lateral inhibition at a later stage in the auditory system.

To test the spatiotemporal pitch model, Cedolin and Delgutte (2010) recorded single-unit responses to harmonic complex tones from the auditory nerve of anesthetized cats. Given the difficulty in finely sampling neighboring auditory nerve fibers with single-unit recordings, Cedolin and Delgutte (2010) instead used the property of scaling invariance, where local spatiotemporal response patterns can be inferred from the response of a single auditory nerve fiber. The principle of scaling invariance states that a small shift in cochlear location will have a similar response (magnitude and phase) for a proportional shift in frequency. Therefore by measuring responses to multiple harmonic complex tones (with varying fundamental frequencies) within a single auditory nerve fiber, the authors could infer what the spatial and temporal pattern of activity across a population of auditory nerve fibers would look like for a single harmonic complex tone.

To measure the spatiotemporal representation's ability to encode a complex tone's harmonics, Cedolin and Delgutte (2010) calculated the mean absolute spatial derivative (MASD), which is the mean difference in activity between two neighboring auditory nerve fibers. With a delay between the two fibers, one fiber's response will slightly precede the other fiber's response, causing a difference in activity at that time point, increasing the MASD. When there is no delay, the temporal patterns of the two fibers are similar, minimizing the difference in activity and decreasing the MASD. If the MASD is largest for auditory nerve fibers with best frequencies near the frequencies of the harmonics, the MASD can represent pitch. The MASD representation can be directly compared to the traditional spatial representation of pitch (rate–place profile), in which peaks in the spatial distribution of firing rates correspond to the frequencies of the harmonics.

The authors first compared the MASD and rate–place profiles of harmonic complex tones at different sound levels. Rate–place profiles failed to detect harmonics at louder sound levels because of the response saturation of auditory nerve fibers. On the other hand, the spatiotemporal representation (based on the MASD profile) was still able to detect harmonics at these higher sound levels where rate–place profiles could not extract pitch. Because of the upper limit of phase locking (∼3–5 kHz), the spatiotemporal representation's ability to extract pitch was limited by frequency (to fundamental frequencies <1100 Hz). This is not a shortcoming of the spatiotemporal model, because our ability to perceive pitch is also frequency limited. In contrast, a rate–place representation only depends on the spectral resolvability of harmonics, which does not degrade with frequency.

Humans are insensitive to phase differences between harmonics when they are spectrally resolved on the basilar membrane. Because phase differences affect the relative timing of auditory nerve responses, these differences could compromise the spatiotemporal representation of pitch. Using three different phase relationships between harmonics (cosine phase, alternating phase, and Schroeder phase), the authors found no statistically significant difference in the MASD profile, showing an agreement between the spatiotemporal representation of pitch and human psychophysical results.

To extract pitch from a spatiotemporal representation, it is thought that the brain may use a harmonic template. Using responses pooled over the entire population of auditory nerve fibers, the authors next measured how accurately a harmonic template could estimate the fundamental frequency. A spatiotemporal representation of pitch was accurate (SD < 2%) for fundamental frequencies between 350 Hz and 1100 Hz. The lower limit of this range is similar to what has been shown for a rate–place-based model (Cedolin and Delgutte, 2005). Although this range covers only the upper portion of the range of fundamental frequencies associated with pitch, lower fundamental frequencies could still be measured using a secondary temporally based mechanism relying on unresolved harmonics (Carlyon and Shackleton, 1994).

Delgutte and colleagues have previously tested both the temporal (Cariani and Delgutte, 1996a,b) and rate–place (Cedolin and Delgutte, 2005) representation of pitch. This report by Cedolin and Delgutte (2010) confirms the viability of a third class of pitch models, relying on spatiotemporal information. Whether one or more of these representations of pitch are actually used by the auditory system cannot be determined by the available data from the auditory nerve. Recording from the next stage in the auditory system, the cochlear nucleus, may help identify which pitch mechanism(s) are used. Identification of lateral inhibition or cross-frequency coincidence detection could further support the use of spatiotemporal information in computing pitch.

Both the rate–place and spatiotemporal models require a harmonic template at some later stage of the auditory system to compute pitch. Recordings from auditory cortex have identified neurons tuned to multiple harmonically related frequencies that could act as a harmonic template (Kadia and Wang, 2003). Pitch-selective neurons in the low-frequency region of primate auditory cortex (Bendor and Wang, 2005) may provide another starting point for identifying the pitch mechanism(s) used by the auditory system. These neurons are tuned to a sound's fundamental frequency, and are sensitive to acoustic parameters important for pitch perception such as temporal regularity and repetition rate. By identifying the cortical and subcortical inputs to these neurons, pathways involved in processing pitch might be identified. Pitch may be computed using a single mechanism (Meddis and O'Mard, 1997), or multiple mechanisms may be used (Carlyon and Shackleton, 1994).

Our understanding of pitch perception is based largely on human psychophysical data. Although missing fundamental pitch perception has been demonstrated in cats, whether cats hear pitch in the same way as humans do is not well understood. Because the cochlear size in cats is smaller than in humans, spectral resolvability is worse. Properties of pitch perception (and potentially the involved mechanisms) are different when harmonics are resolved or unresolved. This means that for a given acoustic stimulus, the harmonics could be unresolved for a cat and resolved for a human, making human psychophysical data incompatible in predicting what the cat may perceive. Understanding species-specific differences in pitch perception (Walker et al., 2009) will be necessary to interpret potential neural mechanism(s) responsible for measuring a sound's pitch.

Footnotes

  • Editor's Note: These short, critical reviews of recent papers in the Journal, written exclusively by graduate students or postdoctoral fellows, are intended to summarize the important findings of the paper and provide additional insight and commentary. For more information on the format and purpose of the Journal Club, please see http://www.jneurosci.org/misc/ifa_features.shtml.

  • Support for this work was provided by a postdoctoral fellowship from Merck and the Helen Hay Whitney Foundation.

  • Correspondence should be addressed to Daniel Bendor, Picower Institute for Learning and Memory, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, 43 Vassar Street, Building 46, Room 5233, Cambridge, MA 02139. dbendor{at}mit.edu

References

  1. ↵
    1. Bendor D,
    2. Wang X
    (2005) The neuronal representation of pitch in primate auditory cortex. Nature 436:1161–1165.
    OpenUrlCrossRefPubMed
  2. ↵
    1. Cariani PA,
    2. Delgutte B
    (1996a) Neural correlates of the pitch of complex tones. I. Pitch and pitch salience. J Neurophysiol 76:1698–1716.
    OpenUrlAbstract/FREE Full Text
  3. ↵
    1. Cariani PA,
    2. Delgutte B
    (1996b) Neural correlates of the pitch of complex tones. II. Pitch shift, pitch ambiguity, phase invariance, pitch circularity, rate pitch, and the dominance region for pitch. J Neurophysiol 76:1717–1734.
    OpenUrlAbstract/FREE Full Text
  4. ↵
    1. Carlyon RP,
    2. Shackleton TM
    (1994) Comparing the fundamental frequencies of resolved and unresolved harmonics: evidence for two pitch mechanisms? J Acoust Soc Am 95:3541–3554.
    OpenUrlCrossRef
  5. ↵
    1. Cedolin L,
    2. Delgutte B
    (2005) Pitch of complex tones: rate-place and interspike interval representations in the auditory nerve. J Neurophysiol 94:347–362.
    OpenUrlAbstract/FREE Full Text
  6. ↵
    1. Cedolin L,
    2. Delgutte B
    (2010) Spatiotemporal representation of the pitch of harmonic complex tones in the auditory nerve. J Neurosci 30:12712–12724.
    OpenUrlAbstract/FREE Full Text
  7. ↵
    1. Kadia SC,
    2. Wang X
    (2003) Spectral integration in A1 of awake primates: neurons with single- and multipeaked tuning characteristics. J Neurophysiol 89:1603–1622.
    OpenUrlAbstract/FREE Full Text
  8. ↵
    1. Meddis R,
    2. O'Mard L
    (1997) A unitary model of pitch perception. J Acoust Soc Am 102:1811–1820.
    OpenUrlCrossRefPubMed
  9. ↵
    1. Shamma SA
    (1985) Speech processing in the auditory system II: lateral inhibition and the central processing of speech evoked activity in the auditory nerve. J Acoust Soc Am 78:1622–1632.
    OpenUrlCrossRefPubMed
  10. ↵
    1. Walker KM,
    2. Schnupp JW,
    3. Hart-Schnupp SM,
    4. King AJ,
    5. Bizley JK
    (2009) Pitch discrimination by ferrets for simple and complex sounds. J Acoust Soc Am 126:1321–1335.
    OpenUrlCrossRefPubMed
Back to top

In this issue

The Journal of Neuroscience: 31 (9)
Journal of Neuroscience
Vol. 31, Issue 9
2 Mar 2011
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by author
  • Advertising (PDF)
  • Ed Board (PDF)
Email

Thank you for sharing this Journal of Neuroscience article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Understanding How Neural Circuits Measure Pitch
(Your Name) has forwarded a page to you from Journal of Neuroscience
(Your Name) thought you would be interested in this article in Journal of Neuroscience.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
Understanding How Neural Circuits Measure Pitch
Daniel Bendor
Journal of Neuroscience 2 March 2011, 31 (9) 3141-3142; DOI: 10.1523/JNEUROSCI.6077-10.2011

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Request Permissions
Share
Understanding How Neural Circuits Measure Pitch
Daniel Bendor
Journal of Neuroscience 2 March 2011, 31 (9) 3141-3142; DOI: 10.1523/JNEUROSCI.6077-10.2011
del.icio.us logo Digg logo Reddit logo Twitter logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Footnotes
    • References
  • Info & Metrics
  • eLetters
  • PDF

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

  • Parabrachial Projections to PAG-RVM Axis May Promote Placebo Hypoalgesia and Nocebo Hyperalgesia
  • Differential Dopamine Dynamics in Adolescents and Adults
  • Complementing Neuroregeneration: Deciphering the Role of Neuro-Immune Interactions in CNS Repair
Show more Journal Club
  • Home
  • Alerts
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Issue Archive
  • Collections

Information

  • For Authors
  • For Advertisers
  • For the Media
  • For Subscribers

About

  • About the Journal
  • Editorial Board
  • Privacy Policy
  • Contact
(JNeurosci logo)
(SfN logo)

Copyright © 2022 by the Society for Neuroscience.
JNeurosci Online ISSN: 1529-2401

The ideas and opinions expressed in JNeurosci do not necessarily reflect those of SfN or the JNeurosci Editorial Board. Publication of an advertisement or other product mention in JNeurosci should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in JNeurosci.