On the balance of envelope and temporal fine structure in the encoding of speech in the early auditory system

J Acoust Soc Am. 2013 May;133(5):2818-33. doi: 10.1121/1.4795783.

Abstract

There is much debate on how the spectrotemporal modulations of speech (or its spectrogram) are encoded in the responses of the auditory nerve, and whether speech intelligibility is best conveyed via the "envelope" (E) or "temporal fine-structure" (TFS) of the neural responses. Wide use of vocoders to resolve this question has commonly assumed that manipulating the amplitude-modulation and frequency-modulation components of the vocoded signal alters the relative importance of E or TFS encoding on the nerve, thus facilitating assessment of their relative importance to intelligibility. Here we argue that this assumption is incorrect, and that the vocoder approach is ineffective in differentially altering the neural E and TFS. In fact, we demonstrate using a simplified model of early auditory processing that both neural E and TFS encode the speech spectrogram with constant and comparable relative effectiveness regardless of the vocoder manipulations. However, we also show that neural TFS cues are less vulnerable than their E counterparts under severe noisy conditions, and hence should play a more prominent role in cochlear stimulation strategies.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustic Stimulation
  • Cochlear Implantation
  • Cochlear Nerve / physiopathology*
  • Correction of Hearing Impairment / methods
  • Cues*
  • Hearing Disorders / physiopathology*
  • Hearing Disorders / psychology
  • Hearing Disorders / therapy
  • Humans
  • Models, Neurological
  • Noise / adverse effects
  • Pattern Recognition, Automated
  • Perceptual Masking
  • Persons With Hearing Impairments / psychology
  • Persons With Hearing Impairments / rehabilitation
  • Signal Processing, Computer-Assisted
  • Sound Spectrography
  • Speech Acoustics*
  • Speech Intelligibility*
  • Speech Perception*
  • Time Factors
  • Time Perception*