Contributions of cochlea-scaled entropy and consonant-vowel boundaries to prediction of speech intelligibility in noise

J Acoust Soc Am. 2012 May;131(5):4104-13. doi: 10.1121/1.3695401.

Abstract

Recent evidence suggests that spectral change, as measured by cochlea-scaled entropy (CSE), predicts speech intelligibility better than the information carried by vowels or consonants in sentences. Motivated by this finding, the present study investigates whether intelligibility indices implemented to include segments marked with significant spectral change better predict speech intelligibility in noise than measures that include all phonetic segments paying no attention to vowels/consonants or spectral change. The prediction of two intelligibility measures [normalized covariance measure (NCM), coherence-based speech intelligibility index (CSII)] is investigated using three sentence-segmentation methods: relative root-mean-square (RMS) levels, CSE, and traditional phonetic segmentation of obstruents and sonorants. While the CSE method makes no distinction between spectral changes occurring within vowels/consonants, the RMS-level segmentation method places more emphasis on the vowel-consonant boundaries wherein the spectral change is often most prominent, and perhaps most robust, in the presence of noise. Higher correlation with intelligibility scores was obtained when including sentence segments containing a large number of consonant-vowel boundaries than when including segments with highest entropy or segments based on obstruent/sonorant classification. These data suggest that in the context of intelligibility measures the type of spectral change captured by the measure is important.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Cochlea / physiology*
  • Female
  • Humans
  • Male
  • Noise*
  • Perceptual Masking / physiology
  • Phonetics*
  • Signal-To-Noise Ratio
  • Sound Spectrography
  • Speech Intelligibility / physiology*