Long-term average spectral characteristics of Cantonese alaryngeal speech

Auris Nasus Larynx. 2009 Oct;36(5):571-7. doi: 10.1016/j.anl.2008.12.005. Epub 2009 Mar 3.

Abstract

Objective: In Hong Kong, esophageal (SE), tracheoesophageal (TE), electrolaryngeal (EL), and pneumatic artificial laryngeal (PA) speech are commonly used by laryngectomees as a means to regain verbal communication after total laryngectomy. While SE and TE speech has been studied to some extent, little is known regarding the EL and PA sound quality. The present study examined the sound quality associated with SE, TE, EL, and PA speech, and compared with that associated with laryngeal (NL) speech by using long-term average speech spectra (LTAS).

Methods: Continuous speech samples of reading a 136-word passage were obtained from NL, SE, TE, EL, and PA speakers of Cantonese. The alaryngeal speakers were all superior speakers selected from the New Voice Club of Hong Kong, which is a self-help organization for the laryngectomees in Hong Kong. TE speakers were fitted with Provox valve, and EL speakers used Servox-type electrolarynx. Speech samples were digitized at 20kHz and 16bits/sample by using Praat, based on which LTAS contours were developed. First spectral peak (FSP), mean spectral energy (MSE), and spectral tilt (ST) derived from the LTAS contours associated with different speaker groups were compared.

Results: Data revealed all speakers generally exhibited similar LTA contours. However, PA speakers exhibited the lowest average FSP value and the greatest average MSE value. NL phonation was associated with a significantly greater ST value than alaryngeal speech of Cantonese.

Conclusion: The differences in FSP, MSE, and ST values in different speaker groups may be related to the different sound sources being used by the laryngectomees, and the difference in the way the sound source is coupled with the vocal tract system.

Publication types

  • Comparative Study

MeSH terms

  • Aged
  • Aged, 80 and over
  • Analysis of Variance
  • Asian People*
  • Humans
  • Laryngectomy
  • Larynx, Artificial
  • Male
  • Middle Aged
  • Phonation
  • Phonetics*
  • Prosthesis Design
  • Sound Spectrography
  • Speech Acoustics
  • Speech, Alaryngeal* / methods
  • Speech, Esophageal
  • Time Factors
  • Voice Quality