Long-term average spectral characteristics of Cantonese alaryngeal speech

Manwa L Ng; Hanjun Liu; Qin Zhao; Paul K Y Lam

doi:10.1016/j.anl.2008.12.005

Long-term average spectral characteristics of Cantonese alaryngeal speech

Auris Nasus Larynx. 2009 Oct;36(5):571-7. doi: 10.1016/j.anl.2008.12.005. Epub 2009 Mar 3.

Authors

Manwa L Ng¹, Hanjun Liu, Qin Zhao, Paul K Y Lam

Affiliation

¹ The University of Hong Kong, Prince Philip Dental Hospital, Sai Ying Pun, Hong Kong, SAR, China. manwa@hku.hk

PMID: 19261410
DOI: 10.1016/j.anl.2008.12.005

Abstract

Objective: In Hong Kong, esophageal (SE), tracheoesophageal (TE), electrolaryngeal (EL), and pneumatic artificial laryngeal (PA) speech are commonly used by laryngectomees as a means to regain verbal communication after total laryngectomy. While SE and TE speech has been studied to some extent, little is known regarding the EL and PA sound quality. The present study examined the sound quality associated with SE, TE, EL, and PA speech, and compared with that associated with laryngeal (NL) speech by using long-term average speech spectra (LTAS).

Methods: Continuous speech samples of reading a 136-word passage were obtained from NL, SE, TE, EL, and PA speakers of Cantonese. The alaryngeal speakers were all superior speakers selected from the New Voice Club of Hong Kong, which is a self-help organization for the laryngectomees in Hong Kong. TE speakers were fitted with Provox valve, and EL speakers used Servox-type electrolarynx. Speech samples were digitized at 20kHz and 16bits/sample by using Praat, based on which LTAS contours were developed. First spectral peak (FSP), mean spectral energy (MSE), and spectral tilt (ST) derived from the LTAS contours associated with different speaker groups were compared.

Results: Data revealed all speakers generally exhibited similar LTA contours. However, PA speakers exhibited the lowest average FSP value and the greatest average MSE value. NL phonation was associated with a significantly greater ST value than alaryngeal speech of Cantonese.

Conclusion: The differences in FSP, MSE, and ST values in different speaker groups may be related to the different sound sources being used by the laryngectomees, and the difference in the way the sound source is coupled with the vocal tract system.

Publication types

Comparative Study

MeSH terms

Aged
Aged, 80 and over
Analysis of Variance
Asian People*
Humans
Laryngectomy
Larynx, Artificial
Male
Middle Aged
Phonation
Phonetics*
Prosthesis Design
Sound Spectrography
Speech Acoustics
Speech, Alaryngeal* / methods
Speech, Esophageal
Time Factors
Voice Quality