Cantonese AphasiaBank: An annotated database of spoken discourse and co-verbal gestures by healthy and language-impaired native Cantonese speakers

Behav Res Methods. 2019 Jun;51(3):1131-1144. doi: 10.3758/s13428-018-1043-6.

Abstract

This article reports the construction of the Cantonese AphasiaBank, a multimodal annotated database of spoken discourse and co-verbal gestures by healthy native speakers of Cantonese and individuals with language impairment. The corpus was established as a foundation for aphasiologists and clinicians to use in designing and conducting research into theoretical and clinical issues related to acquired language disorders in Chinese. The purpose, structure, and levels of annotation of the database (which contains part-of-speech-annotated orthographic transcripts with Romanization and the corresponding videos) are described. The discussion presents the challenges of building a spoken database of a language that is not linguistically well-researched and that lacks a standardized written form for many of its lexical items, and explains how these issues were addressed. Most importantly, the article highlights the potential of the Cantonese AphasiaBank as a powerful research tool for linguists and psycholinguists.

Keywords: Aphasia; Cantonese; Chinese; Discourse; Gestures; Language database; Stroke.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adolescent
  • Adult
  • Data Curation
  • Databases, Factual
  • Female
  • Gestures*
  • Humans
  • Language
  • Linguistics
  • Male
  • Middle Aged
  • Psycholinguistics
  • Speech*
  • Young Adult