Automatic Speech Assessment For Aphasic Patients Based On Syllable-Level Embedding And Supra-Segmental Duration Features
Keywords
Aphasia; ASR; Cantonese; Speech assessment; Word embedding
Abstract
Aphasia is a type of acquired language impairment resulting from brain injury. Speech assessment is an important part of the comprehensive assessment process for aphasic patients. It is based on the acoustical and linguistic analysis of patients' speech elicited through pre-defined story-telling tasks. This type of narrative spontaneous speech embodies multi-fold atypical characteristics related to the underlying language impairment. This paper presents an investigation on automatic speech assessment for Cantonese-speaking aphasic patients using an automatic speech recognition (ASR) system. A novel approach to extracting robust text features from erroneous ASR output is developed based on word embedding methods. The text features can effectively distinguish the stories told by an impaired speaker from those by unimpaired ones. On the other hand, a set of supra-segmental duration features are derived from syllable-level time alignments produced by the ASR system, to characterize the atypical prosody of impaired speech. The proposed text features, duration features and their combination are evaluated in a binary classification experiment as well as in automatic prediction of subjective assessment score. The results clearly show that the text features are very effective in the intended task of aphasia assessment, while using duration features could provide additional benefit.
Publication Date
9-10-2018
Publication Title
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume
2018-April
Number of Pages
5994-5998
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/ICASSP.2018.8461289
Copyright Status
Unknown
Socpus ID
85054218163 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/85054218163
STARS Citation
Qin, Ying; Lee, Tan; and Hin Kong, Anthony Pak, "Automatic Speech Assessment For Aphasic Patients Based On Syllable-Level Embedding And Supra-Segmental Duration Features" (2018). Scopus Export 2015-2019. 10147.
https://stars.library.ucf.edu/scopus2015/10147