Title
Speaker Identification Employing Waveform Based Speech Codec
Abstract
A novel approach for Automatic Speaker Identification (ASI) employing Waveform based signal representation in multiple domains is presented. The proposed approach involves two stages, namely, the encoding stage, and the decoding stage. During the encoding stage (training mode), mixed transform coding, in conjunction with split vector Quantization (MTSVQ) is employed to form representative codebooks for each speaker. During the decoding stage (running mode), the vectors that best represent the unknown input vector are selected to represent the speech vectors. A normalised matching accuracy measure is developed to evaluate the proposed algorithm's performance. The resulting technique is consistently found to obtain enhanced ASI accuracy in comparison with the earlier approaches as vector quantization employing single transform domains.
Publication Date
1-1-2002
Publication Title
Midwest Symposium on Circuits and Systems
Volume
3
Number of Pages
340-343
Document Type
Article
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/MWSCAS.2002.1187042
Copyright Status
Unknown
Socpus ID
0036976768 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/0036976768
STARS Citation
Mikhael, Wasfy B. and Premakanthan, Pravinkumar, "Speaker Identification Employing Waveform Based Speech Codec" (2002). Scopus Export 2000s. 2779.
https://stars.library.ucf.edu/scopus2000/2779