Title
Multidimensional Model Based Speech Signal Representations For Automatic Speaker Identification
Keywords
Linear Prediction; Speaker Identification; Vector Quantization
Abstract
A novel Model based Automatic Speaker Identification (M-ASI) technique employing multidimensional representations of the Linear Prediction (LP) coefficients, and the LP residual is proposed. During the training mode, the LP coefficients, and the LP residuals extracted from the speech signal are projected into multiple domains, and vector quantized codebooks are obtained using energy based split vector quantization. During the running mode, a closest match is found by comparing the speech vectors of the unknown speaker, and the reconstructed speech employing each of the known codebooks stored in the database. Employing a normalized matching accuracy measure, the proposed technique is consistently found to obtain enhanced ASI accuracy in comparison with Vector Quantization (VQ) employing existing single dimensional LP based ASI approaches at the expense of a modest increase in computational complexity. 100% speaker identification accuracy is obtained with a low signal-coding rate of less than 2.91 bps.
Publication Date
12-1-2004
Publication Title
Proceedings of the IASTED International Conference on Circuits, Signals, and Systems
Number of Pages
44-47
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
Copyright Status
Unknown
Socpus ID
11144350546 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/11144350546
STARS Citation
Premakanthan, Pravinkumar and Mikhael, Wasfy B., "Multidimensional Model Based Speech Signal Representations For Automatic Speaker Identification" (2004). Scopus Export 2000s. 4952.
https://stars.library.ucf.edu/scopus2000/4952