Title
An Improved Speaker Identification Technique Employing Multiple Representations Of The Linear Prediction Coefficients
Abstract
A novel Linear Prediction (LPC) based Automatic Speaker Identification (ASI) technique employing multiple representations of the LPC is presented. The proposed ASI system has two modes namely, the encoding mode, and the Speaker Identification (SI) mode. During the encoding mode, otherwise known as the training mode, the Linear Prediction Coefficients (LPC) are extracted for each speaker as speech features. Multiple Representation Split Vector Quantization (MRSVQ) [1] is employed to form representative codebooks corresponding to each representation, for each speaker. During SI (running) mode, the ASI system identifies the codebooks of the speaker in the database that best matches the LPC extracted from the speech signal of the unknown speaker. The synthesized all pole vocal tract transfer function is used as a measure of vocal tract for ASI. Employing the normalized vocal tract transfer function error measure, the proposed technique is consistently found to obtain enhanced ASI accuracy in comparison with vector quantization employing existing LPC representation, at the expense of a modest increase in computational complexity. The ASI technique presented here can be used in a stand-alone system or as part of an ASI environment.
Publication Date
7-14-2003
Publication Title
Proceedings - IEEE International Symposium on Circuits and Systems
Volume
2
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
Copyright Status
Unknown
Socpus ID
0037745901 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/0037745901
STARS Citation
Mikhael, W. B. and Premakanthan, Pravinkumar, "An Improved Speaker Identification Technique Employing Multiple Representations Of The Linear Prediction Coefficients" (2003). Scopus Export 2000s. 1686.
https://stars.library.ucf.edu/scopus2000/1686