An Improved Speaker Identification Technique Employing Multiple Representations Of The Linear Prediction Coefficients


A novel Linear Prediction (LPC) based Automatic Speaker Identification (ASI) technique employing multiple representations of the LPC is presented. The proposed ASI system has two modes namely, the encoding mode, and the Speaker Identification (SI) mode. During the encoding mode, otherwise known as the training mode, the Linear Prediction Coefficients (LPC) are extracted for each speaker as speech features. Multiple Representation Split Vector Quantization (MRSVQ) [1] is employed to form representative codebooks corresponding to each representation, for each speaker. During SI (running) mode, the ASI system identifies the codebooks of the speaker in the database that best matches the LPC extracted from the speech signal of the unknown speaker. The synthesized all pole vocal tract transfer function is used as a measure of vocal tract for ASI. Employing the normalized vocal tract transfer function error measure, the proposed technique is consistently found to obtain enhanced ASI accuracy in comparison with vector quantization employing existing LPC representation, at the expense of a modest increase in computational complexity. The ASI technique presented here can be used in a stand-alone system or as part of an ASI environment.

Publication Date


Publication Title

Proceedings - IEEE International Symposium on Circuits and Systems



Document Type

Article; Proceedings Paper

Personal Identifier


Socpus ID

0037745901 (Scopus)

Source API URL


This document is currently not available here.