Title

Multidimensional Model Based Speech Signal Representations For Automatic Speaker Identification

Keywords

Linear Prediction; Speaker Identification; Vector Quantization

Abstract

A novel Model based Automatic Speaker Identification (M-ASI) technique employing multidimensional representations of the Linear Prediction (LP) coefficients, and the LP residual is proposed. During the training mode, the LP coefficients, and the LP residuals extracted from the speech signal are projected into multiple domains, and vector quantized codebooks are obtained using energy based split vector quantization. During the running mode, a closest match is found by comparing the speech vectors of the unknown speaker, and the reconstructed speech employing each of the known codebooks stored in the database. Employing a normalized matching accuracy measure, the proposed technique is consistently found to obtain enhanced ASI accuracy in comparison with Vector Quantization (VQ) employing existing single dimensional LP based ASI approaches at the expense of a modest increase in computational complexity. 100% speaker identification accuracy is obtained with a low signal-coding rate of less than 2.91 bps.

Publication Date

12-1-2004

Publication Title

Proceedings of the IASTED International Conference on Circuits, Signals, and Systems

Number of Pages

44-47

Document Type

Article; Proceedings Paper

Personal Identifier

scopus

Socpus ID

11144350546 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/11144350546

This document is currently not available here.

Share

COinS