Speaker Identification Employing Waveform Based Speech Codec
A novel approach for Automatic Speaker Identification (ASI) employing Waveform based signal representation in multiple domains is presented. The proposed approach involves two stages, namely, the encoding stage, and the decoding stage. During the encoding stage (training mode), mixed transform coding, in conjunction with split vector Quantization (MTSVQ) is employed to form representative codebooks for each speaker. During the decoding stage (running mode), the vectors that best represent the unknown input vector are selected to represent the speech vectors. A normalised matching accuracy measure is developed to evaluate the proposed algorithm's performance. The resulting technique is consistently found to obtain enhanced ASI accuracy in comparison with the earlier approaches as vector quantization employing single transform domains.
Midwest Symposium on Circuits and Systems
Number of Pages
Source API URL
Mikhael, Wasfy B. and Premakanthan, Pravinkumar, "Speaker Identification Employing Waveform Based Speech Codec" (2002). Scopus Export 2000s. 2779.