The use of broad phonetic class models in speaker recognition

View all publications


Reference

Title: The use of broad phonetic class models in speaker recognition

Author(s): Johan Koolwaaij & Johan de Veth

Reference: Proceedings of the International Conference on Spoken Language Processing and Student Day (ICSLP'98), Vol. 7, pp. 3357-3362

Keywords: Speaker Recognition

There is a PostScript version (56798 bytes) available.

There is a PDF version (44082 bytes) available.

Abstract

In this paper we investigate the use of broad phonetic class (BPC) models in a text independent speaker recognition task. These models can be used to bring down the variability due to the intrinsic differences between mutual phonetic classes in the speech material used for training of the speaker models. Combining BPC recognition with text independent speaker recognition moves a bit in the direction of text dependent speaker recognition: a task which is known to reach better performance.

The performance of BPC modelling is compared to our baseline system using ergodic 5-state HMMs.
The question which BPC contains most speaker specific information is addressed. Also, it is investigated if and how the BPC alignment is correlated with the state alignment from the baseline system to check the assumption that states of an ergodic HMM can model broad phonetic classes. Error processing SSI file