Comparative Study of Several Novel Acoustic Features for Speaker Recognition

Title
Comparative Study of Several Novel Acoustic Features for Speaker Recognition
Publication Date
2008
Author(s)
Pervouchine, Vladimir
Leedham, Graham
Zhong, Haishan
Cho, David
Li, Haizhou
Editor
Editor(s): Pedro Encarnacao, Antonio Veloso
Type of document
Conference Publication
Language
en
Entity Type
Publication
Publisher
Institute for Systems and Technologies of Information, Control and Communication (INSTICC)
Place of publication
United States of America
UNE publication id
une:15529
Abstract
Finding good features that represent speaker identity is an important problem in speaker recognition area. Recently a number of new and novel acoustic features have been proposed for speaker recognition. The researchers use different data sets and sometimes different classifiers to evaluate the features and compare them to the baselines such as MFCC or LPCC. However, due to different experimental conditions direct comparison of those features to each other is difficult or impossible. This paper presents a study of five new acoustic features recently proposed. The feature extraction has been performed on the same data (NIST~2001~SRE), and the same UBM-GMM classifier has been used. The results are presented as DET curves with equal error ratios indicated. Also, an SVM-based combination of GMM scores produced on different features has been made in hope that classifier fusion can result in higher speaker recognition accuracy. The results for different features as well as for their combinations are directly comparable to each other and to those obtained with the baseline MFCC features.
Link
Citation
Proceedings of the First International Conference on Biomedical Electronics and Devices (BIOSIGNALS 2008), v.1, p. 220-223
ISBN
9789898111180
Start page
220
End page
223

Files:

NameSizeformatDescriptionLink