Please use this identifier to cite or link to this item:
https://hdl.handle.net/1959.11/15313
Title: | Comparative Study of Several Novel Acoustic Features for Speaker Recognition | Contributor(s): | Pervouchine, Vladimir (author); Leedham, Graham (author); Zhong, Haishan (author); Cho, David (author); Li, Haizhou (author) | Publication Date: | 2008 | Handle Link: | https://hdl.handle.net/1959.11/15313 | Abstract: | Finding good features that represent speaker identity is an important problem in speaker recognition area. Recently a number of new and novel acoustic features have been proposed for speaker recognition. The researchers use different data sets and sometimes different classifiers to evaluate the features and compare them to the baselines such as MFCC or LPCC. However, due to different experimental conditions direct comparison of those features to each other is difficult or impossible. This paper presents a study of five new acoustic features recently proposed. The feature extraction has been performed on the same data (NIST~2001~SRE), and the same UBM-GMM classifier has been used. The results are presented as DET curves with equal error ratios indicated. Also, an SVM-based combination of GMM scores produced on different features has been made in hope that classifier fusion can result in higher speaker recognition accuracy. The results for different features as well as for their combinations are directly comparable to each other and to those obtained with the baseline MFCC features. | Publication Type: | Conference Publication | Conference Details: | BIOSIGNALS 2008: International Conference on Bio-inspired Systems and Signal Processing, Funchal, Portugal, 28th - 31st January, 2008 | Source of Publication: | Proceedings of the First International Conference on Biomedical Electronics and Devices (BIOSIGNALS 2008), v.1, p. 220-223 | Publisher: | Institute for Systems and Technologies of Information, Control and Communication (INSTICC) | Place of Publication: | United States of America | Fields of Research (FoR) 2008: | 080106 Image Processing 080104 Computer Vision 080109 Pattern Recognition and Data Mining |
Socio-Economic Objective (SEO) 2008: | 810199 Defence not elsewhere classified 890299 Computer Software and Services not elsewhere classified 810107 National Security |
HERDC Category Description: | E2 Non-Refereed Scholarly Conference Publication | Publisher/associated links: | http://www.biosignals.biostec.org/Abstracts/2008/BIOSIGNALS_2008_Abstracts.htm |
---|---|
Appears in Collections: | Conference Publication |
Files in This Item:
File | Description | Size | Format |
---|
Page view(s)
1,172
checked on Apr 7, 2024
Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.