Combining Novel Acoustic Features using SVM to Detect Speaker Changing Points

Zhong, Haishan; Cho, David; Pervouchine, Vladimir; Leedham, Graham

Author(s)

Zhong, Haishan

Cho, David

Pervouchine, Vladimir

Leedham, Graham

Publication Date

2008

Abstract

Automatic speaker change point detection segments different speakers from continuous speech according to speaker characteristics. This is often a necessary step before applying speaker verification or identification systems. Among the features to represent a speaker in the speaker change point detection systems acoustic features are commonly used. Commonly used features are Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC). However, the features are affected by speech content, environment, type of recording device, etc. So far, no features have been discovered, which values depend only on the speaker. In this paper four novel feature types proposed in recent major journals and conference papers for speaker verification problem, are applied to the problem of speaker change point detection. The features are also used to form a combination scheme via SVM classifier. The results shows that the proposed scheme improves the performance of speaker changing point detection as compared to the system that uses MFCC features. It was also found that some of the novel features of low dimensionality give comparable speaker change point detection accuracy to the high-dimensional MFCC features.

Citation

Proceedings of the First International Conference on Biomedical Electronics and Devices (BIOSIGNALS 2008), v.1, p. 224-227

ISBN

9789898111180

Link

https://hdl.handle.net/1959.11/15314

Language

en

Publisher

Institute for Systems and Technologies of Information, Control and Communication (INSTICC)

Title

Combining Novel Acoustic Features using SVM to Detect Speaker Changing Points

Type of document

Conference Publication

Entity Type

Publication

Author(s)	Zhong, Haishan Cho, David Pervouchine, Vladimir Leedham, Graham
Publication Date	2008
Abstract	Automatic speaker change point detection segments different speakers from continuous speech according to speaker characteristics. This is often a necessary step before applying speaker verification or identification systems. Among the features to represent a speaker in the speaker change point detection systems acoustic features are commonly used. Commonly used features are Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC). However, the features are affected by speech content, environment, type of recording device, etc. So far, no features have been discovered, which values depend only on the speaker. In this paper four novel feature types proposed in recent major journals and conference papers for speaker verification problem, are applied to the problem of speaker change point detection. The features are also used to form a combination scheme via SVM classifier. The results shows that the proposed scheme improves the performance of speaker changing point detection as compared to the system that uses MFCC features. It was also found that some of the novel features of low dimensionality give comparable speaker change point detection accuracy to the high-dimensional MFCC features.
Citation	Proceedings of the First International Conference on Biomedical Electronics and Devices (BIOSIGNALS 2008), v.1, p. 224-227
ISBN	9789898111180
Link	https://hdl.handle.net/1959.11/15314
Language	en
Publisher	Institute for Systems and Technologies of Information, Control and Communication (INSTICC)
Title	Combining Novel Acoustic Features using SVM to Detect Speaker Changing Points
Type of document	Conference Publication
Entity Type	Publication

Combining Novel Acoustic Features using SVM to Detect Speaker Changing Points

Files: