He, Qi and Huang, Ting and Zhang, Hongbo (2013) Speaker Recognition System Based on the Baseband Correlation Score Reliability Fusion. Communications and Network, 05 (03). pp. 596-600. ISSN 1949-2421
CN_2013100914330617.pdf - Published Version
Download (187kB)
Abstract
Emotion mismatch between training and testing will cause system performance decline sharply which is emotional speaker recognition. It is an important idea to solve this problem according to the emotion normalization of test speech. This method proceeds from analysis of the differences between every kind of emotional speech and neutral speech. Besides, it takes the baseband mismatch of emotional changes as the main line. At the same time, it gives the corresponding algorithm according to four technical points which are emotional expansion, emotional shield, emotional normalization and score compensation. Compared with the traditional GMM-UBM method, the recognition rate in MASC corpus and EPST corpus was increased by 3.80% and 8.81% respectively.
Item Type: | Article |
---|---|
Subjects: | Euro Archives > Computer Science |
Depositing User: | Managing Editor |
Date Deposited: | 04 Apr 2023 04:15 |
Last Modified: | 01 Jan 2024 12:26 |
URI: | http://publish7promo.com/id/eprint/841 |