Have a personal or library account? Click to login
High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition Cover

High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition

By: Jiang Zhipeng and  Huang Chengwei  
Open Access
|Nov 2015

Abstract

In this paper we study the cross-language speech emotion recognition using high-order Markov random fields, especially the application in Vietnamese speech emotion recognition. First, we extract the basic speech features including pitch frequency, formant frequency and short-term intensity. Based on the low level descriptor we further construct the statistic features including maximum, minimum, mean and standard deviation. Second, we adopt the high-order Markov random fields (MRF) to optimize the cross-language speech emotion model. The dimensional restrictions may be modeled by MRF. Third, based on the Vietnamese and Chinese database we analyze the efficiency of our emotion recognition system. We adopt the dimensional emotion model (arousal-valence) to verify the efficiency of MRF configuration method. The experimental results show that the high-order Markov random fields can improve the dimensional emotion recognition in the cross-language experiments, and the configuration method shows promising robustness over different languages.

DOI: https://doi.org/10.1515/cait-2015-0054 | Journal eISSN: 1314-4081 | Journal ISSN: 1311-9702
Language: English
Page range: 50 - 57
Published on: Nov 26, 2015
Published by: Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2015 Jiang Zhipeng, Huang Chengwei, published by Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.