An Ensemble Framework of Voice-based Emotion Recognition

Alibaba Tech
Apr 11, 2018 · 2 min read

This article is part of the Academic Alibaba series and is taken from the paper entitled “An Ensemble Framework of Voice-Based Emotion Recognition System for Films and TV Programs” by Fei Tao, Gang Liu, and Qingen Zhao, accepted by IEEE ICASSP 2018. The full paper can be read here.

The importance of emotion recognition is gaining more and more traction with improving user experience and the engagement of human-computer interfaces (HCI). Developing emotion recognition systems that are based on speech, as opposed to facial expressions, has practical application benefits due to low hardware requirements. However, these benefits are somewhat negated by real-world background noise impairing speech-based emotion recognition performance when the system is employed in practical applications.

To overcome these issues, researchers from the Alibaba tech team and The University of Texas at Dallas have developed an ensemble framework of speech-based emotion recognition that captures characteristics from audio from several aspects, including low-level utterance descriptors, high-level utterance representations, sequential acoustic frame features, and lexical information. By thoroughly capturing acoustic information in this way, the ensemble framework effectively overcomes background noise issues. A breakdown of the ensemble framework is illustrated below.

In order to evaluate this framework, the research team used movies and TV shows that have real-world sound profiles. The proposed ensemble framework outperformed state-of-the-art baselines that utilize deep learning. This achievement facilitates the employment of the emotion recognition system to practical applications.

Read the full paper here.


Alibaba Tech

First-hand and in-depth information about Alibaba’s latest technology → Search “Alibaba Tech” on Facebook

Alibaba Tech

Written by

First-hand & in-depth information about Alibaba's tech innovation in Artificial Intelligence, Big Data & Computer Engineering. Follow us on Facebook!

Welcome to a place where words matter. On Medium, smart voices and original ideas take center stage - with no ads in sight. Watch
Follow all the topics you care about, and we’ll deliver the best stories for you to your homepage and inbox. Explore
Get unlimited access to the best stories on Medium — and support writers while you’re at it. Just $5/month. Upgrade