Week 6 — Audio Emotion Recognition System (Part IV)

Published in

bbm406f19

2 min readJan 7, 2020

--

Hello everyone ! This is our last blog about our machine learning project. Last week, we shared the results of methods that we use. This week, in addition to them, we share some results too.

Let’s start !

We applied CNN algorithm to our resized grayscaled amplitude images and average result is 0,35. Following image respresents the weights of the first layer..

Then, we tried another classifier to improve our accuracy. We took the mean of mfcc and “chroma_stft”, “chroma_cqt”, “chroma_cens”, “rms”, “spectral_contrast”, “spectral_bandwidth”, “tonnetz”, “zcr” features and applied Decision Tree algorithm. Accuracy is 0,35.

Then, we used XGBoost as classifier. Result of this algorithm is 0,40.

In conclusion, we worked with many algorithms to improve our model’s success. According to all results, best model is CNN with mfcc feature as input.

You can watch the video presentation from here →

Thank you for reading !

Nur Altıparmak — https://www.linkedin.com/in/nur-alt%C4%B1parmak-5722a9146/

Ece Omurtay — https://www.linkedin.com/in/ece-omurtay/

Previous posts:

Week 1 — https://medium.com/bbm406f19/week-1-introduction-557d0143e753

Week 2 — https://medium.com/bbm406f19/week-2-data-analysis-687ec86c0a71

Week 3 — https://medium.com/bbm406f19/week-3-audio-emotion-recognition-system-1e37a4991a48

Week 4 — https://medium.com/bbm406f19/week-4-audio-emotion-recognition-system-part-ii-ba8f24704db5

Week 5 — https://medium.com/bbm406f19/week-5-audio-emotion-recognition-system-part-iii-a4f6ee87f458

Machine Learning

Emotion Recognition

Ece Omurtay

Written by Ece Omurtay

Writer for

bbm406f19

CS Student at Hacettepe University

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams