Week 6 — Audio Emotion Recognition System (Part IV)

Ece Omurtay
bbm406f19
Published in
2 min readJan 7, 2020

Hello everyone ! This is our last blog about our machine learning project. Last week, we shared the results of methods that we use. This week, in addition to them, we share some results too.

Let’s start !

We applied CNN algorithm to our resized grayscaled amplitude images and average result is 0,35. Following image respresents the weights of the first layer..

Then, we tried another classifier to improve our accuracy. We took the mean of mfcc and “chroma_stft”, “chroma_cqt”, “chroma_cens”, “rms”, “spectral_contrast”, “spectral_bandwidth”, “tonnetz”, “zcr” features and applied Decision Tree algorithm. Accuracy is 0,35.

Then, we used XGBoost as classifier. Result of this algorithm is 0,40.

In conclusion, we worked with many algorithms to improve our model’s success. According to all results, best model is CNN with mfcc feature as input.

You can watch the video presentation from here →

Thank you for reading !

Nur Altıparmak — https://www.linkedin.com/in/nur-alt%C4%B1parmak-5722a9146/

Ece Omurtay — https://www.linkedin.com/in/ece-omurtay/

--

--