Hum a Fingerprint, Extract a Melody — Dogac Basaran, CNRS — Voice Tech Podcast ep.009
Episode description:
This is the second part of my conversation with Dogac Basaran, a post-doctoral researcher at CNRS, the French national scientific research centre. If you missed the first part, you might want to go back and listen to the previous episode on Signal Processing Basics for Audio.
Today, in part 2 of 2, we explore Dogac’s research into audio fingerprinting, alignment, and melody extraction. By analysing the magnitude of frequency peaks and their relative spacing, Dogac shows us how it’s possible to create audio fingerprints that can be used to detect and match audio recordings, even if they contain noise or are incomplete. These fingerprints have a variety of uses, including aligning multiple recordings of a single speaker/performance, and identifying a particular recording.
We also discuss query by humming, the state-of-the-art technique that takes an audio fingerprint of a person humming a melody, and matches it to a database of music recordings. Dogac also explains why learning how to build neural networks has become an essential skill in this field.
Links from the show:
Full show notes : http://bit.ly/voicetechpodcast
Dogac Basaran on Github: https://github.com/dogacbasaran
Dogac Basaran’s websites: https://dbasaran.wp.imt.fr/ and http://dogacbasaran.com/
Signal Processing MOOC on Coursera: https://www.coursera.org/learn/dsp
MATLAB: https://matlab.mathworks.com/
Python Scipy STFT package: https://docs.scipy.org/doc/scipy/reference/generated/scipy.signal.stft.html
Humming databases: Jang, ThinkIT, IOSCAS and Task2
ISMIR association: http://www.ismir.net/
ISMIR 2018 conference: http://ismir2018.ircam.fr/
CNRS: http://www.cnrs.fr/
IRCAM: https://www.ircam.fr/
MIREX 2018: http://www.music-ir.org/mirex/wiki/2018:Main_Page
ACR Cloud Console: http://console.acrcloud.com
SoundHound’s Houndify service: https://www.soundhound.com/houndify
MusixMatch: https://www.musixmatch.com
Query by Humming article 1: https://www.acrcloud.com/blog/what-is-query-by-humming
Query by Humming article 2: https://en.wikipedia.org/wiki/Query_by_humming
Subscribe to get future episodes:
Apple iTunes : https://apple.co/2LqW4ol
Google Podcasts : http://bit.ly/voicetechpodcast-google
Google Android : http://bit.ly/voicetechpodcast-android
Stitcher : http://bit.ly/voicetechpodcast-stitcher
Spotify : https://spoti.fi/2IZr5hm
Alexa : https://amzn.to/2mr8mCj
Website : http://bit.ly/voicetechpodcast
Join the discussion:
Newsletter : http://bit.ly/voicetechpodcast-newsletter
Reddit : http://bit.ly/voicetechpodcast-reddit
Facebook group : http://bit.ly/voicetechpodcast-facebook-group
Facebook page : http://bit.ly/voicetechpodcast-facebook-page
Follow on Twitter : http://bit.ly/voicetechpodcast-twitter
Email me : carl@voicetechpodcast.com
Support the Voice Tech Podcast:
Tell a friend about us or share on social media!
Leave a 5 star review on iTunes: https://apple.co/2LqW4ol
Leave a 5 star review on Stitcher: http://bit.ly/voicetechpodcast-stitcher
Become a patron at: http://bit.ly/voicetechpodcast-patreon