Hum a Fingerprint, Extract a Melody — Dogac Basaran, CNRS — Voice Tech Podcast ep.009

Carl Robinson
Published in Voice Tech Podcast
Sep 2, 2018 · 2 min read

Full episode:
https://voicetechpodcast.com/episodes/hum-a-fingerprint-extract-a-melody-dogac-basaran-cnrs-voice-tech-podcast-ep-009/

Episode description:
This is the second part of my conversation with Dogac Basaran, a post-doctoral researcher at CNRS, the French national scientific research centre. If you missed the first part, you might want to go back and listen to the previous episode on Signal Processing Basics for Audio.

Today, in part 2 of 2, we explore Dogac’s research into audio fingerprinting, alignment, and melody extraction. Dogac shows us how analysing the magnitude of frequency peaks and their relative spacing makes it possible to create audio fingerprints that can detect and match audio recordings, even when those recordings are noisy or incomplete. These fingerprints have a variety of uses, including aligning multiple recordings of a single speaker or performance and identifying a particular recording.
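
To make the idea of peak-based fingerprints concrete, here is a minimal sketch in Python. It is not Dogac's method or any product's algorithm. It assumes a generic "peak pair" scheme: take the strongest frequency bins in each STFT frame, then hash pairs of nearby peaks by their two frequencies and the time gap between them. The function names (spectral_peaks, fingerprint, match_score, tone_sequence) and all parameter values are made up for illustration. Only scipy.signal.stft, linked in the show notes below, is a real library call.

```python
import numpy as np
from scipy.signal import stft


def spectral_peaks(samples, sample_rate, peaks_per_frame=3, nperseg=1024):
    """Return (frame_index, frequency_bin) pairs for the strongest bins of each frame."""
    _, _, spectrum = stft(samples, fs=sample_rate, nperseg=nperseg)
    magnitude = np.abs(spectrum)  # shape: (frequency bins, time frames)
    peaks = []
    for frame in range(magnitude.shape[1]):
        strongest = np.argsort(magnitude[:, frame])[-peaks_per_frame:]
        peaks.extend((frame, int(freq_bin)) for freq_bin in strongest)
    return peaks


def fingerprint(samples, sample_rate, fan_out=5, max_frame_gap=20):
    """Hash pairs of peaks as (freq_bin_1, freq_bin_2, frame_gap).

    Only the gap between the two peaks is stored, not their absolute
    position, so an excerpt can share hashes with the full recording.
    """
    peaks = sorted(spectral_peaks(samples, sample_rate))
    hashes = set()
    for i, (t1, f1) in enumerate(peaks):
        for t2, f2 in peaks[i + 1 : i + 1 + fan_out]:
            if 0 < t2 - t1 <= max_frame_gap:
                hashes.add((f1, f2, t2 - t1))
    return hashes


def match_score(reference_hashes, query_hashes):
    """Fraction of the query's hashes that also occur in the reference."""
    return len(reference_hashes & query_hashes) / max(len(query_hashes), 1)


def tone_sequence(frequencies_hz, sample_rate, seconds_per_tone=0.5):
    """Synthetic test signal: a sequence of pure tones (illustration only)."""
    t = np.arange(int(sample_rate * seconds_per_tone)) / sample_rate
    return np.concatenate([np.sin(2 * np.pi * f * t) for f in frequencies_hz])
```

To try it, fingerprint a clean tone_sequence, add a little random noise to a copy, and compare the two with match_score. The noisy copy should still share most of its hashes with the clean one, while a different sequence of tones should share few or none. Real systems add much more, such as smarter peak picking, time-offset voting, and a hash index.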

We also discuss query by humming, a state-of-the-art technique that takes an audio fingerprint of a person humming a melody and matches it against a database of music recordings. Dogac also explains why learning to build neural networks has become an essential skill in this field.
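
As a rough illustration of the matching step in query by humming, the sketch below swaps in a deliberately simple technique that is not taken from the episode. It assumes the hummed audio has already been converted to a sequence of note pitches (MIDI note numbers), turns each melody into pitch intervals so that the key the person hums in does not matter, and compares interval sequences with dynamic time warping (DTW) so that repeated or held notes are tolerated. The function names and the tiny two-song database are hypothetical.

```python
import numpy as np


def pitch_intervals(midi_pitches):
    """Differences between consecutive notes, in semitones (key-invariant)."""
    return np.diff(np.asarray(midi_pitches, dtype=float))


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two 1-D sequences."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i, j] = step + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]


def best_match(hummed_pitches, database):
    """Return the title whose melody intervals are closest to the hummed query."""
    query = pitch_intervals(hummed_pitches)
    return min(
        database,
        key=lambda title: dtw_distance(query, pitch_intervals(database[title])),
    )


# Hypothetical database: title -> melody as MIDI note numbers.
melodies = {
    "song_a": [60, 62, 64, 65, 67, 65, 64, 62],
    "song_b": [60, 60, 67, 67, 69, 69, 67],
}

# "song_b" hummed three semitones higher, with one note repeated.
hummed = [63, 63, 70, 70, 72, 72, 72, 70]
closest_title = best_match(hummed, melodies)
```

Here closest_title ends up as "song_b", because the hummed intervals align with that melody at zero DTW cost despite the transposition and the extra note. A practical system would also need pitch tracking on the raw audio and an index that scales beyond a pairwise search.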

Links from the show:
Full show notes: http://bit.ly/voicetechpodcast
Dogac Basaran on GitHub: https://github.com/dogacbasaran
Dogac Basaran’s websites: https://dbasaran.wp.imt.fr/ and http://dogacbasaran.com/
Signal Processing MOOC on Coursera: https://www.coursera.org/learn/dsp
MATLAB: https://matlab.mathworks.com/
Python SciPy STFT function (scipy.signal.stft): https://docs.scipy.org/doc/scipy/reference/generated/scipy.signal.stft.html
Humming databases: Jang, ThinkIT, IOSCAS and Task2
ISMIR association: http://www.ismir.net/
ISMIR 2018 conference: http://ismir2018.ircam.fr/
CNRS: http://www.cnrs.fr/
IRCAM: https://www.ircam.fr/
MIREX 2018: http://www.music-ir.org/mirex/wiki/2018:Main_Page
ACR Cloud Console: http://console.acrcloud.com
SoundHound’s Houndify service: https://www.soundhound.com/houndify
MusixMatch: https://www.musixmatch.com
Query by Humming article 1: https://www.acrcloud.com/blog/what-is-query-by-humming
Query by Humming article 2: https://en.wikipedia.org/wiki/Query_by_humming

Subscribe to get future episodes:
Apple iTunes : https://apple.co/2LqW4ol
Google Podcasts : http://bit.ly/voicetechpodcast-google
Google Android : http://bit.ly/voicetechpodcast-android
Stitcher : http://bit.ly/voicetechpodcast-stitcher
Spotify : https://spoti.fi/2IZr5hm
Alexa : https://amzn.to/2mr8mCj
Website : http://bit.ly/voicetechpodcast

Join the discussion:
Newsletter : http://bit.ly/voicetechpodcast-newsletter
Reddit : http://bit.ly/voicetechpodcast-reddit
Facebook group : http://bit.ly/voicetechpodcast-facebook-group
Facebook page : http://bit.ly/voicetechpodcast-facebook-page
Follow on Twitter : http://bit.ly/voicetechpodcast-twitter
Email me : carl@voicetechpodcast.com

Support the Voice Tech Podcast:
Tell a friend about us or share on social media!
Leave a 5 star review on iTunes: https://apple.co/2LqW4ol
Leave a 5 star review on Stitcher: http://bit.ly/voicetechpodcast-stitcher
Become a patron at: http://bit.ly/voicetechpodcast-patreon
