Ebad Sayed | The AI Orchestra: Generating Melodies with LSTM Models Part 1 | Music, a timeless art form steeped in creativity and theory, is now encountering a transformative partner: Artificial Intelligence. In this… | 2d ago
Jeremy Savage | Using OpenAI’s Whisper to Transcribe Real-time Audio | The availability of advanced technology and tools, in particular, AI is increasing at an ever-rapid rate, I am going to see just how easy… | Apr 12
Ebad Sayed | The AI Orchestra: Generating Melodies with LSTM Models Part 2 | In the previous article, we learned about music theory. In this article, we will see how to preprocess songs to train a neural network and… | 2d ago
tttzof351 | Build text-to-speech from scratch. | In the series of small articles, we will write step-by-step a toy text-to-speech model. It will be a simple model with a modest goal: to… | Aug 2, 2023
Alexey N. | Word Detection in Audio Using STFT | Investigation of STFT coefficients usage for pattern detection | Jul 9
Elena Mascareñas García | DeepFake Audio: Detecting AI-generated speech with Machine Learning Models using MFCCs | By Dror Arbiv & Elena Mascareñas García | Jul 7
Jaimon Jacob | Removing background noise from speech using SpeechBrain models | SpeechBrain is an open-source, all-in-one toolkit designed for speech processing. Built on PyTorch, it offers a comprehensive suite of… | May 17