The most insightful stories about Audio Processing - Medium

Audio Processing

Topic · 30 Followers · 159 Stories

Recommended stories

Ebad Sayed

The AI Orchestra: Generating Melodies with LSTM Models Part 1

Music, a timeless art form steeped in creativity and theory, is now encountering a transformative partner: Artificial Intelligence. In this…

2d ago

Jeremy Savage

Using OpenAI’s Whisper to Transcribe Real-time Audio

The availability of advanced technology and tools, AI in particular, is increasing at an ever more rapid rate. I am going to see just how easy…

Apr 12

Ebad Sayed

The AI Orchestra: Generating Melodies with LSTM Models Part 2

In the previous article, we learned about music theory. In this article, we will see how to preprocess songs to train a neural network and…

2d ago

tttzof351

Build text-to-speech from scratch.

In this series of short articles, we will write a toy text-to-speech model step by step. It will be a simple model with a modest goal: to…

Aug 2, 2023

Alexey N.

Word Detection in Audio Using STFT

Investigation of STFT coefficients usage for pattern detection

Jul 9

Apparao Mulpuri

Speaker Diarization in Python: A Step-by-Step Guide

Introduction

Nov 28, 2023

Elena Mascareñas García

DeepFake Audio: Detecting AI-generated speech with Machine Learning Models using MFCCs

By Dror Arbiv & Elena Mascareñas García

Jul 7

Jaimon Jacob

Removing background noise from speech using SpeechBrain models

SpeechBrain is an open-source, all-in-one toolkit designed for speech processing. Built on PyTorch, it offers a comprehensive suite of…

May 17
