Sakasegawa, "Build Your Own Personal Assistant AI: A Simple Speech-to-Speech System with ChatVRM" (Dec 3). Hello! I’m Sakasegawa (https://x.com/gyakuse). Today, I’ll be working on this (the backend part). The frontend is powered by ChatVRM, a…
Datadrifters, "Say Hello to ‘Her’: Real-Time AI Voice Agents with 500ms Latency, Now Open Source" (Aug 17). Voice Mode is hands down one of the coolest features in ChatGPT, right?
Emad Dehnavi, "Moshi: A New Era in Spoken Dialogue AI and Real-Time Conversations" (Nov 25).
Jaimon Jacob, "Testing the Meta Spirit LM speech-to-speech generation capabilities" (Oct 21). Typically, text-to-speech pipelines involve three main steps: first, speech is transcribed using automatic speech recognition (ASR); next…
ODSC - Open Data Science, "The Evolution of GenAI Speech-to-Speech Technology: Where We’re Headed" (Oct 30). Generative artificial intelligence (AI)-powered speech-to-speech technology has evolved greatly since its inception. As such, so have the…
Laurent HUSSON, in AVIV Product & Tech Blog, "Enhancing User Experience with OpenAI’s Realtime API and Function Calling: A Seamless…" (Oct 17). As AI technology continues to evolve, the way we interact with machines is undergoing significant changes. At latest OpenAI’s DevDay, the…
Jaimon Jacob, "Meta’s Seamless4T — The right step towards video translation" (Oct 23). Meta’s Seamless4T is all about making video translation smooth and accessible. Picture this: Instead of waiting for a video to be…
Muhammad Abuelenin, "The Ultimate Guide to Types of Automatic Transcription Software" (Dec 6). Automatic transcription software has transformed how we convert audio into text, saving time and boosting productivity for professionals