Alibaba TechI See What You’re Saying: From Audio-only to Audio-visual Speech RecognitionThis article is part of the Academic Alibaba series and is taken from the ICASSP paper entitled “Robust Audio-visual Speech Recognition…Apr 25, 2019Apr 25, 2019
Alibaba TechHearing the Character in Things: Alibaba Improves Mandarin Speech RecognitionThis article is part of the Academic Alibaba series and is taken from the ICASSP 2019 paper entitled “Investigation of Modeling Units for…May 16, 20191May 16, 20191
KoreUnderstanding speech recognition to design better Voice interfacesWhen designing a good voice user interface, it is always advantageous to know how the technology works.May 25, 2018May 25, 2018
Sara RobinsonMaking audio searchable with Cloud SpeechLast month Cloud Speech introduced a new word-level timestamps feature: audio transcriptions now include the start and end timestamp for…Sep 26, 20176Sep 26, 20176
Sara RobinsonSpeech to text transcription in 40 lines of BashEver wanted to build an app that takes audio input from users? There are tons of benefits for integrating audio into an app— from simply…Jul 11, 20177Jul 11, 20177
Senko RašićHear no evilVoice-controlled AI assistants are advanced enough to be dangerousJan 7, 2017Jan 7, 2017
Intelligent VoiceWe live in the Big Cloud: And we hate it… Is it time for Hipster IT?Nigel Cannings, CTO Intelligent VoiceDec 1, 2016Dec 1, 2016