Photo by Matt Botsford on Unsplash

Effective September 30, 2020, IBM is rolling out new cloud pricing plans for the Watson Speech to Text service.

Watson Speech to Text is an API-based service available on the IBM Watson cloud platform, used in applications that require the transcription of audio into digital text form. Typical use case applications include voice-automated chatbots, customer-service call center analytics, multi-media and conference call transcription, among many others.

Pricing Update Summary


Enable better customer interaction and conversation with clear, crisp, more natural-sounding voice technology.

Photo by Oleg Laptev on Unsplash

We are pleased to announce that IBM Watson Text to Speech (TTS) service has introduced a new set of voices based on the latest neural techniques and technologies that provide a more human-sounding synthesized speech. These new voices are now generally available in both our public cloud and private cloud offerings.

Previously, speech synthesis was based on copying short speech segments from a recorded voice data set and then concatenating them together.

Today, our speech synthesis is based on the latest voice technologies in three (3) deep neural networks (DNNs), which learn various aspects of speech during the training process.


Photo Credit: _HealthyMond on Unsplash

We are pleased to announce that IBM Watson™ Speech to Text: Customer Care is now generally available (GA) on IBM® Cloud Private as of September 21, 2018. Upon this initial release, the service will support the Customer Care domain in three languages: U.S. English, Japanese and Korean.

IBM Watson Speech to Text provides speech recognition capabilities by leveraging machine learning to combine knowledge of grammar, language structure, and the composition of audio and voice signals to accurately transcribe the human voice into the written word.

But, what does it mean to now offer “IBM Watson™ Speech to Text: Customer Care…


Photo by Ben White on Unsplash

In consideration of today’s too long; didn’t read (tl;dr) mentality, my desire to tell you EVERYTHING about Watson’s Speech-to-Text Service is directly confronted.

Being a writer, I personally enjoy writing in full description and meaningful explanation. Long prose can be satisfying in a way — a somewhat cathartic baring of the subject at hand, exposed for all to read and know.

Oh, the stories I could tell about the various uses of our services, like:

  • A call center transcribing audio conversations between customers and agents to analyze common call patterns and issues.
  • A medical service provider creating an application for…

Kati Venturato

Product Manager @IBMWatson

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store