The Best Free Way to Automatically Transcribe Audio and Video Files

SpeechText AI
Published in SpeechText.AI
3 min read, Jun 15, 2020

Transcribing audio and video files is a great way to improve SEO results and earn strong Google Search positions for your website. Transcripts help your content rank higher in several ways. Google can crawl the transcript text to understand the important points in your files, so if you provide transcripts for all your audio and video files, you help the search engine find and rank your content properly. Transcripts also give website visitors more content to consume. For example, a video transcript helps when a user needs to read the information in a quiet environment.

But transcribing audio or video by hand is a tedious and exhausting process. In this article, we will show you how to convert audio and video into text using automatic transcription software with close-to-human accuracy.

Tutorial: How to Transcribe Audio and Video Files

1. Create an account on SpeechText.AI (it’s free)

The registration process is simple and quick. SpeechText.AI offers a free trial plan for all new users. No credit card required. To sign up, you will need an active e-mail account to verify that you are not a robot.

2. On the user dashboard homepage, you’ll see the area where you can upload audio or video files. We support different file formats: MP3, WAV, OGG, M4A, AVI, MP4, FLV, MOV, etc.
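If you have a folder full of recordings, it can help to check the file extensions before uploading. The snippet below is a minimal sketch that only compares extensions against the formats named in this step. The set and the function names are illustrative assumptions, not part of SpeechText.AI, and since the list above ends with "etc.", a file that fails this check may still be accepted by the service.

```python
from pathlib import Path

# Formats named in this article. The article's list ends with "etc.",
# so this set is illustrative and not a complete list of what the service accepts.
LISTED_EXTENSIONS = {".mp3", ".wav", ".ogg", ".m4a", ".avi", ".mp4", ".flv", ".mov"}


def is_listed_format(filename: str) -> bool:
    """Return True if the file extension is one of the formats listed in the article."""
    return Path(filename).suffix.lower() in LISTED_EXTENSIONS


def split_by_format(filenames):
    """Split file names into (listed, not_listed) based on their extension."""
    listed = [name for name in filenames if is_listed_format(name)]
    not_listed = [name for name in filenames if not is_listed_format(name)]
    return listed, not_listed


if __name__ == "__main__":
    ready, check_first = split_by_format(["interview.MP3", "webinar.mov", "notes.txt"])
    print("Ready to upload:", ready)
    print("Check the format first:", check_first)
```

The comparison is case-insensitive, so `interview.MP3` and `interview.mp3` are treated the same way.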

Let’s see how the transcription engine works on this video (spoiler: Rebel Wilson’s joke about the Best Director category was golden!):

3. Select the transcription language, industry domain, and file type

To improve the quality of automatic speech recognition, select the correct industry domain and file type. The SpeechText.AI service uses machine learning algorithms and speech recognition technology to convert audio data into text. To handle domain-specific terminology, the service applies domain-optimized artificial intelligence models when it transcribes your audio and video files, which improves accuracy. If you don’t know which options are relevant for your files, select the ‘General’ domain and type.

4. Hit the ‘Transcribe’ button and wait while we transcribe your files. Transcription usually takes about half the duration of the file.
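Step 4 gives a rule of thumb: processing time is roughly half the length of the recording. As a rough worked example of that arithmetic, the sketch below estimates the wait for a given duration. The 0.5 ratio is taken from the statement above, the function names are hypothetical, and the actual time may vary.

```python
def estimated_transcription_seconds(duration_seconds: float, ratio: float = 0.5) -> float:
    """Estimate processing time from the rule of thumb in step 4.

    ratio=0.5 reflects "about half of the file length"; it is an
    approximation from the article, not a guaranteed figure.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    return duration_seconds * ratio


def format_mm_ss(seconds: float) -> str:
    """Format a number of seconds as minutes:seconds."""
    minutes, secs = divmod(round(seconds), 60)
    return f"{minutes}:{secs:02d}"


if __name__ == "__main__":
    # A 10-minute recording (600 seconds) should take roughly 5 minutes.
    wait = estimated_transcription_seconds(600)
    print("Estimated wait:", format_mm_ss(wait))
```

By this estimate, a one-hour recording would be ready in about 30 minutes.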

5. Proofread and correct automatic transcription results

No software can deliver 100% accurate speech recognition. Even the best human transcriber who is a native speaker of your language would have a hard time getting close to 100%. That’s why we’ve created a proofreading interface that helps users edit and correct speech recognition results in real time.

6. Export transcription results in the format of your choice

Click the ‘Download’ icon and save the transcript in one of the available formats (txt, pdf, docx, etc.).

Let’s see the final result. Here is the transcript of the original video, automatically generated and then edited with the SpeechText.AI service. It cost me $0 and took about 5 minutes to create!
