Caption and Transcript File Types

latakoo support
2 min readAug 14, 2017

--

Explore the types of files available with latakoo’s integrated service

With latakoo, you can upload almost any video file type and receive a transcript or caption of that file returned to you quickly for publishing or editing. Here’s a list of supported file types.

SRT

This is the most common subtitle/caption file format, especially for YouTube or Facebook captioning. It is a text format that originated in the DVD-ripping software SubRip and stands for “SubRip Subtitle” file.

Transcript

This format is a text file that can be imported into a text editor and converted to a Word document. Although this file is downloaded as .transcript, it can be renamed to a .txt.

DFXP

This is the most common format used for captioning Flash video. It’s a timed-text format that was developed by W3C and stands for “Distribution Format Exchange Profile.” It is also supported by YouTube.

DFXP files are xml based closed caption files. They generally contain formatting information like font color and alignment, among others. One DFXP file can contain captions in multiple languages. The viewer will show either the language that it normally displays by default, or whatever language the user chooses (if the file contains that language.)

DFXP is certainly a much less common format than .srt files, but most professional closed caption creators can export them.

ECHO

This file type provides integration with a specific active learning platform.

QT

Quicktime timed text — recommended for QuickTime players only.

SAMI

Used for Windows Media video or audio. It was developed by Microsoft and stands for “Synchronized Accessible Media Interchange.”

SBV

This is a YouTube caption file format that stands for “SubViewer.” It’s what you get when you download captions from YouTube. It’s a text format that is very similar to SRT.

SCC

Popular standard used for Line 21 broadcast closed captions, web media, DVD, as well as subtitles for iTunes, iPods, iPads, and iPhones. It was originally developed by Sonic and stands for “Scenarist Closed Caption.” Can also be imported into some editors, such as Adobe Premiere to produce videos with embedded captions.

TPM

Contains metadata that describes how to transform a UML (Unified Modeling Language) file into EGL (Enterprise Generation Language) code; enables graphical modeling of complex applications with IBM Rational software.

TWX

The TWX file is a zip file. You can use any unzipping software to view the contents. This exercise is for diagnostic purposes to find where assets are or where a specific JavaScript line is used.

WEB_VTT

This caption format is created for HTML5 media players.

--

--