Why use Timed Text Speech

Fast

Get results in minutes instead of days, save time by reviewing results online while the speech engine continues processing

Easy to use

Just upload a media file, we transcribe the audio and return timed text in a variety of formats

Cost effective

Priced per minute of transcribed media, pay only for what you use

International

Supports multiple languages including Arabic, Brazilian Portuguese, English (UK and US), French, Japanese, Chinese, Spanish (Castilian, Latin American, and North American)

Highly accurate transcription

Improve results using custom vocabularies (names, acronyms, places, specialized terms)

Integrates with existing tools

Transcribe directly from within Stanza

Timed Text Speech Cloud Video Transcription

Who should use Timed Text Speech?

Content creators in need to produce captions and subtitles for time-sensitive content that must be aired on TV or the Internet with short turnaround times, as well as short-form content such as promos that require captioning.

Captioning service companies looking for time-efficient solution to speed up their service with cost-effective and easy-to-use automated transcription option.

Corporations, government organizations and educational institutions wishing to bring captioning in-house.

How it works

Timed Text Speech uses the latest Machine Learning (ML) based technology to accurately transcribe human speech (extracted from a video) into time stamped text.

The transcript can then be edited and formatted for a variety of caption and subtitle standards.

Using the Telestream Cloud console:

Create a transcription project, add optional vocabulary terms and select a timed text file format (SRT, JSON, CSV)
Upload a video file to the project
Telestream Cloud extracts the audio and the transcribed results appear in the web console
Review and edit the transcription
Download the timed text file

Standard pricing for Timed Text Speech transcriptions

Timed Text Speech

Instantly create highly accurate captions and subtitles with Machine Learning based speech to text technology.

Transcribe the audio and return timed text in a variety of formats (SRT, JSON, CSV, TXT)
Priced per minute of transcribed media, pay only for what you use
Multiple languages including Arabic, Brazilian Portuguese, English (UK and US), French, Japanese, Chinese, Spanish (Castilian, Latin American, and North American)
Transcribe directly from within Stanza

Pay As You Go Plan

Activate the service now and pay as you go for only what you use. No activation charge, and no hidden costs outside actual usage.

Managed solution that scales seamlesly while eliminating infrastructure costs and need for maintenance work on your side. State-of-the art infrastructure from leading Cloud providers ensures rock solid performance for any processing volume.

$200/month minimum with credit card. $2,400 minimum up front for the year if requesting invoice.

$0.10
per billable minute

Enterprise

In addition to convenient pay-as-you-go options, Telestream offers enterprise plans that can be tailored to your specific needs. Enterprise plans include:

Significant discounts for volume & term commitments
API & console training
Onboarding & integration services
Flexible payment & billing options

For more information:

A Telestream Cloud account is required to access Timed Text Speech

Telestream Cloud Pricing Calculator

How are minutes calculated?

Multipliers are applied based on whether a proxy is needed and whether a custom vocabulary is required to perform the transcription.

Type	Multiplier	Example
Standard Transcription	–	–
Standard Transcription with proxy video	1.4x	10 minutes of standard transcription with proxy video count as 14 minutes
Custom Vocabulary Transcription	2x	10 minutes of transcription with custom vocabulary counts as 20 minutes
Custom Vocabulary Transcription with proxy video	2.4x	10 minutes of custom vocabulary transcription with proxy video count as 24 minutes