Accepted Sources

To initiate a Summarization job or an Extraction job, you must specify the source of your input data.

The data source can have different types, which are described in the following table:

🚧

Some sources could require additional parameters, which are specified in the table below in the Additional parameters column.

SourceDescriptionAdditional parameters
genericGeneric-format .txt and JSON transcripts
audioSupported audio file formats include flac, m4a, mp3, mpga, ogg, and wav (keep it <100Mb)
wordcab_transcriptWordcab transcripttranscript_id
signed_urlAWS or GCP signed URLs pointing to an audio filesigned_url
assembly_aiUse the JSON array from AssemblyAI's utterances object as input
deepgramUse the JSON array from Deepgram's utterances object as input
rev_aiUse the JSON array from Rev.ai's monologues object as input
vttEither a .vtt file or raw vtt text in the request body
otterOtter.ai transcript
firefliesFireflies.ai transcript
sonixSonix.ai transcript
descriptDescript transcript

Generic

The generic source is the most basic source type. It could be either a .txt file or a .json file.

Text file

Each line of the .txt file represents a single utterance with the following format:

[<start_time> --> <end_time>] <speaker>: <utterance>

Example:

[00:01:23 --> 00:01:25] Joe: This is an example of a generic transcript.
[00:01:26 --> 00:01:27] Jill: Oh wow!

📘

Timestamps are optional and can be placed anywhere on the line.

JSON file

The .json file must be a valid JSON array of objects. It should include a transcript key with a list of strings as value. Each element of the list should be a speaker utterance.

Example:

{
  "transcript": [
    	"Joe: This is an example of a generic transcript.",
    	"Jill: Oh wow!"
}

Audio

The audio source is used to specify an audio file as input data. As mentioned in the table above, supported audio file formats are .mp3, .wav, .flac, .ogg, mpga and .m4a.

Wordcab Transcript

We defined what is a Wordcab Transcript in the API concepts section (see Transcripts).

If you have a Wordcab Transcript, you can use it as input data for a Summarization job or an Extraction job by adding the transcript_id parameter to the request.

Signed URL

The signed_url source is used to specify a signed URL as input data. Signed URLs are helpful when you want to use a file stored in a private bucket or server. The file must be accessible via a signed URL from AWS (see AWS Signed URLs) or Google Cloud Storage (see GCP Signed URLs).

AssemblyAI

AssemblyAI provides a transcription service that can be used as a source for your transcript by providing AssemblyAI utterances (see Core Transcription).

Deepgram

Deepgram provides a transcription service that can be used as a source for your transcript by providing Deepgram utterances (see Utterances).

Rev.ai

Rev.ai provides a transcription service that can be used as a source for your transcript by providing Rev.ai monologues (see Get Started).

VTT

The vtt source is used to specify a .vtt file or a raw .vtt string as input data.

The .vtt file must be a valid WebVTT file. You can use this WebVTT validator before initiating a job.

Otter.ai

Otter.ai provides a transcription service that can be used as a source.

Learn more about Otter.ai proper formatting in our blog post: Otter.ai Transcripts in Wordcab.

Fireflies.ai

Fireflies.ai provides a transcription service that can be used as a source.

Learn more about Fireflies.ai proper formatting in our blog post: Fireflies.ai Transcripts in Wordcab.

Sonix.ai

Sonix.ai provides a transcription service that can be used as a source.

Learn more about Sonix.ai proper formatting in our blog post: Sonix.ai Transcripts in Wordcab.

Descript

Descript provides a transcription service that can be used as a source.

Learn more about Descript proper formatting in our blog post: Descript Transcripts in Wordcab


What’s Next

Let's check Job Status.