Get Caption

Get the caption file for a job. The job must have completed transcription before a caption can be downloaded.

HTTP Method

GET /api/job/get_caption

Query String Parameters — Required

Name Details
v Description The version of the API to use
Allowed Values 1
Example v=1
api_token Description The API token used for this session
Allowed Values Hex String
Example api_token=7ca5dc5c7cce449fb0fff719307e8f5f
job_id Description The ID of the job
Allowed Values Hex String
Example job_id=64bea283eff6475ea6596027a6ba0929
caption_format Description The format of the caption file to return
Allowed Values Caption Format Enumeration
Example caption_format=SRT

Query String Parameters — Optional

Name Details
build_url
Rather than returning the file, return a permanent URL to the
file.
Allowed Values Boolean
Default Value false
Example build_url=true
caption_words_min
Minimum number of words allowed in a caption.
Allowed Values Integer
Default Value 1
Example caption_words_min=3
caption_by_sentence
When true, puts each sentence into its own caption. When
false, more than one sentence may appear in a single
caption.
Allowed Values Boolean
Default Value true
Example caption_by_sentence=false
characters_per_caption_line
Maximum number of characters to be displayed on each
caption line.
Allowed Values Integer
Default Value 42
Example characters_per_caption_line=30
dfxp_header
Allows you to specify a custom header for your DFXP
caption file. The header should be the entire contents of
the header including the opening and closing tags.
Ignored if caption_format does not equal DFXP.
Allowed Values XML String
Default Value “”
Example dfxp_header=<head></head>
disallow_dangling
Will prevent captions from having the last word in a sentence
start a new line. Last words will ALWAYS be kept on the same
line, even if it breaks the characters_per_caption_line option.
Allowed Values Boolean
Default Value false
Example disallow_dangling=true
display_effects_speaker_as
Determines what speaker name should used for sound effects.
Allowed Values String
Default Value Effects
Example display_effects_speaker_as=Sound
display_speaker_id
Determines the way speakers are identified in the captions.
Choose "no" to not display speaker identities at all:
">> example"
Choose "number" to display only the speaker number:
">> Speaker 1: example"
Choose "name" to display the speaker name:
">> John Doe: example".
If you choose "name", the speaker number will be displayed
if the name is not available.
Allowed Values Speaker ID Enumeration
Default Value name
Example display_speaker_id=number
elementlist_version
The version of element list to generate the captions from.
If not specified, the caption will be generated from the
latest version.
Allowed Values ISO 8601 Date String
Default Value “”
Example elementlist_version=2014-07-31T12:35:52.324389
emit_speaker_change_tokens_as
Determine what characters to use to denote speaker changes.
Allowed Values String
Default Value “>>”
Example emit_speaker_change_tokens_as=--
force_case
Force the contents of the captions to be all UPPER or
lower case. If blank, the case of the captions is not
changed.
Allowed Values Case Enumeration
Default Value “”
Example force_case=lower
include_dfxp_metadata
When true, and the caption format requested is DFXP,
the jobs name, ID and language will be added to the DFXP
metadata header. When false, these data are omitted from
the header.
Ignored if caption_format does not equal DFXP.
Allowed Values Boolean
Default Value true
Example include_dfxp_metadata=false
layout_target_caption_length_ms
Captions generated will, on average, be this duration. However,
they may vary significantly based on other parameters you set.
Allowed Values Integer
Default Value 5000
Example layout_target_caption_length_ms=4000
line_break_on_sentence
Inserts a line break in between sentences that are in
the same caption.
Allowed Values Boolean
Default Value false
Example line_break_on_sentence=true
line_ending_format
Determine the end of line (EOL) character to use for the
captions.
Allowed Values Line Ending Enumeration
Default Value “UNIX”
Example line_ending_format=OSX
lines_per_caption
Number of lines to be displayed for each caption.
Allowed Values Integer
Default Value 2
Example lines_per_caption=3
mask_profanity
Replace profanity with asterisks.
Allowed Values Boolean
Default Value false
Example mask_profanity=true
maximum_caption_duration
No captions longer than this (in milliseconds) will be
produced. If not specified, there is no maximum.
Allowed Values Integer
Default Value None (no maximum)
Example maximum_caption_duration=10000
merge_gap_interval
Captions with a gap between them that is smaller than
this (in milliseconds) will have their start and/or
end times changed so there is no time gap between the captions.
Allowed Values Integer
Default Value 1000
Example merge_gap_interval=1500
minimum_caption_length_ms
Extends the duration of short captions to the this minimum
length. Additional time is taken from later caption blocks
to meet this minimum time.
Allowed Values Integer
Default Value None (no minimum)
Example minimum_caption_length_ms=1500
minimum_gap_between_captions_ms
Adds a minimum time between captions such as there will
always be some time between captions where no text is
displayed. When captions are very close together, time
will be removed from the caption duration to make the gap.
Allowed Values Integer
Default Value None (no minimum)
Example minimum_gap_between_captions_ms=100
qt_seamless
Does not put time gaps of any kind between caption blocks.
Ignored if caption_format does not equal QT.
Allowed Values Boolean
Default Value false
Example qt_seamless=true
remove_disfluencies
Remove verbal disfluencies from the generated transcript.
Common disfluencies such as "um" and "ah" are removed while
maintaining appropriate punctuation.
Allowed Values Boolean
Default Value true
Example remove_disfluencies=false
remove_sounds_list
A list of sounds to not show in the caption. This is a
JSON style list, and should look like ["MUSIC", "LAUGH"].
Ignored if remove_sound_references is true.
Allowed Values Sound Tag Enumeration
Default Value []
Example remove_sounds_list=["MUSIC","LAUGH"]
remove_sound_references
Remove ALL non-verbal sound and noise references from the
generated transcript. Sounds and unidentified noises are
depicted in the caption as [SOUND], [COUGH] and [NOISE].
If this parameter is set, these identifiers are omitted
from the caption.
Allowed Values Boolean
Default Value true
Example remove_sound_references=false
replace_slang
Replace common slang terms from the generated transcript.
Common replacements are "want to" for "wanna", "going to"
for "gonna", etc.
Allowed Values Boolean
Default Value false
Example replace_slang=true
silence_max_ms
If there is a interval of silence in the middle of a sentence
longer than this, then the caption will be split.
Allowed Values Integer
Default Value 2000
Example silence_max_ms=1000
single_speaker_per_caption
When true, puts each speaker into its own caption. When false,
more than one speaker may appear in a single caption.
Allowed Values Boolean
Default Value false
Example single_speaker_per_caption=true
sound_boundaries
Specifies the characters to surround sound references with.
The default will generate sound references that look like
this: [MUSIC].
Allowed Values Character Tuple
Default Value (‘[‘,’]’)
Example sound_boundaries=('{','}')
sound_threshold
Sound references that are longer than this threshold will
be made their own caption entirely, and will not have any
text included with them. If not set, Sound references will
be included back to back with text no matter the duration of
the sound.
Allowed Values Integer
Default Value None
Example sound_threshold=5000
sound_tokens_by_caption
If true, all sound references will always be in their own
caption. If false, more than one sound reference may
appear in a single caption.
Allowed Values Boolean
Default Value false
Example sound_tokens_by_caption=true
sound_tokens_by_line
If true, all sound references will always be in their own
line. If false, more than one sound reference may appear
in a single line.
Allowed Values Boolean
Default Value false
Example sound_tokens_by_line=true
sound_tokens_by_caption_list
If non-empty, the specified sound references will always
be in their own caption. If empty, more than one
sound reference may appear in a single caption.
Ignored if sound_tokens_by_caption is true.
Allowed Values Sound Tag Enumeration
Default Value [“BLANK_AUDIO”,”MUSIC”]
Example sound_tokens_by_caption_list=[]
sound_tokens_by_line_list
If non-empty, the specified sound references will always
be in their own line. If empty, more than one
sound reference may appear in a single line.
Ignored if sound_tokens_by_line is true.
Allowed Values Sound Tag Enumeration
Default Value [“BLANK_AUDIO”,”MUSIC]
Example sound_tokens_by_line_list=["NOISE"]
speaker_on_new_line
If true, a speaker change will cause a new caption to be
made. If false, multiple speakers may appear in a single
caption.
Allowed Values Boolean
Default Value true
Example speaker_on_new_line=false
srt_format
If the caption format is SRT, determines what the caption
blocks will look like. The default, prints caption blocks
that look like this:

   1:
   00:00:06,060 --> 00:00:16,060
   This is the caption text.

You can alter the caption block by re-arranging or removing
the substitution string values, shown enclosed in braces "{}"
in the default value below. Substitution strings may used
more than once if desired. Any text that is not a substitution
string will be displayed as written. To add new lines, include
a \n. Note, you may need to escape the \n with an extra
backslash when encoding the request.
Allowed Values String
Default Value
{caption_number:d}\n{start_hour:02d}:
{start_minute:02d}:{start_second:02d},
{start_millisecond:03d} -->{end_hour:02d}:
{end_minute:02d}:{end_second:02d},
{end_millisecond:03d}\n{caption_text}\n\n
Example srt_format=...
strip_square_brackets
Removes all square brackets like '[' or ']' from captions.
By default square brackets surround sound references like
'[MUSIC]', but they may exist as part of the caption text as well.
Allowed Values Boolean
Default Value false
Example strip_square_brackets=true
utf8_mark
Adds a utf8 bytemark to the beginning of the caption. This
should only be used if the system you are loading the caption
files into needs a byte marker. The vast majority of systems
do not.
Allowed Values Boolean
Default Value false
Example utf8_mark=true

Responses

Returns requested caption file as body data, or if build_url was set to true returns caption URL.
HTTP Code Details
200 Description Success
Contents
Caption file as body data
200 Description Success
Contents
{
  "CaptionUrl": "Hosted URL"
}
400 Description An error occurred
Contents Error description (see Error Format for details)

Example Requests

GET /api/job/get_caption?v=1&api_token=7ca5dc5c7cce449fb0fff719307e8f5f
&job_id=64bea283eff6475ea6596027a6ba0929&caption_format=SRT HTTP/1.1
Host: api.cielo24.com

Example Response

HTTP/1.1 200 OK
Content-Type: text/plain

1
00:02:17,440 --> 00:02:20,375
Some text on first line.

2
00:02:20,476 --> 00:02:22,501
More text on second line.
HTTP/1.1 200 OK
Content-Type: application/json

{ "CaptionUrl": "http://domain.com/path/file.srt" }