Lyrics and Speech Alignment

Match audio containing speech or singing with corresponding subtitle lines or words, providing word-by-word and line-by-line aligned data in JSON format.

Documentation

Settings

Name: language
Type: string
Description: The main language of the input audio file.

Name: alignment
Type: string
Description: The alignment that should be applied.

Name: diarization
Type: boolean
Description: Apply speaker identification for each subtitle sentence based on input audio.

Input

Name: audioInputFileUrl
Type: string
Description: Audio file containing speech or singing that you want to align with the subtitles.

Name: subtitleInputFileUrl
Type: string
Description: Subtitle file you want to align with the audio.

Output

Name: alignmentV2
Type: string
Description: Transcribed and aligned content.
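
As a rough illustration of how the settings and inputs above fit together, the sketch below assembles them into a single parameter dictionary and checks the documented types. Only the field names (language, alignment, diarization, audioInputFileUrl, subtitleInputFileUrl) come from this page. The function name, the flat payload layout, the example URLs, and the placeholder values "en" and "word" are assumptions, since the accepted values for language and alignment are not listed here.

```python
import json


def build_alignment_params(audio_url, subtitle_url, language, alignment,
                           diarization=False):
    """Assemble the documented settings and inputs into one dictionary.

    Hypothetical helper: the field names are taken from the tables above,
    but the flat payload shape is an assumption, not the service's API.
    """
    string_fields = {
        "audioInputFileUrl": audio_url,
        "subtitleInputFileUrl": subtitle_url,
        "language": language,
        "alignment": alignment,
    }
    # Every documented string field must be a non-empty string.
    for name, value in string_fields.items():
        if not isinstance(value, str) or not value:
            raise ValueError(f"{name} must be a non-empty string")
    # diarization is documented as a boolean.
    if not isinstance(diarization, bool):
        raise TypeError("diarization must be a boolean")

    params = dict(string_fields)
    params["diarization"] = diarization
    return params


if __name__ == "__main__":
    # Placeholder URLs and setting values, for illustration only.
    params = build_alignment_params(
        audio_url="https://example.com/song.wav",
        subtitle_url="https://example.com/song.srt",
        language="en",
        alignment="word",
        diarization=True,
    )
    print(json.dumps(params, indent=2))
```

Validating types before submitting a job catches mistakes such as passing the string "true" for diarization. How the parameters are actually sent, and how the alignmentV2 output is retrieved, depends on the client or workflow tooling in use.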

Related Modules

Audio Activity Detection

Generate a timeline marking periods of audio activity.

Pad

Add silence to the beginning and/or end of your audio file to create space before and after the audio content.

Segment

Extract a specified segment from an audio file, retaining a chosen duration starting from a designated point in time.

Segments

Extract segments from an audio file based on a segment array with 'start' and 'end' properties.

Silence Trim

Remove silence from audio files.

Static File

Create consistent transformations of your content with a static reference file. Ideal for reference mastering and other modules where you want to use a static file as part of your workflow.

Vocal Pitch Shift Suggestion

Compute pitch shift suggestions based on a vocal range map and an input audio target.
