Lyrics and Speech Alignment

Aligns audio containing speech or singing with the corresponding subtitle lines and words, and returns word-by-word and line-by-line aligned data in JSON format.

Documentation

Settings

  • Name
    language
    Type
    string
    Description

    The main language of the input audio file.

  • Name
    alignment
    Type
    string
    Description

    The type of alignment to apply.

  • Name
    diarization
    Type
    boolean
    Description

    Identify the speaker of each subtitle sentence, based on the input audio.

Input

  • Name
    audioInputFileUrl
    Type
    string
    Description

    URL of the audio file containing speech or singing that you want to align with the subtitles.

  • Name
    subtitleInputFileUrl
    Type
    string
    Description

    URL of the subtitle file that you want to align with the audio.
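
As an illustration of how the settings and input parameters above might fit together, the following Python sketch assembles a request body and serializes it to JSON. Only the parameter names and types come from this page. The helper name build_alignment_request, the grouping into "settings" and "input" objects, the example URLs, and the example values "en" and "word" are assumptions made for illustration; the accepted values and the actual request shape are not specified here.

```python
import json


def build_alignment_request(audio_url, subtitle_url, language, alignment, diarization):
    """Assemble a request body from the documented parameter names.

    Hypothetical helper: the "settings"/"input" nesting is an assumption,
    not a documented request format.
    """
    # The page documents these four parameters as strings.
    string_params = {
        "audioInputFileUrl": audio_url,
        "subtitleInputFileUrl": subtitle_url,
        "language": language,
        "alignment": alignment,
    }
    for name, value in string_params.items():
        if not isinstance(value, str) or not value:
            raise ValueError(f"{name} must be a non-empty string")

    # The page documents diarization as a boolean.
    if not isinstance(diarization, bool):
        raise TypeError("diarization must be a boolean")

    return {
        "settings": {
            "language": language,
            "alignment": alignment,
            "diarization": diarization,
        },
        "input": {
            "audioInputFileUrl": audio_url,
            "subtitleInputFileUrl": subtitle_url,
        },
    }


if __name__ == "__main__":
    # Placeholder URLs and values; replace with ones the service accepts.
    payload = build_alignment_request(
        audio_url="https://example.com/song.mp3",
        subtitle_url="https://example.com/song.srt",
        language="en",
        alignment="word",
        diarization=False,
    )
    print(json.dumps(payload, indent=2))
```

Validating the types before sending keeps obvious mistakes, such as passing the string "true" for diarization, from reaching the service.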

Output

  • Name
    alignmentV2
    Type
    string
    Description

    Transcribed and aligned content.
