Lyrics and speech alignment

Aligns audio containing speech or singing with the corresponding subtitle lines or words, and returns word-by-word and line-by-line alignment data in JSON format.

Module documentation

Settings

  • Name: word
    Type: boolean

  • Name: language
    Type: string
    Description: The main language of the input audio file.

  • Name: syllable
    Type: boolean

  • Name: diarization
    Type: boolean

Input

  • Name: audioInputFileUrl
    Type: string
    Description: URL of the audio file containing the speech or singing to align with the subtitles.

  • Name: subtitleInputFileUrl
    Type: string
    Description: URL of the subtitle file to align with the audio.
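
The settings and inputs listed above can be gathered into a single parameter set before a job is submitted. The following is a minimal sketch, not the module's official client. The field names and types come from the lists above, but the flat dictionary shape, the `build_params` helper, the example URLs, and the `"en"` language value are assumptions made for illustration. How the parameters are actually submitted is not covered here.

```python
from urllib.parse import urlparse

# Field names and types are taken from the Settings and Input lists above.
SETTING_TYPES = {
    "word": bool,
    "language": str,
    "syllable": bool,
    "diarization": bool,
}
INPUT_FIELDS = ("audioInputFileUrl", "subtitleInputFileUrl")


def build_params(audio_url, subtitle_url, **settings):
    """Hypothetical helper: return a flat dict of inputs and settings after basic checks."""
    params = {
        "audioInputFileUrl": audio_url,
        "subtitleInputFileUrl": subtitle_url,
    }
    # Both inputs are documented as strings; here we also require a URL scheme.
    for name in INPUT_FIELDS:
        value = params[name]
        if not isinstance(value, str) or not urlparse(value).scheme:
            raise ValueError(f"{name} must be a URL string, got {value!r}")
    # Reject settings that are not documented or that have the wrong type.
    for name, value in settings.items():
        expected = SETTING_TYPES.get(name)
        if expected is None:
            raise KeyError(f"unknown setting: {name}")
        if not isinstance(value, expected):
            raise TypeError(f"{name} must be {expected.__name__}")
        params[name] = value
    return params


params = build_params(
    "https://example.com/song.wav",  # placeholder URL
    "https://example.com/song.srt",  # placeholder URL
    word=True,
    language="en",  # assumed value; accepted language codes are not listed here
)
```

Checking types locally catches a misspelled setting name or a string passed where a boolean is expected before any request is made.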

Output

  • Name: alignmentV2
    Type: string
    Description: Transcribed and aligned content.
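
The structure of the alignment JSON is not described on this page, so it is worth inspecting the result before relying on any particular field. The sketch below assumes the `alignmentV2` content has already been read into a string of JSON text. `summarize_alignment` is a hypothetical helper, and the sample payload is invented purely to exercise it. It is not the module's real output schema.

```python
import json


def summarize_alignment(text):
    """Parse JSON text and report its top-level shape without assuming a schema."""
    data = json.loads(text)
    if isinstance(data, dict):
        return {"type": "object", "keys": sorted(data)}
    if isinstance(data, list):
        return {"type": "array", "length": len(data)}
    return {"type": type(data).__name__}


# Invented sample, for illustration only; the real field names may differ.
sample = '[{"text": "hello", "start": 0.0, "end": 0.5}]'
summary = summarize_alignment(sample)
print(summary)
```

Once the top-level shape is known, field access can be written against what the output actually contains.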
