Speech

Transcribe and align Speech from any audio file. Converting spoken content into textual form. This model is optimized for speech transcription. For singing/lyrics transcription, please use the Lyrics Transcription module.

Module documentation

Settings

  • Name
    language
    Type
    string
    Description

    Expected spoken language of the audio. Defaults to Auto-detection.

Input

  • Name
    inputFileUrl
    Type
    string
    Description

    Audio to process.

Output

  • Name
    alignment
    Type
    string
    Description

    Transcribed and aligned speech.

  • Name
    transcription
    Type
    string
    Description

    Transcribed speech.

Ready to take your project to the next level?

Start now — or reach out for assistance.