Skip to content

How to generate english subtitles, with translation of foreign languages, via the CLI #357

@balupton

Description

@balupton

I'm a bit confused about these options:

  --task <task>           Task to perform (transcribe or translate) (default: transcribe)
  --language <language>   Language spoken in the audio

I want to essentially generate two subtitles:

  1. A srt file that is only english dialogue is transcribed, without any (speaking in french) etc subtitles, for when there are hardcoded translations on the screen
  2. A srt file that is only english, with any foreign language (if any) translated to english

Currently, with:

whisperkit-cli transcribe --verbose --task transcribe --report --skip-special-tokens --chunking-strategy vad --language en --model medium.en

I'm getting .srt files for french documentaries that are mute for like a minute, then some (speaking in french) and then several minutes in starts becoming english translations.

Is it something like where I need to use the large or turbo model, omit --task and --language, but do --prompt like:

  1. Transcribe english dialogue only, ignore all other dialogue and sounds.
  2. Transcribe all dialogue into english; including translating all non-english dialogue into english. Do not transcribe sounds, only transcribe and translate speech.

Would that work?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions