How to generate english subtitles, with translation of foreign languages, via the CLI

I'm a bit confused about these options:

```
  --task <task>           Task to perform (transcribe or translate) (default: transcribe)
  --language <language>   Language spoken in the audio
```

I want to essentially generate two subtitles:

1. A srt file that is only english dialogue is transcribed, without any `(speaking in french)` etc subtitles, for when there are hardcoded translations on the screen
2. A srt file that is only english, with any foreign language (if any) translated to english

Currently, with:

```
whisperkit-cli transcribe --verbose --task transcribe --report --skip-special-tokens --chunking-strategy vad --language en --model medium.en
```

I'm getting .srt files for french documentaries that are mute for like a minute, then some `(speaking in french)` and then several minutes in starts becoming english translations.

Is it something like where I need to use the large or turbo model, omit `--task` and `--language`, but do ` --prompt` like:

1. `Transcribe english dialogue only, ignore all other dialogue and sounds.`
2. `Transcribe all dialogue into english; including translating all non-english dialogue into english. Do not transcribe sounds, only transcribe and translate speech.`

Would that work?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to generate english subtitles, with translation of foreign languages, via the CLI #357

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

How to generate english subtitles, with translation of foreign languages, via the CLI #357

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions