-
Notifications
You must be signed in to change notification settings - Fork 473
Open
Description
I'm a bit confused about these options:
--task <task> Task to perform (transcribe or translate) (default: transcribe)
--language <language> Language spoken in the audio
I want to essentially generate two subtitles:
- A srt file that is only english dialogue is transcribed, without any
(speaking in french)etc subtitles, for when there are hardcoded translations on the screen - A srt file that is only english, with any foreign language (if any) translated to english
Currently, with:
whisperkit-cli transcribe --verbose --task transcribe --report --skip-special-tokens --chunking-strategy vad --language en --model medium.en
I'm getting .srt files for french documentaries that are mute for like a minute, then some (speaking in french) and then several minutes in starts becoming english translations.
Is it something like where I need to use the large or turbo model, omit --task and --language, but do --prompt like:
Transcribe english dialogue only, ignore all other dialogue and sounds.Transcribe all dialogue into english; including translating all non-english dialogue into english. Do not transcribe sounds, only transcribe and translate speech.
Would that work?
Metadata
Metadata
Assignees
Labels
No labels