Port of OpenAI's [Whisper](https://github.com/openai/whisper) model.
Clone the repository and instantiate it.
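A minimal sketch of this step, with the repository URL left as a placeholder:

```julia
# From a shell (replace <repository-url> with the actual repository address):
#   git clone <repository-url> Whisper.jl
#   cd Whisper.jl
# Then activate and instantiate the project from Julia:
julia> using Pkg

julia> Pkg.activate(".")

julia> Pkg.instantiate()
```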
- Specify the GPU backend in the `LocalPreferences.toml` file (either `AMDGPU` or `CUDA`) if using a GPU for inference (see the sketch after the examples below).
- Run the model:
```julia
julia> using AMDGPU # If using AMDGPU for inference.

julia> using CUDA # If using CUDA for inference.

julia> using Whisper, Flux

# GPU inference at FP16 precision.
julia> Whisper.transcribe(
           "./input.flac", "./output.srt";
           model_name="tiny.en", dev=gpu, precision=f16)

# CPU inference.
julia> Whisper.transcribe(
           "./input.flac", "./output.srt";
           model_name="tiny.en", dev=cpu, precision=f32)
```
## Multilingual support

To transcribe from a non-English language, specify the (optional) `language` argument and drop `.en` from the model name:
```julia
julia> Whisper.transcribe(
           "ukrainian-sample.flac", "./output.srt";
           model_name="medium", language="ukrainian", dev=cpu, precision=f32)
```

To see which languages are supported, execute:
```julia
julia> values(Whisper.LANGUAGES)
```

- Supported input file: `.flac` with 1 channel and a 16 kHz sample rate.
- Other input files are converted to it using `ffmpeg`, which must be installed on your system and accessible from `PATH` (see the conversion sketch below).
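To perform the conversion manually, an equivalent `ffmpeg` invocation (file names are placeholders) downmixes to one channel with `-ac 1` and resamples to 16 kHz with `-ar 16000`:

```julia
# Placeholder file names; any input format ffmpeg understands works.
julia> run(`ffmpeg -i input.mp3 -ac 1 -ar 16000 input.flac`)
```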
## TODO

- Beam search decoder.
- Streaming support.