Skip to content

Using fine tuned whisper model with whisper-timestamped #69

Answered by Jeronymous
blueskyleaf asked this question in Q&A
Discussion options

You must be logged in to vote

Oh, that's a really relevant suggestion: adding support for models finetuned by HuggingFace's transformers or SpeechBrain.

I've just pushed something, adding things in whisper_timestamped.load_model.

Now you should be able to just do:

import whisper_timestamped as whisper

audio = whisper.load_audio("audio.mp3")
model = whisper.load_model("NbAiLab/whisper-large-v2-nob", device="cuda")
result = whisper.transcribe(model, audio)

Note: for now, the load_model will work with HuggingFace repo and local folders as soon as they include the model size in their names ("tiny", "small", ..., "large-v2").

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@blueskyleaf
Comment options

@Jeronymous
Comment options

@blueskyleaf
Comment options

Answer selected by Jeronymous
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants