huggingface / distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

How can I assign parameters for another language transcribe task?

liuamo opened this issue

I tried the following approach, but it does not solve the problem:
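from huggingface_hub import hf_hub_download
from whisper import load_model

# model_id is the Hugging Face repo ID of the checkpoint (defined elsewhere)
model_large = hf_hub_download(model_id, filename="original-model.bin")
model = load_model(model_large)
result = model.transcribe("./audio.mp3", language='Chinese')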

Hey @liuamo - as mentioned in the README, the 'official' checkpoints are English-only: https://github.com/huggingface/distil-whisper#distil-whisper

But you can quite easily distil a Whisper model of your own for Mandarin using the provided training code: https://github.com/huggingface/distil-whisper/tree/main/training
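Until a Mandarin checkpoint has been distilled, one workaround is to route non-English audio to the original multilingual Whisper model and keep the distilled checkpoint for English only. The sketch below is a minimal illustration under stated assumptions, not the project's recommended method: `pick_checkpoint` and `transcribe_file` are hypothetical helpers, `distil-whisper/distil-large-v2` is assumed to be the English-only checkpoint in use, and `large-v2` is assumed to be the multilingual fallback from the `openai-whisper` package.

```python
# Assumption: these two names stand in for whichever checkpoints you actually use.
DISTIL_REPO_ID = "distil-whisper/distil-large-v2"  # English-only distilled checkpoint
MULTILINGUAL_MODEL = "large-v2"                    # original multilingual Whisper

ENGLISH_NAMES = {"en", "english"}


def pick_checkpoint(language: str) -> str:
    """Hypothetical helper: choose a model based on the requested language."""
    if language.strip().lower() in ENGLISH_NAMES:
        return DISTIL_REPO_ID
    return MULTILINGUAL_MODEL


def transcribe_file(audio_path: str, language: str) -> str:
    """Hypothetical helper: transcribe with the checkpoint chosen above."""
    # Imported lazily so pick_checkpoint works without the heavy dependencies.
    from huggingface_hub import hf_hub_download
    from whisper import load_model

    language = language.strip().lower()
    checkpoint = pick_checkpoint(language)
    if checkpoint == DISTIL_REPO_ID:
        # The distilled weights are downloaded from the Hub as a single file.
        weights = hf_hub_download(repo_id=DISTIL_REPO_ID, filename="original-model.bin")
        model = load_model(weights)
    else:
        # openai-whisper resolves its own model names such as "large-v2".
        model = load_model(checkpoint)
    result = model.transcribe(audio_path, language=language)
    return result["text"]
```

With these assumptions, `transcribe_file("./audio.mp3", "Chinese")` would load the multilingual model rather than the distilled one, at the cost of the distilled model's speed advantage.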

Thanks for your reply!