Hugging Face pretrained models integration

Question

mkatic007 opened this issue 4 months ago

Could you please explain how to add a Hugging Face pretrained model to work with your solution?

Abdeladim S. · Answer 1 · Thu Apr 04 2024 03:08:50 GMT+0800 (China Standard Time)

@mkatic007, I've added the Hugging Face implementation to the supported models.
You can use any pretrained model from the hub as long as it is compatible with the Automatic Speech Recognition task.
Please give it a try and let me know if you find any issues.

mkatic007 · Answer 2 · Mon Apr 08 2024 05:39:10 GMT+0800 (China Standard Time)

Thank you! I tried with:

subsai D:/TranSource/03.mp3 --model japanese-asr/distil-whisper-large-v3-ja-reazonspeech-large --model-configs "{"model_type": "large-v3"}" --format srt -tm mbart50 -tsl japanese -ttl english

But it gives the error:

return AVAILABLE_MODELS[model_name]['class']
KeyError: 'japanese-asr/distil-whisper-large-v3-ja-reazonspeech-large'

I did not download the model from HF, so I am not sure if I am missing any steps :)
Please be so kind as to instruct me on what to do.

Abdeladim S. · Answer 3 · Tue Apr 09 2024 11:55:00 GMT+0800 (China Standard Time)

The command should look like:

subsai D:/TranSource/03.mp3 --model HuggingFaceModel --model-configs "{"model_id": "japanese-asr/distil-whisper-large-v3-ja-reazonspeech-large"}" --format srt -tm mbart50 -tsl japanese -ttl english
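
The earlier KeyError is consistent with the traceback's `AVAILABLE_MODELS[model_name]` lookup: the `--model` value is used as a key into a registry of backends, so a hub ID passed there is not found, while `HuggingFaceModel` with the hub ID in `model_id` is. A minimal sketch of that lookup, assuming a registry shaped like the one in the traceback (the registry entry, the placeholder class, and the `resolve` helper are hypothetical, not the library's code):

```python
# Hypothetical registry, shaped after the traceback's AVAILABLE_MODELS[model_name] lookup.
AVAILABLE_MODELS = {
    "HuggingFaceModel": {"class": dict},  # dict is only a placeholder class for illustration
}


def resolve(model_name: str, model_config: dict):
    """Look up a backend by name, then hand it the config (hypothetical helper)."""
    backend = AVAILABLE_MODELS[model_name]["class"]  # KeyError if the name is not registered
    return backend(model_config)


hub_id = "japanese-asr/distil-whisper-large-v3-ja-reazonspeech-large"

# The hub ID goes into the config; the model name selects the backend.
model = resolve("HuggingFaceModel", {"model_id": hub_id})

# Passing the hub ID as the model name fails the registry lookup.
try:
    resolve(hub_id, {})
    lookup_failed = False
except KeyError:
    lookup_failed = True
```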

mkatic007 · Answer 4 · Wed Apr 17 2024 05:20:59 GMT+0800 (China Standard Time)

Thank you, I tried but now I am getting this error:
"json.decoder.JSONDecodeError: Expecting property name enclosed in double quotes: line 1 column 2 (char 1)".
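
This message is what Python's `json` module reports when an object key is not wrapped in double quotes, which would happen if the shell consumed the inner quotes of the `--model-configs` value before the program saw it. That cause is an assumption and is not confirmed in the thread. A minimal sketch that reproduces the message and builds an escaped argument (variable names are illustrative):

```python
import json

hub_id = "japanese-asr/distil-whisper-large-v3-ja-reazonspeech-large"

# What the program might receive if the shell strips the inner double quotes (assumption).
stripped = "{model_id: " + hub_id + "}"

try:
    json.loads(stripped)
    error_message = None
except json.JSONDecodeError as exc:
    error_message = str(exc)

# Valid JSON for the same config, produced by the json module itself.
config_json = json.dumps({"model_id": hub_id})

# One way to keep the inner quotes through a shell that treats " as a delimiter:
# escape each inner double quote with a backslash (shell behaviour varies).
escaped_arg = '"' + config_json.replace('"', '\\"') + '"'

print(error_message)
print(escaped_arg)
```

The printed `escaped_arg` is a candidate value to paste after `--model-configs`; whether backslash escaping is the right form depends on the shell in use.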