Text to Speech MLX model.

Question

javileyes opened this issue 5 months ago · comments

Javier Giménez Moya commented 5 months ago

There are MLX models for text generation (Llama 3) and for speech recognition (Whisper), but I think a complete NLP environment would also need a text-to-speech MLX model. How could one create, for example, an MLX model of facebook/fastspeech2-en-ljspeech?

Awni Hannun · Answer 1 · Thu May 09 2024 21:10:46 GMT+0800 (China Standard Time)

It should be possible. There is a port of Suno's Bark model already: https://github.com/j-csc/mlx_bark

I think it still depends on PyTorch for the EnCodec model, though.