Keith-Hon / vits-cantonese

Cantonese Text to Speech with VITS implementation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Demo outputs:

  1. 我愛你
audio.mp4

Instructions

  1. Create a virtual environment python -m venv ./venv

  2. open demo.ipynb and install the dependencies

  3. move files to opencc lib after installing opencc

    1. mv ./opencc/jyutjyu.json ./venv/lib/site-packages/opencc/clib/share/opencc/jyutjyu.json
    2. mv ./opencc/jyutjyu.ocd2 ./venv/lib/site-packages/opencc/clib/share/opencc/jyutjyu.ocd2
  4. build cpython codes cd vits/monotonic_align python setup.py build_ext --inplace

  5. download models

    1. https://huggingface.co/xiaomaiiwn/vits-cantonese/blob/main/model/G.pth

Acknowledgement

I do not own the models and you should follow the license of the models.

About

Cantonese Text to Speech with VITS implementation

License:MIT License


Languages

Language:Python 63.2%Language:Jupyter Notebook 34.3%Language:C++ 1.8%Language:Cython 0.5%Language:Dockerfile 0.2%