jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Home Page:https://jaywalnut310.github.io/vits-demo/index.html

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

VITS paper ?

rishikksh20 opened this issue · comments

@jaywalnut310 I am unable to find the paper on which this repo based on.

@rishikksh20 Thanks for you interest! The paper will be uploaded on arxiv after a couple of days (if no problem arises). When it's uploaded, I'll update README and make a link to it. Please wait until then :).

Now the paper is available: https://arxiv.org/abs/2106.06103

@jaywalnut310 thanks. I set-up collab for LJ-Speech : https://colab.research.google.com/drive/1aNMn2PHDzhQ2nFU5RoWsPSjesDeedm5B?usp=sharing

@rishikksh20 Thans for your amazing work! Is it okay to introduce your collab in README?
It would be much grateful, if your sentence of the last line is like "We propose VITS, a Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech."

Yeah Sure, I have update the collab notebook.

@rishikksh20 Based on your work, I made a new notebook including multi speaker examples. Thanks again for your work, and I referred you in README!