sophiefy / VITS

ACG Text-to-Speech

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Stella TTS Based on VITS

Contents

Update

  • japanese_triphone_cleaners is available.

Introduction

Just have some fun...

The datasets and models are only for research use, NOT commercial!

MoeGoe

You can run my models on MoeGoe developed by CjangCjengh! Please first read the instructions of MoeGoe and then download my models and corresponding configuration files.

Models


Shiki Natsume

triphone

  • Description

To further utilize the contextual information of the text, I wrote a physiscal triphone cleaners for Japanese. For more details about triphone, refer to this material.

Raw text:

さあ、はりきって行きましょう。

Phonemes:

s a a,h a r i k i Q t e i k i m a ʃ o o.

Triphones (physical):

s+a s-a+a a-a,h+a h-a+r a-r+i r-i+k i-k+i k-i+Q i-Q+t Q-t+e t-e+i e-i+k i-k+i k-i+m i-m+a m-a+ʃ a-ʃ+o ʃ-o+o o-o.

The following 2 speakers are supported.

Name ID
四季ナツメ 0
司波深雪 1
  • Model:

  • Configuration file:

  • Demo:


Chipanese Bilingual Model

chipanese

  • Description

I modified the Japanese cleaner and conbined it with a Chinese cleaner into a bilingual one, called "Chipanese cleaner"! The training data are from Genshin, Café Stella and the Reaper's Butterflies and Kami-sama no You na Kimi e. For more information, please refer to this repo: VITS-Bilingual.

MoeGoe is NOT supported currently!

The following 2 speakers are supported.

Name ID
四季ナツメ 0
派蒙 1

Café Stella and the Reaper's Butterflies

cafe stella

  • Description
Name ID
四季ナツメ 0
明月栞那 1
墨染希 2
火打谷愛衣 3
汐山涼音 4

Yosuga No Sora

yosuga no sora

  • Description
Name ID
春日野穹 0
天女目瑛 1
依媛奈緒 2
渚一葉 3
春日野悠 (reserved) 4

Bishojo Mangekyo

mangekyo

  • Description
Name ID
蓮華 0
篝ノ霧枝 1
沢渡雫 2
亜璃子 3
灯露椎 4
覡夕莉 5

Genshin

genshin

  • Description

Both single speaker model of Paimon and multi-speaker model of 46 characters will be supported!


A Certain Scientific Railgun (collecting data...)

railgun

  • Characters
Name ID Name ID
上条当麻 0 削板軍覇 7
一方通行 1 御坂妹 8
垣根帝督 2 最終信号 9
御坂美琴 3 白井黒子 10
麦野沈利 4 佐天涙子 11
食蜂操祈 5 飾利初春 12
正体不明 6 インデックス 13
  • Model:

  • Configuration file:

  • Demo

Contact

QQ: 2235306122

BILIBILI: Francis-Komizu

References

Original code

Reference code

Triphone

About

ACG Text-to-Speech

License:MIT License


Languages

Language:Python 99.0%Language:Cython 1.0%