Plachtaa / FAcodec

Training code for FAcodec presented in NaturalSpeech3

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Audio format in dataset files

r666ay opened this issue · comments

commented

Thanks for you great work on implementing FACodec!
I found the data file in https://github.com/Plachtaa/FAcodec/blob/master/data/val.txt has some labels, like speaker id, phonemes. How can I get these labels? Will these labels be auto-generated in the training process?

It was from VCTK dataset for legacy implementation. For the current version in this repo, annotation is not required. Auto-generated labels will not be saved during training process

commented

It was from VCTK dataset for legacy implementation. For the current version in this repo, annotation is not required. Auto-generated labels will not be saved during training process

Thanks for your reply. What models are used to generate these annotations? I want to export the auto-generated labels.