Training Model (Semantics)
bastiansurya77 opened this issue · comments
bastiansurya77 commented
I had the data of multiple ~8 seconds audio clips (.wav). If I understand it correctly, do I need to generate the semantics output, fine output and course output to able to train it using my own dataset? and is it able to generate a natural synthetis audio by training it using my own datasets?
DagsHub commented