declare-lab / tango

A family of diffusion models for text-to-audio generation.

Home Page:https://tango2-web.github.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Producing audio in different Sample Rate

cvillela opened this issue · comments

Hey!

I was wondering if it was possible to train the model in 48kHz audio, and then generate audio directly in 48kHz. Has anyone attempted this?

That is definitely possible and would be really great to have! We could not try this due to computational constraints.

Awesome! Will try it out. How much VRAM do you think is necessary for attempting it?

@deepanwayx Also, I see that the "Tango Prompt Bank" is all in 16.000Hz. Would you guys have the raw dataset, not resampled, available?