declare-lab / tango

A family of diffusion models for text-to-audio generation.

Home Page:https://tango2-web.github.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Noisy audio samples

schowdhury671 opened this issue · comments

Hi,

I am trying to reproduce the results and have run the inference code. However, the generated audio samples are completely noisy. Any suggestions as to what might be going wrong here? I am sharing my inference.py script and the command I have used to run the code

CUDA_VISIBLE_DEVICES=0 python inference.py --test_file="../audiocaps/test_audiocaps_subset.json" --text_encoder_name="google/flan-t5-large" --scheduler_name="configs/stable_diffusion_2.1.json" --unet_model_config="configs/diffusion_model_config.json" --model="../audioldm-s-full.ckpt" --batch_size=6

tango_files.zip