Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation

PyTorch Implementation of Make-An-Audio 2

We will release our implementation and pretrained models as open source in this repository soon.

Visit our demo page for audio samples.

Citations

If you find this code useful in your research, please consider citing:

@misc{huang2023makeanaudio,
      title={Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation}, 
      author={Jiawei Huang and Yi Ren and Rongjie Huang and Dongchao Yang and Zhenhui Ye and Chen Zhang and Jinglin Liu and Xiang Yin and Zejun Ma and Zhou Zhao},
      year={2023},
      eprint={2305.18474},
      archivePrefix={arXiv},
      primaryClass={cs.SD}
}

Disclaimer

Any organization or individual is prohibited from using any technology mentioned in this paper to generate someone's speech without their consent, including but not limited to government leaders, political figures, and celebrities. If you do not comply with this requirement, you could be in violation of copyright laws.
