Zineng Tang's repositories
Perceiver_VL
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
ContinuousFlowNLG
Pytorch version of Continuous Language Generative Flow (ACL 2021)
zinengtang.github.io
Personal Website
audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
CS184_FinalProject
Computational Design of High-level Interlocking Puzzles (Siggraph 2022 Journal Track Paper)
make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
youtube-dl
Command-line program to download videos from YouTube.com and other video sites