zinengtang

Zineng Tang's repositories

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)

Language:Jupyter NotebookMIT118 3 16

Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))

Language:Python56 4 7

PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)

Language:PythonMIT32 3 3

Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)

Language:Python17 3 3

Pytorch version of Continuous Language Generative Flow (ACL 2021)

Language:Python11 2 1

Language:Jupyter NotebookMIT3 10

Personal Website

Language:HTML3 10

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Language:Jupyter NotebookGPL-3.0000

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Language:PythonMIT000

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonNOASSERTION000

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0000

MIT000

Computational Design of High-level Interlocking Puzzles (Siggraph 2022 Journal Track Paper)

000

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch

Language:PythonMIT000

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Apache-2.0000

Command-line program to download videos from YouTube.com and other video sites

Language:PythonUnlicense000