Zineng Tang (zinengtang)

zinengtang

Geek Repo

Home Page:zinengtang.github.io/

Twitter:@ZinengTang

Github PK Tool:Github PK Tool

Zineng Tang's repositories

TVLT

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)

Language:Jupyter NotebookLicense:MITStargazers:118Issues:3Issues:16

VidLanKD

Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))

Perceiver_VL

PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)

Language:PythonLicense:MITStargazers:32Issues:3Issues:3

DeCEMBERT

Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)

ContinuousFlowNLG

Pytorch version of Continuous Language Generative Flow (ACL 2021)

Language:Jupyter NotebookLicense:MITStargazers:3Issues:1Issues:0

zinengtang.github.io

Personal Website

Language:HTMLStargazers:3Issues:1Issues:0

audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:0Issues:0Issues:0

AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

CS184_FinalProject

Computational Design of High-level Interlocking Puzzles (Siggraph 2022 Journal Track Paper)

Stargazers:0Issues:0Issues:0

make-a-video-pytorch

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ViLT

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

License:Apache-2.0Stargazers:0Issues:0Issues:0

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

Language:PythonLicense:UnlicenseStargazers:0Issues:0Issues:0