Nanyang Wang's starred repositories
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
alpaca-lora
Instruct-tune LLaMA on consumer hardware
dalle-mini
DALL·E Mini - Generate images from a text prompt
open_flamingo
An open-source framework for training large multimodal models.
Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
phenaki-pytorch
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
Subject-Diffusion
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
instruction-tuned-sd
Code for instruction-tuning Stable Diffusion.
T2I-CompBench
[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
distribution_augmentation
Code for the paper, "Distribution Augmentation for Generative Modeling", ICML 2020.
punctuator
A small seq2seq punctuator tool based on DistilBERT
pytorch_tvc
A PyTorch implementation of TVC