Xiaodong Wang's starred repositories
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
generative-models
Generative Models by Stability AI
dalle-mini
DALL·E Mini - Generate images from a text prompt
llama-recipes
Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Demo apps to showcase Llama2 for WhatsApp & Messenger.
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
robotic-transformer-pytorch
Implementation of RT1 (Robotic Transformer) in PyTorch
Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Visual-LLaMA
Open LLaMA Eyes to See the World
SceneScape
Official PyTorch Implementation for "SceneScape: Text-Driven Consistent Scene Generation"
SSHT-plus-plus
SSHT++