Xiaodong Wang's repositories
GPT2-BaikeChat
a Baike Chat robot using GPT2
SSHT-plus-plus
SSHT++
3d-photo-inpainting_git
a modify version of 3d-photo-inpainting
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
BLIP
clean code for BLIP
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Emu
Emu: An Open Multimodal Generalist
FateZero
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
Improved-Vista
A Generalizable World Model for Autonomous Driving
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
LLaVA-1.5
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Open-Sora-Plan
This project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
OpenVoice
Instant voice cloning by MyShell.
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
SceneScape
Official Pytorch Implementation for "SceneScape: Text-Driven Consistent Scene Generation"
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.