Xiaodong Wang (Wang-Xiaodong1899)

Wang-Xiaodong1899

Geek Repo

Company:Peking University

Home Page:https://wang-xiaodong1899.github.io/

Github PK Tool:Github PK Tool

Xiaodong Wang's starred repositories

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:35567Issues:1003Issues:186

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29128Issues:341Issues:267

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23279Issues:249Issues:277

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language:PythonLicense:Apache-2.0Stargazers:14699Issues:112Issues:155

codellama

Inference code for CodeLlama models

Language:PythonLicense:NOASSERTIONStargazers:13860Issues:159Issues:169

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11085Issues:162Issues:217

al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language:HTMLLicense:MITStargazers:9905Issues:23Issues:527

cupy

NumPy & SciPy for GPU

Language:PythonLicense:MITStargazers:7973Issues:128Issues:2210

llama-recipes

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7850Issues:68Issues:227

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:4301Issues:48Issues:395

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:3442Issues:30Issues:250

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language:PythonLicense:MITStargazers:2871Issues:37Issues:188

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:2589Issues:31Issues:152

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:2112Issues:23Issues:158

gigagan-pytorch

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Language:PythonLicense:MITStargazers:1705Issues:72Issues:49

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language:PythonLicense:Apache-2.0Stargazers:1582Issues:42Issues:20

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1568Issues:21Issues:85

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:902Issues:71Issues:22

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonLicense:MITStargazers:884Issues:9Issues:17

SEED

Official implementation of SEED-LLaMA (ICLR 2024).

Language:PythonLicense:NOASSERTIONStargazers:518Issues:14Issues:43

ReVersion

ReVersion: Diffusion-Based Relation Inversion from Images

Language:PythonLicense:NOASSERTIONStargazers:434Issues:20Issues:7
Language:JavaScriptStargazers:397Issues:26Issues:0

robotic-transformer-pytorch

Implementation of RT1 (Robotic Transformer) in Pytorch

Language:PythonLicense:MITStargazers:358Issues:10Issues:6

Instruct2Act

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Visual-LLaMA

Open LLaMA Eyes to See the World

SceneScape

Official Pytorch Implementation for "SceneScape: Text-Driven Consistent Scene Generation"

NUWA-LIP

NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN

ORES

ORES: Open-vocabulary Responsible Visual Synthesis

Language:PythonLicense:MITStargazers:12Issues:2Issues:0
Language:PythonStargazers:2Issues:1Issues:0