Nanyang Wang (nywang16)

nywang16

Geek Repo

Company:Alibaba Group

Github PK Tool:Github PK Tool

Nanyang Wang's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:34727Issues:347Issues:1672
Language:PythonLicense:NOASSERTIONStargazers:34273Issues:316Issues:339

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:28920Issues:339Issues:266

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18242Issues:156Issues:467

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:16801Issues:153Issues:1305

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language:PythonLicense:Apache-2.0Stargazers:14654Issues:111Issues:155

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6392Issues:108Issues:292
Language:PythonLicense:NOASSERTIONStargazers:5999Issues:66Issues:113

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonLicense:Apache-2.0Stargazers:5826Issues:68Issues:268

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonLicense:MITStargazers:3497Issues:47Issues:168

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language:PythonLicense:MITStargazers:3466Issues:100Issues:159
Language:Jupyter NotebookLicense:MITStargazers:2811Issues:53Issues:156

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2697Issues:49Issues:87

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1511Issues:21Issues:83

composer

Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"

phenaki-pytorch

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch

Language:PythonLicense:MITStargazers:720Issues:39Issues:30

bubogpt

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

Language:PythonLicense:BSD-3-ClauseStargazers:477Issues:10Issues:19

X-VLM

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

Language:PythonLicense:BSD-3-ClauseStargazers:434Issues:5Issues:33

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:384Issues:14Issues:39

CM3Leon

An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images

Language:PythonLicense:MITStargazers:328Issues:21Issues:15

GRiT

GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)

Language:PythonLicense:MITStargazers:277Issues:2Issues:17

Subject-Diffusion

Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning

Language:PythonLicense:MITStargazers:257Issues:8Issues:10

MagicBrush

[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

Language:PythonLicense:NOASSERTIONStargazers:250Issues:6Issues:10

instruction-tuned-sd

Code for instruction-tuning Stable Diffusion.

Language:PythonLicense:Apache-2.0Stargazers:181Issues:4Issues:15

T2I-CompBench

[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language:PythonLicense:MITStargazers:144Issues:2Issues:16

distribution_augmentation

Code for the paper, "Distribution Augmentation for Generative Modeling", ICML 2020.

Language:PythonLicense:MITStargazers:119Issues:10Issues:2
Language:PythonLicense:Apache-2.0Stargazers:72Issues:4Issues:11

punctuator

A small seq2seq punctuator tool based on DistilBERT

Language:PythonLicense:Apache-2.0Stargazers:47Issues:3Issues:1

pytorch_tvc

A PyTorch implementation of TVC

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:20Issues:5Issues:3