Beast code in Giters

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonApache-2.05826 68 268

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonMIT3497 47 168

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language:PythonMIT3466 100 159

textual_inversion

Language:Jupyter NotebookMIT2811 53 156

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookApache-2.02697 49 87

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonApache-2.01511 21 83

composer

Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"

MIT1491 176 8

dreambooth

CC-BY-4.0762 12 4

phenaki-pytorch

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch

Language:PythonMIT720 39 30

bubogpt

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

Language:PythonBSD-3-Clause477 10 19

X-VLM

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

Language:PythonBSD-3-Clause434 5 33

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language:Jupyter NotebookApache-2.0384 14 39

CM3Leon

An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images

Language:PythonMIT328 21 15

GRiT

GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)

Language:PythonMIT277 2 17

Subject-Diffusion

Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning

Language:PythonMIT257 8 10

MagicBrush

[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

Language:PythonNOASSERTION250 6 10

instruction-tuned-sd

Code for instruction-tuning Stable Diffusion.

Language:PythonApache-2.0181 4 15

T2I-CompBench

[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language:PythonMIT144 2 16

distribution_augmentation

Code for the paper, "Distribution Augmentation for Generative Modeling", ICML 2020.

Language:PythonMIT119 10 2

HIVE

Language:PythonApache-2.072 4 11

punctuator

A small seq2seq punctuator tool based on DistilBERT

Language:PythonApache-2.047 3 1

pytorch_tvc

A PyTorch implementation of TVC

Language:Jupyter NotebookNOASSERTION20 5 3