Beast code in Giters

liangzimei's starred repositories

TaskMatrix

Language:PythonNOASSERTION34456 309 348

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.028785 336 266

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonBSD-3-Clause24867 219 438

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.022504 184 3488

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonApache-2.017250 181 724

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.016101 151 1254

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause8722 93 601

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookNOASSERTION8427 75 433

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language:Jupyter NotebookMIT7450 93 145

Yi

A series of large language models trained from scratch by developers @01-ai

Language:PythonApache-2.07116 104 284

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookApache-2.05480 50 133

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonApache-2.04990 63 362

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonApache-2.04987 32 266

Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

4229 120 24

GPT-4-LLM

Instruction Tuning with GPT-4

Language:HTMLApache-2.03963 45 31

stablediffusion-infinity

Outpainting with Stable Diffusion on an infinite canvas

Language:PythonApache-2.03801 40 165

mPLUG-Owl

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Language:PythonMIT1930 27 202

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonApache-2.01869 31 225

CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Language:PythonMIT1470 15 75

Multimodal-GPT

Language:PythonApache-2.01401 12 15

unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Language:PythonAGPL-3.01247 17 32

DDNM

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Language:Python1021 27 67

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Language:Python967 14 36

VideoX

VideoX: a collection of video cross-modal models

Language:PythonNOASSERTION929 22 109

MAT

MAT: Mask-Aware Transformer for Large Hole Image Inpainting

Language:PythonNOASSERTION680 10 112

X-LLM

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

Language:PythonApache-2.0273 10 15

Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

Language:PythonApache-2.0256 5 26

ICLRec

Language:PythonBSD-3-Clause75 4 8

protoclip

📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)

Language:PythonNOASSERTION41 80

MIC

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

Language:Python2500