Zanlin Ni's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:46788Issues:306Issues:662

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:24133Issues:255Issues:302

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:19383Issues:159Issues:1487

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15898Issues:106Issues:1028

StableLM

StableLM: Stability AI Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:15843Issues:200Issues:76

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language:PythonLicense:NOASSERTIONStargazers:7648Issues:84Issues:100

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Language:PythonLicense:Apache-2.0Stargazers:3752Issues:55Issues:52

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonLicense:MITStargazers:3660Issues:47Issues:175

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2576Issues:36Issues:100

Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

Language:PythonLicense:MITStargazers:2500Issues:37Issues:68

s4

Structured state space sequence models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2369Issues:52Issues:134

mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Language:PythonLicense:MITStargazers:2240Issues:30Issues:224

Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

Language:PythonLicense:BSD-3-ClauseStargazers:1661Issues:15Issues:24

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1609Issues:21Issues:86

Dromedary

Dromedary: towards helpful, ethical and reliable LLMs.

Language:PythonLicense:GPL-3.0Stargazers:1109Issues:23Issues:12

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Agent-Attention

Official repository of Agent Attention (ECCV2024)

FLatten-Transformer

Official repository of FLatten Transformer (ICCV2023)

open-muse

Open reproduction of MUSE for fast text2image generation.

Language:PythonLicense:Apache-2.0Stargazers:322Issues:38Issues:27

Smooth-Diffusion

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

Language:PythonLicense:MITStargazers:302Issues:21Issues:15

LLaVA-UHD

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

open-diffusion

Simple large-scale training of stable diffusion with multi-node support.

MUSE-Pytorch

An in-context conditioning version of MUSE with pre-trained checkpoints.

Language:PythonLicense:MITStargazers:106Issues:3Issues:5

ARC

[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection

Language:PythonLicense:Apache-2.0Stargazers:101Issues:3Issues:20

Rank-DETR

[NeurIPS 2023] Rank-DETR for High Quality Object Detection

Language:PythonLicense:Apache-2.0Stargazers:86Issues:2Issues:4

LAUDNet

[IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition

Language:Jupyter NotebookStargazers:40Issues:2Issues:1

Dynamic_Perceiver

Official implementation of Dynamic Perceiver

Language:PythonLicense:MITStargazers:39Issues:2Issues:1

FamO2O

Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)

Language:PythonLicense:MITStargazers:37Issues:1Issues:1

SEEM

Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

Language:PythonStargazers:19Issues:1Issues:0