lxtGH

Xiangtai Li's starred repositories

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonApache-2.028578 324 5217

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION21552 172 162

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT20236 200 109

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonApache-2.010189 123 196

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonApache-2.04076 55 155

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT3617 112 62

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonApache-2.03037 25 120

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonApache-2.02959 30 372

nerfies.github.io

Language:JavaScript1957 35 5

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonMIT1220 23 30

OMG-Seg

[CVPR-2024] One Model For Image/Video/Instractive/Open-Vocabulary Segmentation

Language:PythonNOASSERTION796 18 8

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookNOASSERTION713 16 20

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

705 21 10

Entity

EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation

Language:Jupyter NotebookNOASSERTION674 22 43

ovsam

[arXiv preprint] The official code of paper "Open-Vocabulary SAM".

Language:PythonNOASSERTION604 13 22

Awesome-Segmentation-With-Transformer

[Arxiv-04-2023] Transformer-Based Visual Segmentation: A Survey

595 10 5

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language:PythonNOASSERTION440 20 15

Mira

Language:PythonGPL-3.0276 18 6

RAP-SAM

Language:PythonMIT196 10 6

mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Language:PythonMIT154 30

CLIPSelf

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

Language:PythonNOASSERTION144 6 21

Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

Language:HTML129 4 1

ADer

ADer is an open source visual anomaly detection toolbox based on PyTorch, which supports multiple popular AD datasets and approaches.

Language:Python76 5 10

MambaAD

Official implementation of MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.

Language:Python61 6 6

betrayed-by-captions

(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

Language:Jupyter Notebook43 6 8

PointCloudMamba

Point Cloud Mamba: Point Cloud Learning via State Space Model

Language:Python4300

ml-rpm-bench

Language:PythonNOASSERTION29 90

VG4D

Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition（ICRA 2024）

1000

DAQ-VS

Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

700

BA-SAM

Official code for BA-SAM:Scalable Bias-Mode Attention Mask for Segment Anything Model

Language:Python7 2 3