Xiangtai  Li (lxtGH)

lxtGH

Geek Repo

Company:Bytedance

Location:Singapore

Home Page:https://lxtgh.github.io/

Twitter:@xtl994

Github PK Tool:Github PK Tool

Xiangtai Li's starred repositories

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:28578Issues:324Issues:5217

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:21552Issues:172Issues:162

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:20236Issues:200Issues:109

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10189Issues:123Issues:196

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonLicense:Apache-2.0Stargazers:4076Issues:55Issues:155

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3617Issues:112Issues:62

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3037Issues:25Issues:120

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:2959Issues:30Issues:372

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonLicense:MITStargazers:1220Issues:23Issues:30

OMG-Seg

[CVPR-2024] One Model For Image/Video/Instractive/Open-Vocabulary Segmentation

Language:PythonLicense:NOASSERTIONStargazers:796Issues:18Issues:8

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:713Issues:16Issues:20

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Entity

EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:674Issues:22Issues:43

ovsam

[arXiv preprint] The official code of paper "Open-Vocabulary SAM".

Language:PythonLicense:NOASSERTIONStargazers:604Issues:13Issues:22

Awesome-Segmentation-With-Transformer

[Arxiv-04-2023] Transformer-Based Visual Segmentation: A Survey

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language:PythonLicense:NOASSERTIONStargazers:440Issues:20Issues:15
Language:PythonLicense:GPL-3.0Stargazers:276Issues:18Issues:6
Language:PythonLicense:MITStargazers:196Issues:10Issues:6

mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Language:PythonLicense:MITStargazers:154Issues:3Issues:0

CLIPSelf

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

Language:PythonLicense:NOASSERTIONStargazers:144Issues:6Issues:21

Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

ADer

ADer is an open source visual anomaly detection toolbox based on PyTorch, which supports multiple popular AD datasets and approaches.

MambaAD

Official implementation of MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.

betrayed-by-captions

(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

Language:Jupyter NotebookStargazers:43Issues:6Issues:8

PointCloudMamba

Point Cloud Mamba: Point Cloud Learning via State Space Model

Language:PythonStargazers:43Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:29Issues:9Issues:0

VG4D

Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)

Stargazers:10Issues:0Issues:0

DAQ-VS

Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Stargazers:7Issues:0Issues:0

BA-SAM

Official code for BA-SAM:Scalable Bias-Mode Attention Mask for Segment Anything Model