lxtGH

Xiangtai Li's starred repositories

PointCloudMamba

Point Cloud Mamba: Point Cloud Learning via State Space Model

Language:Python4400

Awesome-Segmentation-With-Transformer

[Arxiv-04-2023] Transformer-Based Visual Segmentation: A Survey

59700

BA-SAM

Official code for BA-SAM:Scalable Bias-Mode Attention Mask for Segment Anything Model

Language:Python700

genview

Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning"

Language:PythonApache-2.0600

grok-1

Grok open release

Language:PythonApache-2.04896900

RefLDMSeg

800

Video-K-Net

[CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation

Language:PythonMIT15000

remax_deeplab

Language:Python300

bias

MIT9000

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonAGPL-3.0237500

Language-Driven-Video-Inpainting

(CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"

Language:Python3800

PCM

Point Could Mamba: Point Cloud Learning via State Space Model

5600

IntrinsicImageDiffusion

Intrinsic Image Diffusion for Single-view Material Estimation

Language:PythonNOASSERTION12300

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonApache-2.01066500

robust-ref-seg

(TIP 2024) Towards Robust Referring Image Segmentation

Language:Python1500

EMO

[ICCV 2023] Official PyTorch implementation of "Rethinking Mobile Block for Efficient Attention-based Models"

Language:Jupyter Notebook21700

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

Apache-2.0184300

Skeleton-in-Context

[CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning

Language:Python2700

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonMIT416200

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonApache-2.0508600

gemma

Open weights LLM from Google DeepMind.

Language:Jupyter NotebookApache-2.0214600

fast-DiT

Fast Diffusion Models with Transformers

Language:PythonNOASSERTION58400

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language:PythonNOASSERTION248100

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION539900

StableCascade

Official Code for Stable Cascade

Language:Jupyter NotebookMIT639800

dst-det

state-of-the-art open vocabulary detector on COCO/LVIS/V3Det

Language:Python2200

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:PythonMIT330000

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

86700

PointNeXt

[NeurIPS'22] PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

Language:ShellMIT71800

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonMIT543700