Xiangtai  Li (lxtGH)

lxtGH

Geek Repo

Company:Bytedance

Location:Singapore

Home Page:https://lxtgh.github.io/

Twitter:@xtl994

Github PK Tool:Github PK Tool

Xiangtai Li's starred repositories

PointCloudMamba

Point Cloud Mamba: Point Cloud Learning via State Space Model

Language:PythonStargazers:44Issues:0Issues:0

Awesome-Segmentation-With-Transformer

[Arxiv-04-2023] Transformer-Based Visual Segmentation: A Survey

Stargazers:597Issues:0Issues:0

BA-SAM

Official code for BA-SAM:Scalable Bias-Mode Attention Mask for Segment Anything Model

Language:PythonStargazers:7Issues:0Issues:0

genview

Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning"

Language:PythonLicense:Apache-2.0Stargazers:6Issues:0Issues:0

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:48969Issues:0Issues:0
Stargazers:8Issues:0Issues:0

Video-K-Net

[CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation

Language:PythonLicense:MITStargazers:150Issues:0Issues:0
Language:PythonStargazers:3Issues:0Issues:0
License:MITStargazers:90Issues:0Issues:0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2375Issues:0Issues:0

Language-Driven-Video-Inpainting

(CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"

Language:PythonStargazers:38Issues:0Issues:0

PCM

Point Could Mamba: Point Cloud Learning via State Space Model

Stargazers:56Issues:0Issues:0

IntrinsicImageDiffusion

Intrinsic Image Diffusion for Single-view Material Estimation

Language:PythonLicense:NOASSERTIONStargazers:123Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:Apache-2.0Stargazers:10665Issues:0Issues:0

robust-ref-seg

(TIP 2024) Towards Robust Referring Image Segmentation

Language:PythonStargazers:15Issues:0Issues:0

EMO

[ICCV 2023] Official PyTorch implementation of "Rethinking Mobile Block for Efficient Attention-based Models"

Language:Jupyter NotebookStargazers:217Issues:0Issues:0

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

License:Apache-2.0Stargazers:1843Issues:0Issues:0

Skeleton-in-Context

[CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning

Language:PythonStargazers:27Issues:0Issues:0

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4162Issues:0Issues:0

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5086Issues:0Issues:0

gemma

Open weights LLM from Google DeepMind.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2146Issues:0Issues:0

fast-DiT

Fast Diffusion Models with Transformers

Language:PythonLicense:NOASSERTIONStargazers:584Issues:0Issues:0

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language:PythonLicense:NOASSERTIONStargazers:2481Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5399Issues:0Issues:0

StableCascade

Official Code for Stable Cascade

Language:Jupyter NotebookLicense:MITStargazers:6398Issues:0Issues:0

dst-det

state-of-the-art open vocabulary detector on COCO/LVIS/V3Det

Language:PythonStargazers:22Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:3300Issues:0Issues:0

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

Stargazers:867Issues:0Issues:0

PointNeXt

[NeurIPS'22] PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

Language:ShellLicense:MITStargazers:718Issues:0Issues:0

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonLicense:MITStargazers:5437Issues:0Issues:0