Xiangtai  Li (lxtGH)

lxtGH

Geek Repo

Company:Bytedance

Location:Singapore

Home Page:https://lxtgh.github.io/

Twitter:@xtl994

Github PK Tool:Github PK Tool

Xiangtai Li's starred repositories

Language:PythonLicense:MITStargazers:51Issues:0Issues:0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers:2649Issues:0Issues:0

kmax-deeplab

a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.

Language:PythonLicense:Apache-2.0Stargazers:64Issues:0Issues:0

Prompt-Diffusion

Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"

Language:PythonLicense:Apache-2.0Stargazers:358Issues:0Issues:0

GPT4RoI

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Language:PythonLicense:NOASSERTIONStargazers:471Issues:0Issues:0

StableSR

Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language:PythonLicense:NOASSERTIONStargazers:1918Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17431Issues:0Issues:0

InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

Language:PythonLicense:Apache-2.0Stargazers:5439Issues:0Issues:0

CrossKD

CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection

Language:PythonLicense:NOASSERTIONStargazers:116Issues:0Issues:0

VoxFormer

Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]

Language:PythonLicense:NOASSERTIONStargazers:982Issues:0Issues:0

StreamPETR

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Language:PythonLicense:NOASSERTIONStargazers:500Issues:0Issues:0

UniAD

[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:2999Issues:0Issues:0

Point-In-Context

[NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding

Language:PythonStargazers:60Issues:0Issues:0

OmniObject3D

[ CVPR 2023 Award Candidate ] OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

Language:PythonStargazers:425Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:372Issues:0Issues:0

hiera

Hiera: A fast, powerful, and simple hierarchical vision transformer.

Language:PythonLicense:Apache-2.0Stargazers:708Issues:0Issues:0

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language:JavaScriptLicense:MITStargazers:5282Issues:0Issues:0

ContextDET

Contextual Object Detection with Multimodal Large Language Models

License:NOASSERTIONStargazers:168Issues:0Issues:0

InternVideo

Video Foundation Models & Data for Multimodal Understanding

Language:PythonLicense:Apache-2.0Stargazers:1050Issues:0Issues:0

InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

Language:PythonLicense:Apache-2.0Stargazers:3155Issues:0Issues:0

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:35292Issues:0Issues:0

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonLicense:GPL-3.0Stargazers:5582Issues:0Issues:0

learning_research

本人的科研经验

Stargazers:4587Issues:0Issues:0

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonLicense:NOASSERTIONStargazers:7997Issues:0Issues:0

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonLicense:MITStargazers:3522Issues:0Issues:0

SegmentAnyRGBD

Segment Any RGBD

Language:PythonLicense:NOASSERTIONStargazers:736Issues:0Issues:0

Multimodal-GPT

Multimodal-GPT

Language:PythonLicense:Apache-2.0Stargazers:1432Issues:0Issues:0

eqlv2

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection. https://arxiv.org/abs/2012.08548 https://arxiv.org/abs/2003.05176

Language:PythonLicense:Apache-2.0Stargazers:153Issues:0Issues:0

BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonLicense:Apache-2.0Stargazers:6353Issues:0Issues:0