yxchng's starred repositories

superpoint_transformer

Official PyTorch implementation of Superpoint Transformer introduced in [ICCV'23] "Efficient 3D Semantic Segmentation with Superpoint Transformer" and SuperCluster introduced in [3DV'24 Oral] "Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering"

Language:PythonLicense:MITStargazers:520Issues:0Issues:0

EMO

[ICCV 2023] Official PyTorch implementation of "Rethinking Mobile Block for Efficient Attention-based Models"

Language:Jupyter NotebookStargazers:218Issues:0Issues:0

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookLicense:MITStargazers:1305Issues:0Issues:0

DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Language:PythonLicense:Apache-2.0Stargazers:199Issues:0Issues:0

rnn-icrag

Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"

Language:PythonStargazers:23Issues:0Issues:0

SgMg

[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.

Language:PythonLicense:NOASSERTIONStargazers:75Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:26Issues:0Issues:0

CAGroup3D

[NeurIPS2022] This is the official code of "CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds".

Language:PythonStargazers:88Issues:0Issues:0

Swin3D

A shift-window based transformer for 3D sparse tasks

Language:CudaLicense:MITStargazers:192Issues:0Issues:0

LocalMamba

Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan

Language:PythonLicense:Apache-2.0Stargazers:178Issues:0Issues:0

3detr

Code & Models for 3DETR - an End-to-end transformer model for 3D object detection

Language:PythonLicense:Apache-2.0Stargazers:606Issues:0Issues:0

V-DETR

[ICLR 2024] This is the official code of the paper "V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection"

Language:PythonStargazers:74Issues:0Issues:0

PCM

Point Could Mamba: Point Cloud Learning via State Space Model

Stargazers:61Issues:0Issues:0

PointMamba

PointMamba: A Simple State Space Model for Point Cloud Analysis

Language:PythonLicense:Apache-2.0Stargazers:314Issues:0Issues:0

Awesome-state-space-models

Collection of papers on state-space models

Stargazers:487Issues:0Issues:0

IMProv

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Language:PythonStargazers:57Issues:0Issues:0
Language:PythonStargazers:9Issues:0Issues:0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:4048Issues:0Issues:0

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language:PythonLicense:Apache-2.0Stargazers:1280Issues:0Issues:0

SLD

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)

Language:PythonLicense:MITStargazers:137Issues:0Issues:0

VMamba

VMamba: Visual State Space Models,code is based on mamba

Language:PythonLicense:MITStargazers:1894Issues:0Issues:0

Vision-RWKV

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Language:PythonLicense:Apache-2.0Stargazers:302Issues:0Issues:0

LLaVA-HR

LLaVA-HR: High-Resolution Large Language-Vision Assistant

Language:PythonLicense:Apache-2.0Stargazers:192Issues:0Issues:0

DCNv4

[CVPR 2024] Deformable Convolution v4

Language:PythonLicense:MITStargazers:429Issues:0Issues:0
Language:PythonLicense:MITStargazers:307Issues:0Issues:0

VisionLLaMA

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Language:PythonStargazers:341Issues:0Issues:0

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language:PythonLicense:Apache-2.0Stargazers:901Issues:0Issues:0

DVIS

DVIS: Decoupled Video Instance Segmentation Framework

Language:PythonLicense:MITStargazers:120Issues:0Issues:0

TOAST

Official code for "TOAST: Transfer Learning via Attention Steering"

Language:PythonStargazers:185Issues:0Issues:0

ovsam

[arXiv preprint] The official code of paper "Open-Vocabulary SAM".

Language:PythonLicense:NOASSERTIONStargazers:742Issues:0Issues:0