yxchng's starred repositories

DenseSSM

A repository for DenseSSMs

Language: Python | Stargazers: 83 | Issues: 0

LSK3DNet

This is the official implementation of "LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels" (Accepted at CVPR 2024).

License: MIT | Stargazers: 20 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 6993 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 70 | Issues: 0

scaling_on_scales

When do we not need larger vision models?

Language: Python | License: MIT | Stargazers: 253 | Issues: 0

oneformer3d

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Language: Python | License: NOASSERTION | Stargazers: 253 | Issues: 0

superpoint_transformer

Official PyTorch implementation of Superpoint Transformer introduced in [ICCV'23] "Efficient 3D Semantic Segmentation with Superpoint Transformer" and SuperCluster introduced in [3DV'24 Oral] "Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering"

Language: Python | License: MIT | Stargazers: 493 | Issues: 0

EMO

[ICCV 2023] Official PyTorch implementation of "Rethinking Mobile Block for Efficient Attention-based Models"

Language: Jupyter Notebook | Stargazers: 219 | Issues: 0

FeatUp

Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024

Language: Jupyter Notebook | License: MIT | Stargazers: 1287 | Issues: 0

DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Language: Python | License: Apache-2.0 | Stargazers: 194 | Issues: 0

rnn-icrag

Official repository of the paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"

Language: Python | Stargazers: 23 | Issues: 0

SgMg

[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.

Language: Python | License: NOASSERTION | Stargazers: 76 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 22 | Issues: 0

CAGroup3D

[NeurIPS2022] This is the official code of "CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds".

Language: Python | Stargazers: 88 | Issues: 0

Swin3D

A shifted-window-based transformer for 3D sparse tasks

Language: Cuda | License: MIT | Stargazers: 190 | Issues: 0

LocalMamba

Code for the paper "LocalMamba: Visual State Space Model with Windowed Selective Scan"

Language: Python | License: Apache-2.0 | Stargazers: 170 | Issues: 0

3detr

Code & Models for 3DETR - an End-to-end transformer model for 3D object detection

Language: Python | License: Apache-2.0 | Stargazers: 605 | Issues: 0

V-DETR

[ICLR 2024] This is the official code of the paper "V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection"

Language: Python | Stargazers: 70 | Issues: 0

PCM

Point Cloud Mamba: Point Cloud Learning via State Space Model

Stargazers: 60 | Issues: 0

PointMamba

PointMamba: A Simple State Space Model for Point Cloud Analysis

Language: Python | License: Apache-2.0 | Stargazers: 305 | Issues: 0

Awesome-state-space-models

Collection of papers on state-space models

Stargazers: 474 | Issues: 0

IMProv

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Language: Python | Stargazers: 57 | Issues: 0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python | License: GPL-3.0 | Stargazers: 3910 | Issues: 0

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python | License: Apache-2.0 | Stargazers: 1243 | Issues: 0

SLD

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"

Language: Python | License: MIT | Stargazers: 132 | Issues: 0

VMamba

VMamba: Visual State Space Models; code is based on Mamba

Language: Python | License: MIT | Stargazers: 1834 | Issues: 0

Vision-RWKV

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Language: Python | License: Apache-2.0 | Stargazers: 289 | Issues: 0

LLaVA-HR

LLaVA-HR: High-Resolution Large Language-Vision Assistant

Language: Python | License: Apache-2.0 | Stargazers: 188 | Issues: 0