yxchng

yxchng's starred repositories

DenseSSM

A repository for DenseSSMs

Language: Python · 8300

LSK3DNet

This is the official implementation of "LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels" (Accepted at CVPR 2024).

License: MIT · 2000

Awesome-Robotics-Foundation-Models

License: MIT · 74000

LWM

Language: Python · License: Apache-2.0 · 699300

DenseFormer

Language: Python · License: Apache-2.0 · 7000

scaling_on_scales

When do we not need larger vision models?

Language: Python · License: MIT · 25300

oneformer3d

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Language: Python · License: NOASSERTION · 25300

superpoint_transformer

Official PyTorch implementation of Superpoint Transformer introduced in [ICCV'23] "Efficient 3D Semantic Segmentation with Superpoint Transformer" and SuperCluster introduced in [3DV'24 Oral] "Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering"

Language: Python · License: MIT · 49300

EMO

[ICCV 2023] Official PyTorch implementation of "Rethinking Mobile Block for Efficient Attention-based Models"

Language: Jupyter Notebook · 21900

FeatUp

Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024

Language: Jupyter Notebook · License: MIT · 128700

DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Language: Python · License: Apache-2.0 · 19400

rnn-icrag

Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"

Language: Python · 2300

SgMg

[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.

Language: Python · License: NOASSERTION · 7600

VD-IT

Language: Python · License: NOASSERTION · 2200

CAGroup3D

[NeurIPS2022] This is the official code of "CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds".

Language: Python · 8800

Swin3D

A shift-window based transformer for 3D sparse tasks

Language: Cuda · License: MIT · 19000

LocalMamba

Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan

Language: Python · License: Apache-2.0 · 17000

3detr

Code & Models for 3DETR - an End-to-end transformer model for 3D object detection

Language: Python · License: Apache-2.0 · 60500

V-DETR

[ICLR 2024] This is the official code of the paper "V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection"

Language: Python · 7000

PCM

Point Cloud Mamba: Point Cloud Learning via State Space Model

6000

PointMamba

PointMamba: A Simple State Space Model for Point Cloud Analysis

Language: Python · License: Apache-2.0 · 30500

Awesome-state-space-models

Collection of papers on state-space models

IMProv

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Language: Python · 5700

Patch-Aligned-Contrastive-Learning

Language: Python · 1000

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python · License: GPL-3.0 · 391000

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python · License: Apache-2.0 · 124300

SLD

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"

Language: Python · License: MIT · 13200

VMamba

VMamba: Visual State Space Models; code is based on Mamba

Language: Python · License: MIT · 183400

Vision-RWKV

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Language: Python · License: Apache-2.0 · 28900

LLaVA-HR

LLaVA-HR: High-Resolution Large Language-Vision Assistant

Language: Python · License: Apache-2.0 · 18800