Renrui Zhang (ZrrSkywalker)

Company: CUHK MMLab

Location: Hong Kong

Home Page: https://zrrskywalker.github.io/

Renrui Zhang's repositories

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language: Python | License: MIT | Stargazers: 1495 | Issues: 27 | Issues: 45

Point-NN

[CVPR 2023] Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis

Language: Python | License: MIT | Stargazers: 477 | Issues: 17 | Issues: 38

MonoDETR

[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer

PointCLIP

[CVPR 2022] PointCLIP: Point Cloud Understanding by CLIP

I2P-MAE

[CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders

Point-M2AE

[NeurIPS 2022] Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training

Language: Python | License: MIT | Stargazers: 203 | Issues: 11 | Issues: 17

MathVerse

[ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Language: Python | License: MIT | Stargazers: 134 | Issues: 7 | Issues: 5

MAVIS

Mathematical Visual Instruction Tuning for Multi-modal Large Language Models

LLaMA-Adapter

Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

CaFo

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

License: MIT | Stargazers: 35 | Issues: 3 | Issues: 0

MonoDETR-MV

The multi-view version of MonoDETR on the nuScenes dataset

CALIP

Enhancing Zero-shot CLIP with Cross-Modality Attention

Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

MathVista

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Language: Jupyter Notebook | License: CC-BY-SA-4.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0