ZrrSkywalker

Renrui Zhang's repositories

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language: Python | License: MIT | 1495 27 45

[CVPR 2023] Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis

Language: Python | License: MIT | 477 17 38

[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer

Language: Python | 338 11 65

[CVPR 2022] PointCLIP: Point Cloud Understanding by CLIP

Language: Python | 323 10 18

[CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders

Language: Python | 211 17 11

[NeurIPS 2022] Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training

Language: Python | License: MIT | 203 11 17

[ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Language: Python | License: MIT | 134 7 5

Mathematical Visual Instruction Tuning for Multi-modal Large Language Models

License: MIT | 86 5 2

Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

License: GPL-3.0 | 82 4 1

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

License: MIT | 35 30

The multi-view version of MonoDETR on nuScenes dataset

Enhancing Zero-shot CLIP with Cross-Modality Attention

010

Language: Python | 010

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Language: Jupyter Notebook | License: CC-BY-SA-4.0 | 010

Language: Python | 010