Renrui Zhang (ZrrSkywalker)



Company: CUHK MMLab

Location: Hong Kong

Home Page: https://zrrskywalker.github.io/


Renrui Zhang's starred repositories

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 44612 · Watchers: 294 · Issues: 640

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles: Latest papers and datasets on Multimodal Large Language Models and their evaluation.

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language: Jupyter Notebook · License: MIT · Stargazers: 8018 · Watchers: 124 · Issues: 409

ImageBind

ImageBind: One Embedding Space to Bind Them All

Language: Python · License: NOASSERTION · Stargazers: 7952 · Watchers: 99 · Issues: 83

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.

Language: Python · License: Apache-2.0 · Stargazers: 5832 · Watchers: 68 · Issues: 268

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python · License: GPL-3.0 · Stargazers: 5550 · Watchers: 78 · Issues: 141

DragGAN

Unofficial implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment; code and models fully open-sourced; supports Windows, macOS, and Linux)

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language: Python · License: Apache-2.0 · Stargazers: 4090 · Watchers: 55 · Issues: 133

GPT-4-LLM

Instruction Tuning with GPT-4

Language: HTML · License: Apache-2.0 · Stargazers: 4017 · Watchers: 46 · Issues: 32

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python · License: NOASSERTION · Stargazers: 2559 · Watchers: 36 · Issues: 128

Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

Language: Python · License: MIT · Stargazers: 2440 · Watchers: 36 · Issues: 65

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language: Python · License: MIT · Stargazers: 1445 · Watchers: 27 · Issues: 44

prolificdreamer

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)

Language: Python · License: Apache-2.0 · Stargazers: 1396 · Watchers: 114 · Issues: 21

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language: Python · License: Apache-2.0 · Stargazers: 639 · Watchers: 10 · Issues: 24

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language: Python · License: NOASSERTION · Stargazers: 468 · Watchers: 11 · Issues: 17

PointLLM

[arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds

Point-Bind_Point-LLM

Align 3D Point Cloud with Multi-modalities for Large Language Models

Language: Python · License: MIT · Stargazers: 371 · Watchers: 14 · Issues: 12

MonoDETR

[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer

APE

[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"

Language: Jupyter Notebook · Stargazers: 125 · Watchers: 9 · Issues: 12

lightning-GPT

Train and run GPTs with Lightning

Language: Python · License: Apache-2.0 · Stargazers: 91 · Watchers: 14 · Issues: 3

IAE

[ICCV 2023] "Implicit Autoencoder for Point-Cloud Self-Supervised Representation Learning"

MUTR

[AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation

Language: Python · License: MIT · Stargazers: 56 · Watchers: 3 · Issues: 3

ViewRefer3D

[ICCV 2023] Official implementation of "ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance"

MV-JAR

[CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training

Point-PEFT

[AAAI 2024] Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models

TFS3D

[CVPR 2024] Less Is More: Towards Efficient Few-Shot 3D Semantic Segmentation via Training-Free Networks

Language: Python · License: Apache-2.0 · Stargazers: 6 · Watchers: 0 · Issues: 0