Neal2020GitHub

Zhifeng Gu's starred repositories

taichi

Productive, portable, and performant GPU programming in Python.

Language:C++Apache-2.025427 389 2651

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookApache-2.014884 113 386

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonMIT10443 68 105

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION8256 99 89

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonApache-2.06839 49 211

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonGPL-3.05709 78 142

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookApache-2.04726 44 123

Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Language:PythonApache-2.01867 21 103

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonApache-2.01787 11 143

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language:PythonApache-2.01508 21 67

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language:PythonMIT921 16 62

UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Language:PythonApache-2.0905 12 19

Awesome-Robotics-Foundation-Models

MIT859 25 2

open_x_embodiment

Language:Jupyter NotebookApache-2.0798 18 73

PoseDiffusion

[ICCV 2023] PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

Language:PythonNOASSERTION700 23 35

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonNOASSERTION568 11 24

TRIPS

Language:C++MIT514 60 52

kmeans_pytorch

kmeans using PyTorch

Language:Jupyter NotebookMIT472 7 37

ENeRF

SIGGRAPH Asia 2022: Code for "Efficient Neural Radiance Fields for Interactive Free-viewpoint Video"

Language:PythonNOASSERTION414 22 54

garfield

[CVPR'24] Group Anything with Radiance Fields

Language:PythonMIT374 8 30

feature-3dgs

[CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields

Language:C++NOASSERTION326 8 42

PAC-NeRF

Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification

Language:PythonMIT257 4 9

GPTEval3D

[ CVPR 2024 ] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"

Language:Python221 10 3

SceneVerse

Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"

Language:PythonMIT176 11 24

MultiPLY

Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

Language:Python116 11 5

M2PT

[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Language:PythonApache-2.089 8 2

MaskClustering

[CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation

Language:Python75 5 6

3D-CLR-Official

[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"

Language:Python7300

wildrgbd

Language:PythonMIT50 3 4

GCPose

[ICCV 2023] Learning Symmetry-Aware Geometry Correspondences for 6D Object Pose Estimation

11 7 1