Zhifeng Gu's starred repositories

taichi

Productive, portable, and performant GPU programming in Python.

Language: C++ · License: Apache-2.0 · Stargazers: 25427 · Issues: 389 · Issues: 2651

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 14884 · Issues: 113 · Issues: 386

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python · License: MIT · Stargazers: 10443 · Issues: 68 · Issues: 105

ImageBind

ImageBind: One Embedding Space to Bind Them All

Language: Python · License: NOASSERTION · Stargazers: 8256 · Issues: 99 · Issues: 89

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language: Python · License: Apache-2.0 · Stargazers: 6839 · Issues: 49 · Issues: 211

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python · License: GPL-3.0 · Stargazers: 5709 · Issues: 78 · Issues: 142

MobileSAM

This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 4726 · Issues: 44 · Issues: 123

Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Language: Python · License: Apache-2.0 · Stargazers: 1867 · Issues: 21 · Issues: 103

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python · License: Apache-2.0 · Stargazers: 1787 · Issues: 11 · Issues: 143

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language: Python · License: Apache-2.0 · Stargazers: 1508 · Issues: 21 · Issues: 67

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language: Python · License: MIT · Stargazers: 921 · Issues: 16 · Issues: 62

UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Language: Python · License: Apache-2.0 · Stargazers: 905 · Issues: 12 · Issues: 19
Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 798 · Issues: 18 · Issues: 73

PoseDiffusion

[ICCV 2023] PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

Language: Python · License: NOASSERTION · Stargazers: 700 · Issues: 23 · Issues: 35

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language: Python · License: NOASSERTION · Stargazers: 568 · Issues: 11 · Issues: 24

kmeans_pytorch

kmeans using PyTorch

Language: Jupyter Notebook · License: MIT · Stargazers: 472 · Issues: 7 · Issues: 37

ENeRF

SIGGRAPH Asia 2022: Code for "Efficient Neural Radiance Fields for Interactive Free-viewpoint Video"

Language: Python · License: NOASSERTION · Stargazers: 414 · Issues: 22 · Issues: 54

garfield

[CVPR'24] Group Anything with Radiance Fields

Language: Python · License: MIT · Stargazers: 374 · Issues: 8 · Issues: 30

feature-3dgs

[CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields

Language: C++ · License: NOASSERTION · Stargazers: 326 · Issues: 8 · Issues: 42

PAC-NeRF

Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification

Language: Python · License: MIT · Stargazers: 257 · Issues: 4 · Issues: 9

GPTEval3D

[CVPR 2024] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"

SceneVerse

Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"

Language: Python · License: MIT · Stargazers: 176 · Issues: 11 · Issues: 24

MultiPLY

Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

M2PT

[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Language: Python · License: Apache-2.0 · Stargazers: 89 · Issues: 8 · Issues: 2

MaskClustering

[CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation

3D-CLR-Official

[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"

Language: Python · Stargazers: 73 · Issues: 0 · Issues: 0
Language: Python · License: MIT · Stargazers: 50 · Issues: 3 · Issues: 4

GCPose

[ICCV 2023] Learning Symmetry-Aware Geometry Correspondences for 6D Object Pose Estimation