ChaimZhu (ZCMax)

ZCMax

Geek Repo

Company:HKU IDS | HKU-MMLab

Location:Hong Kong SAR

Github PK Tool:Github PK Tool

ChaimZhu's starred repositories

Awesome-3D-Vision-and-Language

A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.

License:MITStargazers:95Issues:0Issues:0

P3Former

[IJCV 2024] P3Former: Position-Guided Point Cloud Panoptic Segmentation Transformer

Language:PythonStargazers:73Issues:0Issues:0

Refer-it-in-RGBD

Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021

Language:PythonLicense:MITStargazers:38Issues:0Issues:0

3D-VisTA

Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"

Language:PythonLicense:MITStargazers:176Issues:0Issues:0

NeRF-Det

[ICCV 2023] Code for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

Language:PythonLicense:NOASSERTIONStargazers:270Issues:0Issues:0

gradslam

gradslam is an open source differentiable dense SLAM library for PyTorch

Language:PythonLicense:MITStargazers:1298Issues:0Issues:0

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language:PythonLicense:MITStargazers:862Issues:0Issues:0

concept-fusion

Code release for ConceptFusion [RSS 2023]

License:MITStargazers:161Issues:0Issues:0

3D-CLR-Official

[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"

Language:PythonStargazers:71Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11209Issues:0Issues:0

InternLM

Official release of InternLM2.5 7B base and chat models. 1M context support

Language:PythonLicense:Apache-2.0Stargazers:5853Issues:0Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3334Issues:0Issues:0

Segment-Any-Point-Cloud

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

Language:PythonStargazers:530Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:71Issues:0Issues:0

Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Language:PythonStargazers:2133Issues:0Issues:0

MVT-3DVG

Multi-View Transformer for 3D Visual Grounding [CVPR 2022]

Language:C++Stargazers:63Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:61928Issues:0Issues:0

vlmaps

[ICRA2023] Implementation of Visual Language Maps for Robot Navigation

Language:PythonLicense:MITStargazers:325Issues:0Issues:0

EmbodiedQA

Train embodied agents that can answer questions in environments

Language:PythonLicense:NOASSERTIONStargazers:290Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15111Issues:0Issues:0

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonLicense:NOASSERTIONStargazers:15636Issues:0Issues:0

LAMM

[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents

Language:PythonStargazers:285Issues:0Issues:0

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8135Issues:0Issues:0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25175Issues:0Issues:0

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

Language:PythonStargazers:412Issues:0Issues:0

awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

Stargazers:1044Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:93Issues:0Issues:0

3RScan

3RScan Toolkit

Language:C++License:MITStargazers:178Issues:0Issues:0

OmniObject3D

[ CVPR 2023 Award Candidate ] OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

Language:PythonStargazers:432Issues:0Issues:0

Scan2Cap

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Language:PythonLicense:NOASSERTIONStargazers:98Issues:0Issues:0