ChaimZhu (ZCMax)

ZCMax

Geek Repo

Company:HKU IDS | HKU-MMLab

Location:Hong Kong SAR

Github PK Tool:Github PK Tool

ChaimZhu's starred repositories

llama.cpp

LLM inference in C/C++

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25304Issues:221Issues:458

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15893Issues:106Issues:1028

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonLicense:NOASSERTIONStargazers:15686Issues:133Issues:615

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11847Issues:171Issues:230

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8217Issues:73Issues:407

InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Language:PythonLicense:Apache-2.0Stargazers:6259Issues:54Issues:331

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3783Issues:23Issues:509

Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

gradslam

gradslam is an open source differentiable dense SLAM library for PyTorch

Language:PythonLicense:MITStargazers:1311Issues:47Issues:38

awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language:PythonLicense:MITStargazers:905Issues:16Issues:62

Segment-Any-Point-Cloud

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

OmniObject3D

[ CVPR 2023 Award Candidate ] OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

vlmaps

[ICRA2023] Implementation of Visual Language Maps for Robot Navigation

Language:PythonLicense:MITStargazers:357Issues:11Issues:55

LAMM

[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents

EmbodiedQA

Train embodied agents that can answer questions in environments

Language:PythonLicense:NOASSERTIONStargazers:294Issues:22Issues:20

NeRF-Det

[ICCV 2023] Code for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

Language:PythonLicense:NOASSERTIONStargazers:280Issues:28Issues:16

3RScan

3RScan Toolkit

Language:C++License:MITStargazers:184Issues:9Issues:16

3D-VisTA

Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"

Language:PythonLicense:MITStargazers:180Issues:5Issues:26

concept-fusion

Code release for ConceptFusion [RSS 2023]

Language:PythonLicense:NOASSERTIONStargazers:99Issues:4Issues:22

Scan2Cap

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Language:PythonLicense:NOASSERTIONStargazers:99Issues:7Issues:23

Awesome-3D-Vision-and-Language

A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.

P3Former

[IJCV 2024] P3Former: Position-Guided Point Cloud Panoptic Segmentation Transformer

3D-CLR-Official

[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"

Language:PythonStargazers:72Issues:0Issues:0

MVT-3DVG

Multi-View Transformer for 3D Visual Grounding [CVPR 2022]

Refer-it-in-RGBD

Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021

Language:PythonLicense:MITStargazers:39Issues:2Issues:5