gaopengpjlab's starred repositories

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 45,034 · Issues: 298 · Issues: 648

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python · License: GPL-3.0 · Stargazers: 5,585 · Issues: 78 · Issues: 141

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python · License: NOASSERTION · Stargazers: 2,589 · Issues: 36 · Issues: 131

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language: Python · License: MIT · Stargazers: 1,526 · Issues: 27 · Issues: 46

OmniQuant

[ICLR 2024 Spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language: Python · License: MIT · Stargazers: 606 · Issues: 17 · Issues: 68

ConvMAE

ConvMAE: Masked Convolution Meets Masked Autoencoders

Language: Python · License: MIT · Stargazers: 466 · Issues: 11 · Issues: 36

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

CaFo

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

Language: Python · License: MIT · Stargazers: 329 · Issues: 12 · Issues: 12

MonoDETR

[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer

PointCLIP

[CVPR 2022] PointCLIP: Point Cloud Understanding by CLIP

Stable-Pix2Seq

A full-fledged version of Pix2Seq

Language: Python · License: Apache-2.0 · Stargazers: 234 · Issues: 7 · Issues: 19

PointCLIP_V2

[ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning

Language: Python · License: MIT · Stargazers: 207 · Issues: 10 · Issues: 27

I2P-MAE

[CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders

Point-M2AE

[NeurIPS 2022] Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training

Language: Python · License: MIT · Stargazers: 193 · Issues: 11 · Issues: 14

llama-mps

Experimental fork of Facebook's LLaMA model that runs with GPU acceleration on Apple Silicon (M1/M2)

Language: Python · License: GPL-3.0 · Stargazers: 84 · Issues: 3 · Issues: 0

Q-ViT

[NeurIPS 2022] The official implementation of Q-ViT.

maskalign

[CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"

Language: Python · License: Apache-2.0 · Stargazers: 62 · Issues: 5 · Issues: 3

MMT-Bench

[ICML 2024] MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

ProCA

[ECCV 2022] Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation

Language: Python · License: MIT · Stargazers: 51 · Issues: 2 · Issues: 8

FeatAug-DETR

Official repository of paper: "FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation"

svl_adapter

SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models

MonoDETR-MV

The multi-view version of MonoDETR on the nuScenes dataset

Language: Python · License: Apache-2.0 · Stargazers: 15 · Issues: 1 · Issues: 0

DMJD

PyTorch implementation of "Disjoint Masking with Joint Distillation for Efficient Masked Image Modeling"

Language: Python · License: MIT · Stargazers: 10 · Issues: 2 · Issues: 1