DV Lab

dvlab-research

Deep Vision Lab

DV Lab's repositories

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonApache-2.02983 24 107

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language:PythonApache-2.02467 13 163

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonApache-2.01481 10 119

VoxelNeXt

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

Language:PythonApache-2.0647 8 59

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language:PythonApache-2.0584 11 88

DeepUPE

Underexposed Photo Enhancement Using Deep Illumination Estimation

Language:Python559 24 79

3D-Box-Segment-Anything

We extend Segment Anything to 3D perception by combining it with VoxelNeXt.

Language:Jupyter Notebook508 11 16

PointGroup

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

Language:PythonApache-2.0365 13 62

FocalsConv

Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)

Language:PythonApache-2.0359 3 34

Video-P2P

Video-P2P: Video Editing with Cross-attention Control

Language:Python332 9 14

PFENet

PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).

Language:Python297 9 81

SphereFormer

The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).

Language:PythonApache-2.0277 5 71

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

Language:PythonApache-2.0258 4 5

Parametric-Contrastive-Learning

Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)

Language:PythonMIT222 7 23

LargeKernel3D

LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs (CVPR 2023)

Language:PythonApache-2.0182 6 16

Context-Aware-Consistency

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Language:PythonMIT154 5 30

SparseTransformer

A fast and memory-efficient libarary for sparse transformer with varying token numbers (e.g., 3D point cloud).

Language:PythonApache-2.0142 6 7

MOOD

Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distributionDetection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution Detection.

Language:Python131 3 11

RIVAL

[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain

Language:PythonApache-2.0129 17 8

Ref-NPR

[CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance Fields

Language:PythonApache-2.0119 6 13

Prompt-Highlighter

[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Language:PythonMIT102 2 2

Imbalanced-Learning

Imbalanced learning tool for imbalanced recognition and segmentation

Language:Python77 4 3

Mask-Attention-Free-Transformer

Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation"

Language:Python58 3 11

MoTCoder

This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.

Language:Python5402

ProposeReduce

Video Instance Segmentation with a Propose-Reduce Paradigm (ICCV 2021)

Language:Python41 4 4

TriVol

The official code of TriVol in CVPR-2023

Language:Python38 7 5

GroupContrast

[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

MIT3500

LBGAT

Learnable Boundary Guided Adversarial Training (ICCV2021)

Language:PythonMIT33 3 4

MR-GSM8K

Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

Language:Python33 2 2

APD

Language:Python4 20