DV Lab (dvlab-research)

Deep Vision Lab

DV Lab's repositories

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 2983 | Issues: 24 | Issues: 107

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python | License: Apache-2.0 | Stargazers: 2467 | Issues: 13 | Issues: 163

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | Stargazers: 1481 | Issues: 10 | Issues: 119

VoxelNeXt

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 647 | Issues: 8 | Issues: 59

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language: Python | License: Apache-2.0 | Stargazers: 584 | Issues: 11 | Issues: 88

DeepUPE

Underexposed Photo Enhancement Using Deep Illumination Estimation

3D-Box-Segment-Anything

We extend Segment Anything to 3D perception by combining it with VoxelNeXt.

Language: Jupyter Notebook | Stargazers: 508 | Issues: 11 | Issues: 16

PointGroup

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

Language: Python | License: Apache-2.0 | Stargazers: 365 | Issues: 13 | Issues: 62

FocalsConv

Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)

Language: Python | License: Apache-2.0 | Stargazers: 359 | Issues: 3 | Issues: 34

Video-P2P

Video-P2P: Video Editing with Cross-attention Control

PFENet

PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).

SphereFormer

The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).

Language: Python | License: Apache-2.0 | Stargazers: 277 | Issues: 5 | Issues: 71

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

Language: Python | License: Apache-2.0 | Stargazers: 258 | Issues: 4 | Issues: 5

Parametric-Contrastive-Learning

Parametric Contrastive Learning (ICCV 2021) & GPaCo (TPAMI 2023)

Language: Python | License: MIT | Stargazers: 222 | Issues: 7 | Issues: 23

LargeKernel3D

LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs (CVPR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 182 | Issues: 6 | Issues: 16

Context-Aware-Consistency

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Language: Python | License: MIT | Stargazers: 154 | Issues: 5 | Issues: 30

SparseTransformer

A fast and memory-efficient library for sparse transformers with varying token numbers (e.g., 3D point clouds).

Language: Python | License: Apache-2.0 | Stargazers: 142 | Issues: 6 | Issues: 7

MOOD

Official PyTorch implementation of the MOOD series: (1) MOODv1: Rethinking Out-of-distribution Detection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution Detection.

RIVAL

[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain

Language: Python | License: Apache-2.0 | Stargazers: 129 | Issues: 17 | Issues: 8

Ref-NPR

[CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance Fields

Language: Python | License: Apache-2.0 | Stargazers: 119 | Issues: 6 | Issues: 13

Prompt-Highlighter

[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Language: Python | License: MIT | Stargazers: 102 | Issues: 2 | Issues: 2

Imbalanced-Learning

Imbalanced learning tool for imbalanced recognition and segmentation

Mask-Attention-Free-Transformer

Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation"

MoTCoder

This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.

Language: Python | Stargazers: 54 | Issues: 0 | Issues: 2

ProposeReduce

Video Instance Segmentation with a Propose-Reduce Paradigm (ICCV 2021)

TriVol

The official code of TriVol (CVPR 2023)

GroupContrast

[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

License: MIT | Stargazers: 35 | Issues: 0 | Issues: 0

LBGAT

Learnable Boundary Guided Adversarial Training (ICCV 2021)

Language: Python | License: MIT | Stargazers: 33 | Issues: 3 | Issues: 4

MR-GSM8K

Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

Language: Python | Stargazers: 4 | Issues: 2 | Issues: 0