yukang2017

Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Speech Inputs

Language:Jupyter NotebookApache-2.0000

GroundingDINO

The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonApache-2.0000

ierg5350-assignment

Language:Jupyter Notebook010

IST-Net

Language:Python000

LinK

[CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception

Language:Python000

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonMIT000

Mask3D

Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.

Language:PythonMIT000

Segment-Everything-Everywhere-All-At-Once

000

SparseKD

(NeurlPS 2022) Towards Efficient 3D Object Detection with Knowledge Distillation

Language:PythonApache-2.0010

spconv

Spatial Sparse Convolution Library

Language:PythonApache-2.0000

SPS-Conv

(NeurlPS 2022) Spatial Pruned Sparse Convolution for Efficient 3D Object Detection

Language:PythonApache-2.0010

spvnas

[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution

Language:PythonMIT000

SST

Codes for “Fully Sparse 3D Object Detection” & “Embracing Single Stride 3D Object Detector with Sparse Transformer”

Language:PythonApache-2.0000

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.0000

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonApache-2.0000