Beast code in Giters

Shizhen Zhao's starred repositories

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookMIT24964 323 394

open_clip

An open source implementation of CLIP.

Language:PythonNOASSERTION9908 77 475

VoxelNeXt

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

Language:PythonApache-2.0716 8 64

diffusion-point-cloud

:thought_balloon: Diffusion Probabilistic Models for 3D Point Cloud Generation (CVPR 2021)

Language:PythonMIT634 8 38

LOST

Pytorch implementation of LOST unsupervised object discovery method

Language:PythonNOASSERTION234 8 16

UHDM

(ECCV2022) This is the official PyTorch implementation of ECCV2022 paper: Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing

Language:PythonApache-2.0201 6 33

Bamboo

Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning.

Language:Python166 5 8

imvotenet

ImVoteNet: Boosting 3D Object Detection in Point Clouds With Image Votes

Language:PythonMIT127 9 13

IST-Net

(ICCV2023) IST-Net: Prior-free Category-level Pose Estimation with Implicit Space Transformation

Language:PythonMIT107 4 18

CoDet

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Language:Python105 7 17

SlotCon

(NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping

Language:PythonApache-2.095 3 10

SPS-Conv

(NeurlPS 2022) Spatial Pruned Sparse Convolution for Efficient 3D Object Detection

Language:PythonApache-2.062 4 5

FS3D

(NeurlPS 2022) Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection

Language:PythonApache-2.055 5 10

datacentric.vlp

Compress conventional Vision-Language Pre-training data

Language:Python49 9 5

DODA

(ECCV 2022) DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation

Language:PythonApache-2.046 4 11

ResKD

[NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".

Language:PythonApache-2.032 4 1