Shizhen Zhao's starred repositories

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:24964Issues:323Issues:394

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9908Issues:77Issues:475

VoxelNeXt

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

Language:PythonLicense:Apache-2.0Stargazers:716Issues:8Issues:64

diffusion-point-cloud

:thought_balloon: Diffusion Probabilistic Models for 3D Point Cloud Generation (CVPR 2021)

Language:PythonLicense:MITStargazers:634Issues:8Issues:38

LOST

Pytorch implementation of LOST unsupervised object discovery method

Language:PythonLicense:NOASSERTIONStargazers:234Issues:8Issues:16

UHDM

(ECCV2022) This is the official PyTorch implementation of ECCV2022 paper: Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing

Language:PythonLicense:Apache-2.0Stargazers:201Issues:6Issues:33

Bamboo

Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning.

imvotenet

ImVoteNet: Boosting 3D Object Detection in Point Clouds With Image Votes

Language:PythonLicense:MITStargazers:127Issues:9Issues:13

IST-Net

(ICCV2023) IST-Net: Prior-free Category-level Pose Estimation with Implicit Space Transformation

Language:PythonLicense:MITStargazers:107Issues:4Issues:18

CoDet

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

SlotCon

(NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping

Language:PythonLicense:Apache-2.0Stargazers:95Issues:3Issues:10

SPS-Conv

(NeurlPS 2022) Spatial Pruned Sparse Convolution for Efficient 3D Object Detection

Language:PythonLicense:Apache-2.0Stargazers:62Issues:4Issues:5

FS3D

(NeurlPS 2022) Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection

Language:PythonLicense:Apache-2.0Stargazers:55Issues:5Issues:10

datacentric.vlp

Compress conventional Vision-Language Pre-training data

DODA

(ECCV 2022) DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation

Language:PythonLicense:Apache-2.0Stargazers:46Issues:4Issues:11

ResKD

[NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".

Language:PythonLicense:Apache-2.0Stargazers:32Issues:4Issues:1