royee182

Yiting Cheng's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.046011 303 658

deep-visualization-toolbox

DeepVis Toolbox

Language:PythonMIT4005 173 139

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonNOASSERTION1286 16 118

MulimgViewer

MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.

Language:PythonGPL-3.01081 10 56

clipseg

This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".

Language:PythonNOASSERTION1080 13 54

Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Language:PythonNOASSERTION1040 23 53

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonMIT825 12 110

DeCLIP

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Language:Python622 19 29

DenseCLIP

[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Language:Python505 3 53

Seg-Uncertainty

IJCAI2020 & IJCV2021 :city_sunrise: Unsupervised Scene Adaptation with Memory Regularization in vivo

Language:PythonMIT386 13 23

SLRT

Language:Python229 5 59

I2P-MAE

[CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders

Language:Python210 17 11

3D-MiniNet

Official Implementation in Pytorch and Tensorflow of 3D-MiniNet: Learning a 2D Representation from Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation

Language:Python177 14 10

SLidR

Official PyTorch implementation of "Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data"

Language:PythonNOASSERTION175 10 34

X-CLIP

An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"

Language:PythonMIT125 2 6

datasets

TFDS data loaders for sign language datasets.

Language:Python79 6 44

xmuda_journal

[TPAMI] Cross-modal Learning for Domain Adaptation in 3D Semantic Segmentation

Language:PythonNOASSERTION28 4 1