Yiting Cheng's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:46011Issues:303Issues:658
Language:PythonLicense:MITStargazers:4005Issues:173Issues:139

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1286Issues:16Issues:118

MulimgViewer

MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.

Language:PythonLicense:GPL-3.0Stargazers:1081Issues:10Issues:56

clipseg

This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".

Language:PythonLicense:NOASSERTIONStargazers:1080Issues:13Issues:54

Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:1040Issues:23Issues:53

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonLicense:MITStargazers:825Issues:12Issues:110

DeCLIP

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

DenseCLIP

[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Seg-Uncertainty

IJCAI2020 & IJCV2021 :city_sunrise: Unsupervised Scene Adaptation with Memory Regularization in vivo

Language:PythonLicense:MITStargazers:386Issues:13Issues:23

I2P-MAE

[CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders

3D-MiniNet

Official Implementation in Pytorch and Tensorflow of 3D-MiniNet: Learning a 2D Representation from Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation

SLidR

Official PyTorch implementation of "Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data"

Language:PythonLicense:NOASSERTIONStargazers:175Issues:10Issues:34

X-CLIP

An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"

Language:PythonLicense:MITStargazers:125Issues:2Issues:6

datasets

TFDS data loaders for sign language datasets.

xmuda_journal

[TPAMI] Cross-modal Learning for Domain Adaptation in 3D Semantic Segmentation

Language:PythonLicense:NOASSERTIONStargazers:28Issues:4Issues:1