1170300714's starred repositories

TransNet

TransNet: A deep network for fast detection of common shot transitions

Language:Jupyter NotebookStargazers:52Issues:0Issues:0

svt

Official repository for "Self-Supervised Video Transformer" (CVPR'22)

Language:PythonLicense:MITStargazers:100Issues:0Issues:0

DIKI

[ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

Language:PythonStargazers:12Issues:0Issues:0

wesam

[CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation"

Language:PythonLicense:MITStargazers:107Issues:0Issues:0

SAMed

The implementation of the technical report: "Customized Segment Anything Model for Medical Image Segmentation"

Language:PythonLicense:MITStargazers:438Issues:0Issues:0

sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Language:PythonLicense:Apache-2.0Stargazers:3548Issues:0Issues:0

pascal-part-py

Helpers in python to manipulate the annotations of the Pascal-Part dataset.

Language:PythonStargazers:30Issues:0Issues:0

semantic-PASCAL-Part

This is a curated semantic version of the PASCAL-Part dataset for part-based object detection. Objects are aligned with WordNet and Yago concepts. The dataset is both in PASCAL-Voc and RDF format.

Language:PythonStargazers:7Issues:0Issues:0
Language:PythonStargazers:14Issues:0Issues:0

PromptKD

[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"

Language:PythonLicense:Apache-2.0Stargazers:154Issues:0Issues:0

Endo-FM

[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train

License:Apache-2.0Stargazers:185Issues:0Issues:0

Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Language:PythonStargazers:2125Issues:0Issues:0

TransNetV2

TransNet V2: Shot Boundary Detection Neural Network

Language:PythonLicense:MITStargazers:391Issues:0Issues:0

DIGA

Official implementation for Dynamically Instance-Guided Adaptation: A Backward-free Approach for Test-Time Domain Adaptive Semantic Segmentation

Language:PythonStargazers:7Issues:0Issues:0

BHYG

B站 会员购 抢票 脚本

Stargazers:582Issues:0Issues:0

tensorboardX

tensorboard for pytorch (and chainer, mxnet, numpy, ...)

Language:PythonLicense:MITStargazers:7833Issues:0Issues:0

FSCILSS

Few-shot Class-Incremental Semantic Segmentation via Pseudo-Labeling and Knowledge Distillation

Language:PythonLicense:NOASSERTIONStargazers:6Issues:0Issues:0

FSS

Prototype-based Incremental Few-Shot Semantic Segmentation

Language:Jupyter NotebookLicense:MITStargazers:30Issues:0Issues:0

ovsam

[arXiv preprint] The official code of paper "Open-Vocabulary SAM".

Language:PythonLicense:NOASSERTIONStargazers:712Issues:0Issues:0

TF-Cholec80

Library packaging the Cholec80 dataset for easy handling with Tensorflow.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:22Issues:0Issues:0

EAD2019

Github pages

Language:PythonLicense:MITStargazers:60Issues:0Issues:0

M3D

M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models

Language:PythonLicense:MITStargazers:128Issues:0Issues:0

Sinus-Surgery-Endoscopic-Image-Datasets

Novel image segmentation datasets collected from endoscopic videos of sinus surgery processes

Stargazers:10Issues:0Issues:0

labeled-images-for-ulcerative-colitis

Codes to process and train LIMUC dataset

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

EndoSLAM

EndoSLAM Dataset and an Unsupervised Monocular Visual Odometry and Depth Estimation Approach for Endoscopic Videos: Endo-SfMLearner

Language:PythonLicense:MITStargazers:213Issues:0Issues:0

weakly-polyp

[MICCAI'22] Contrastive Transformer-based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection.

Language:PythonLicense:NOASSERTIONStargazers:38Issues:0Issues:0
Language:PythonLicense:MITStargazers:25Issues:0Issues:0

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1270Issues:0Issues:0

PriViLege

[CVPR 2024] PriViLege: Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

EPI

Code for the paper: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation

Language:PythonLicense:MITStargazers:11Issues:0Issues:0