Manxi Lin's starred repositories
segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
InternImage
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
awesome_LLMs_interview_notes
LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineer roles
Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
SurgicalDINO
[IPCAI'2024 (IJCARS special issue)] Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery
Mask2Former_DINOv2
Replaces the Mask2Former backbone with a ViT model pretrained via DINOv2
CLIP-spurious-finetune
Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)
OutlierDetectionChallenge2024
Outlier detection challenge 2024 - a DTU Compute summer school challenge
act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN