Mark Peng's starred repositories

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:12046Issues:96Issues:1017

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8888Issues:77Issues:441

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8172Issues:94Issues:357

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:6057Issues:46Issues:169

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5726Issues:50Issues:139

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:5337Issues:36Issues:275
Language:PythonLicense:Apache-2.0Stargazers:5254Issues:77Issues:1814

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4407Issues:44Issues:120

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:4097Issues:46Issues:381

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4088Issues:112Issues:119

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Language:PythonLicense:Apache-2.0Stargazers:3000Issues:31Issues:223

pyGAT

Pytorch implementation of the Graph Attention Network model by Veličković et. al (2017, https://arxiv.org/abs/1710.10903)

Language:PythonLicense:MITStargazers:2822Issues:17Issues:71

ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

Language:PythonLicense:NOASSERTIONStargazers:2719Issues:55Issues:54

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonLicense:Apache-2.0Stargazers:2357Issues:21Issues:360

GLIP

Grounded Language-Image Pre-training

Language:PythonLicense:MITStargazers:2026Issues:45Issues:168

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1913Issues:23Issues:62

detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:1869Issues:26Issues:154

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookLicense:MITStargazers:1259Issues:17Issues:51

unlimiformer

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Language:PythonLicense:MITStargazers:1036Issues:22Issues:57

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Segment-Any-Anomaly

Official implementation of "Segment Any Anomaly without Training via Hybrid Prompt Regularization (SAA+)".

Language:Jupyter NotebookStargazers:671Issues:7Issues:28

linear-attention-transformer

Transformer based on a variant of attention that is linear complexity in respect to sequence length

Language:PythonLicense:MITStargazers:625Issues:12Issues:19

wise-ft

Robust fine-tuning of zero-shot models

Language:PythonLicense:NOASSERTIONStargazers:580Issues:6Issues:25

U-Mamba

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Language:PythonLicense:Apache-2.0Stargazers:538Issues:11Issues:42

Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Language:PythonLicense:MITStargazers:274Issues:2Issues:71

EEG-ATCNet

Attention temporal convolutional network for EEG-based motor imagery classification

Language:PythonLicense:Apache-2.0Stargazers:154Issues:3Issues:15

CA-TCC

[TPAMI 2023] Self-supervised Contrastive Representation Learning for Semi-supervised Time-Series Classification

LearnablePromptSAM

Try to use the SAM-ViT as the backbone to create the learnable prompt for semantic segmentation

Language:PythonLicense:Apache-2.0Stargazers:69Issues:3Issues:13

BGAD

Pytorch Implementation for CVPR2023 paper: Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection