ZYK100's starred repositories

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1652 | Issues: 0

Image2Paragraph

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Language: Python | License: Apache-2.0 | Stargazers: 787 | Issues: 0

HumanBench

This repo is the official implementation of HumanBench (CVPR 2023)

Language: Python | License: MIT | Stargazers: 229 | Issues: 0

LUPerson

Unsupervised Pre-training for Person Re-identification (LUPerson)

Language: Python | Stargazers: 240 | Issues: 0

HAP

[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Language: Python | Stargazers: 38 | Issues: 0

VTF_PAR

[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition

Language: Python | License: MIT | Stargazers: 23 | Issues: 0

HLC

ICCV'2023: Holistic Label Correction for Noisy Multi-Label Classification

Language: Python | Stargazers: 10 | Issues: 0

DiJiang

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.

Language: Python | Stargazers: 94 | Issues: 0

CurricularFace

CurricularFace (CVPR 2020)

Language: Python | License: MIT | Stargazers: 526 | Issues: 0

OKNet

[AAAI2024] Omni-Kernel Network for Image Restoration

Language: Python | License: MIT | Stargazers: 39 | Issues: 0

SHIKE

Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation (CVPR 2023)

Language: Python | Stargazers: 34 | Issues: 0

Simba

A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series"

Language: Python | License: MIT | Stargazers: 27 | Issues: 0

Pan-Mamba

Pan-Mamba: Effective Pan-Sharpening with State Space Model

Language: Python | Stargazers: 71 | Issues: 0

Conf-MPU-DS-NER

Code for "Distantly Supervised Named Entity Recognition via Confidence-Based Multi-Class Positive and Unlabeled Learning" published at ACL 2022

Language: Python | Stargazers: 23 | Issues: 0

VMamba

VMamba: Visual State Space Models; code is based on Mamba

Language: Python | License: MIT | Stargazers: 2066 | Issues: 0

graph-transformer-pytorch

Implementation of Graph Transformer in Pytorch, for potential use in replicating Alphafold2

Language: Python | License: MIT | Stargazers: 197 | Issues: 0

OpenGraph

[EMNLP'2024] "OpenGraph: Towards Open Graph Foundation Models"

Language: Python | License: Apache-2.0 | Stargazers: 210 | Issues: 0

DiffMIC

[MICCAI 2023] DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification

Language: Python | Stargazers: 133 | Issues: 0

DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language: Python | License: NOASSERTION | Stargazers: 2070 | Issues: 0

RetNet

An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"

Language: Python | License: MIT | Stargazers: 1160 | Issues: 0

RMT

(CVPR 2024) RMT: Retentive Networks Meet Vision Transformer

Language: Python | Stargazers: 274 | Issues: 0

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 2559 | Issues: 0

TokenLabeling

Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 426 | Issues: 0

Neighborhood-Attention-Transformer

Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

Language: Python | License: MIT | Stargazers: 1044 | Issues: 0

CASE

Accepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach

Language: Python | Stargazers: 103 | Issues: 0

GUR

The Implementation of our ICCV 2023 paper: Towards Grand Unified Representation Learning for Unsupervised Visible-Infrared Person Re-Identification.

Language: Python | License: MIT | Stargazers: 19 | Issues: 0

Instruct-ReID

A General-purpose Person Re-identification Task with Instructions

Language: Python | Stargazers: 108 | Issues: 0

Graph-Optimal-Transport

Code for ICML 2020 "Graph Optimal Transport for Cross-Domain Alignment"

Language: Python | License: MIT | Stargazers: 153 | Issues: 0

volo

VOLO: Vision Outlooker for Visual Recognition

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 923 | Issues: 0