Beast code in Giters

ZYK100's starred repositories

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language:Jupyter NotebookNOASSERTION165200

Image2Paragraph

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Language:PythonApache-2.078700

HumanBench

This repo is official implementation of HumanBench (CVPR2023)

Language:PythonMIT22900

LUPerson

Unsupervised Pre-training for Person Re-identification (LUPerson)

Language:Python24000

HAP

[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Language:Python3800

VTF_PAR

[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition

Language:PythonMIT2300

HLC

ICCV'2023: Holistic Label Correction for Noisy Multi-Label Classification

Language:Python1000

DiJiang

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.

Language:Python9400

CurricularFace

CurricularFace(CVPR2020)

Language:PythonMIT52600

OKNet

[AAAI2024] Omni-Kernel Network for Image Restoration

Language:PythonMIT3900

SHIKE

Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation (CVPR 2023)

Language:Python3400

Simba

A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series"

Language:PythonMIT2700

Pan-Mamba

Pan-Mamba: Effective Pan-Sharpening with State Space Model

Language:Python7100

Conf-MPU-DS-NER

Code for "Distantly Supervised Named Entity Recognition via Confidence-Based Multi-Class Positive and Unlabeled Learning" published at ACL 2022

Language:Python2300

VMamba

VMamba: Visual State Space Models，code is based on mamba

Language:PythonMIT206600

graph-transformer-pytorch

Implementation of Graph Transformer in Pytorch, for potential use in replicating Alphafold2

Language:PythonMIT19700

CALR

Language:PythonMIT700

OpenGraph

[EMNLP'2024] "OpenGraph: Towards Open Graph Foundation Models"

Language:PythonApache-2.021000

DiffMIC

[MICCAI 2023] DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification

Language:Python13300

DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language:PythonNOASSERTION207000

RetNet

An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"

Language:PythonMIT116000

RMT

(CVPR2024)RMT: Retentive Networks Meet Vision Transformer

Language:Python27400

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonApache-2.0255900

TokenLabeling

Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"

Language:Jupyter NotebookApache-2.042600

Neighborhood-Attention-Transformer

Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

Language:PythonMIT104400

CASE

Accepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach

Language:Python10300

GUR

The Implementation of our ICCV 2023 paper: Towards Grand Unified Representation Learning for Unsupervised Visible-Infrared Person Re-Identification.

Language:PythonMIT1900

Instruct-ReID

A General-purpose Person Re-identification Task with Instructions

Language:Python10800

Graph-Optimal-Transport

Code for ICML 2020 "Graph Optimal Transport for Cross-Domain Alignment"

Language:PythonMIT15300

volo

VOLO: Vision Outlooker for Visual Recognition

Language:Jupyter NotebookApache-2.092300