yangmin09 (feymanpriv)

feymanpriv

Geek Repo

Company:BUPT

Location:Beijing

Github PK Tool:Github PK Tool

yangmin09's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:128424Issues:1099Issues:15113

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:24894Issues:703Issues:0

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:5620Issues:33Issues:130

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:3425Issues:30Issues:250

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language:PythonLicense:MITStargazers:2130Issues:30Issues:104

GLIP

Grounded Language-Image Pre-training

Language:PythonLicense:MITStargazers:2052Issues:45Issues:168

detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:1895Issues:26Issues:155

fastdup

fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.

Language:PythonLicense:NOASSERTIONStargazers:1508Issues:22Issues:237

Video-Pre-Training

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Language:PythonLicense:MITStargazers:1241Issues:28Issues:31

VideoX

VideoX: a collection of video cross-modal models

Language:PythonLicense:NOASSERTIONStargazers:942Issues:22Issues:110

SimMIM

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Language:PythonLicense:MITStargazers:887Issues:22Issues:41

cv-arxiv-daily

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Language:PythonLicense:Apache-2.0Stargazers:804Issues:37Issues:2

FastestDet

:zap: A newly designed ultra lightweight anchor free target detection algorithm, weight only 250K parameters, reduces the time consumption by 10% compared with yolo-fastest, and the post-processing is simpler

Language:PythonLicense:BSD-3-ClauseStargazers:747Issues:12Issues:38

self_supervised

A Pytorch-Lightning implementation of self-supervised algorithms

Language:PythonLicense:MITStargazers:523Issues:12Issues:13

XPretrain

Multi-modality pre-training

Language:PythonLicense:NOASSERTIONStargazers:451Issues:14Issues:35

QuadTreeAttention

QuadTree Attention for Vision Transformers (ICLR2022)

Language:Jupyter NotebookStargazers:329Issues:11Issues:29

ovr-cnn

A new framework for open-vocabulary object detection, based on maskrcnn-benchmark

Language:PythonLicense:MITStargazers:215Issues:5Issues:28

knowhere

Knowhere is an open-source vector search engine, integrating FAISS, HNSW, etc.

Language:C++License:Apache-2.0Stargazers:201Issues:14Issues:199

Text4Vis

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

Language:PythonLicense:MITStargazers:198Issues:6Issues:23

learning_minimal

Learning to Solve Hard Minimal Problems

Language:C++License:NOASSERTIONStargazers:141Issues:6Issues:5

MCQ

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

MUST

PyTorch code for MUST

Language:PythonLicense:BSD-3-ClauseStargazers:103Issues:6Issues:10

BootMAE

ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining

everything_at_once

This is the official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022

CLIP4CirDemo

[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features

LAVENDER

A Unified Framework for Video-Language Understanding

Language:PythonLicense:MITStargazers:55Issues:16Issues:7

met

A large-scale dataset for instance-level recognition for artworks is introduced.

Language:PythonLicense:MITStargazers:46Issues:3Issues:1

Universal-Transformer

Training Google Universal Image Embedding

Language:PythonLicense:MITStargazers:1Issues:1Issues:0