yangmin09 (feymanpriv)

feymanpriv

Geek Repo

Company:BUPT

Location:Beijing

Github PK Tool:Github PK Tool

yangmin09's starred repositories

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:13232Issues:127Issues:303

leetcode

Provide all my solutions and explanations in Chinese for all the Leetcode coding problems.

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Language:PythonLicense:Apache-2.0Stargazers:6012Issues:68Issues:243

Chinese-Text-Classification-Pytorch

中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。

Language:PythonLicense:MITStargazers:5151Issues:36Issues:117

AugLy

A data augmentations library for audio, image, text, and video.

Language:PythonLicense:NOASSERTIONStargazers:4919Issues:74Issues:74

Bert-Chinese-Text-Classification-Pytorch

使用Bert,ERNIE,进行中文文本分类

Language:PythonLicense:MITStargazers:3818Issues:20Issues:186

fast-reid

SOTA Re-identification Methods and Toolbox

Language:PythonLicense:Apache-2.0Stargazers:3318Issues:58Issues:625

vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Language:Jupyter NotebookLicense:MITStargazers:3233Issues:54Issues:173

vearch

Distributed vector search for AI-native applications

Language:GoLicense:Apache-2.0Stargazers:1963Issues:76Issues:572

CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Language:PythonLicense:Apache-2.0Stargazers:1624Issues:56Issues:63

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonLicense:NOASSERTIONStargazers:1463Issues:28Issues:127

moco-v3

PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057

Language:PythonLicense:NOASSERTIONStargazers:1172Issues:18Issues:34

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language:PythonLicense:NOASSERTIONStargazers:1015Issues:35Issues:62

volo

VOLO: Vision Outlooker for Visual Recognition

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:919Issues:21Issues:43

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonLicense:MITStargazers:810Issues:12Issues:109

Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Language:Jupyter NotebookLicense:MITStargazers:730Issues:8Issues:35

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language:PythonLicense:MITStargazers:693Issues:9Issues:58

Transformer-SSL

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

Language:PythonLicense:MITStargazers:606Issues:6Issues:19

awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

COTR

Code release for "COTR: Correspondence Transformer for Matching Across Images"(ICCV 2021)

Language:PythonLicense:Apache-2.0Stargazers:449Issues:12Issues:47

esvit

EsViT: Efficient self-supervised Vision Transformers

Language:PythonLicense:MITStargazers:403Issues:12Issues:25

natural-language-joint-query-search

Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.

Language:Jupyter NotebookStargazers:202Issues:3Issues:4

isc2021

Code for the Image similarity challenge.

Language:PythonLicense:NOASSERTIONStargazers:193Issues:8Issues:4

cvt

CVT, a Computer Vision Toolkit.

InstanceLoc

[CVPR 2021] Instance Localization for Self-supervised Detection Pretraining

Language:PythonLicense:Apache-2.0Stargazers:144Issues:9Issues:22
Language:PythonLicense:NOASSERTIONStargazers:127Issues:17Issues:8

CIRR

Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

Semantic-Video-Retrieval

Code and benchmarks for the Semantic Video Retrieval Task

Language:PythonStargazers:8Issues:0Issues:0