Beast code in Giters

yangmin09's starred repositories

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonMIT13232 127 303

leetcode

Provide all my solutions and explanations in Chinese for all the Leetcode coding problems.

6138 242 2013

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Language:PythonApache-2.06012 68 243

Chinese-Text-Classification-Pytorch

中文文本分类，TextCNN，TextRNN，FastText，TextRCNN，BiLSTM_Attention，DPCNN，Transformer，基于pytorch，开箱即用。

Language:PythonMIT5151 36 117

AugLy

A data augmentations library for audio, image, text, and video.

Language:PythonNOASSERTION4919 74 74

Bert-Chinese-Text-Classification-Pytorch

使用Bert，ERNIE，进行中文文本分类

Language:PythonMIT3818 20 186

fast-reid

SOTA Re-identification Methods and Toolbox

Language:PythonApache-2.03318 58 625

vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Language:Jupyter NotebookMIT3233 54 173

vearch

Distributed vector search for AI-native applications

Language:GoApache-2.01963 76 572

CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Language:PythonApache-2.01624 56 63

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonNOASSERTION1463 28 127

moco-v3

PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057

Language:PythonNOASSERTION1172 18 34

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language:PythonNOASSERTION1015 35 62

volo

VOLO: Vision Outlooker for Visual Recognition

Language:Jupyter NotebookApache-2.0919 21 43

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonMIT810 12 109

Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Language:Jupyter NotebookMIT730 8 35

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language:PythonMIT693 9 58