bearcatt

followers

following

stars

Organizations

HRNet

bearcatt's starred repositories

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.031291 309 902

nginx

The official NGINX Open Source repository.

Language:CBSD-2-Clause21176 991 1

imgaug

Image augmentation for machine learning experiments.

Language:PythonMIT14325 231 515

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonMIT13525 127 309

PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Language:PythonApache-2.012548 197 5384

deit

Official DeiT repository

Language:PythonApache-2.03990 48 197

StyleCLIP

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

Language:HTMLMIT3949 75 118

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

pytorchvideo

A deep learning library for video understanding research.

Language:PythonApache-2.03268 157 180

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Language:Python2578 24 96

SegFormer

Official PyTorch implementation of SegFormer

Language:PythonNOASSERTION2458 31 150

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonMIT2433 28 230

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonNOASSERTION1506 27 128

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)

Language:PythonNOASSERTION1329 24 69

involution

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator

Language:PythonMIT1307 15 58

injector

Python dependency injection framework, inspired by Guice

Language:PythonBSD-3-Clause1287 14 144

CenterNet2

Two-stage CenterNet

Language:PythonApache-2.01201 20 87

mdetr

Language:PythonApache-2.0960 19 97

Lite-HRNet

This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.

Language:PythonApache-2.0817 19 91

DeepFashion_Try_On

Official code for "Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content"，CVPR‘20 https://arxiv.org/abs/2003.05863

Language:Python814 33 98

VisTR

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Language:PythonApache-2.0738 11 78

powerful-benchmarker

A library for ML benchmarking. It's powerful.

Language:Jupyter Notebook426 10 96

TPN

[CVPR 2020] Temporal Pyramid Network for Action Recognition

Language:PythonApache-2.0392 15 42

PanopticFCN

Fully Convolutional Networks for Panoptic Segmentation (CVPR2021 Oral)

Language:PythonApache-2.0392 8 50

METER

METER: A Multimodal End-to-end TransformER Framework

Language:PythonMIT358 6 36

SeqFormer

SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)

Language:PythonNOASSERTION341 7 24

CoCosNet-v2

CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

Language:PythonMIT337 18 27

TOOD

TOOD: Task-aligned One-stage Object Detection, ICCV2021 Oral

Language:PythonApache-2.0314 7 26

merlot

MERLOT: Multimodal Neural Script Knowledge Models

Language:PythonMIT224 14 18

HC-STVG

The HC-STVG Dataset

Language:Python53 4 22