bearcatt

followers

following

stars

Organizations

HRNet

bearcatt's starred repositories

SeqFormer

SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)

Language:PythonNOASSERTION34400

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonMIT248200

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Language:Python259100

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.03169500

METER

METER: A Multimodal End-to-end TransformER Framework

Language:PythonMIT36100

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonNOASSERTION152000

TOOD

TOOD: Task-aligned One-stage Object Detection, ICCV2021 Oral

Language:PythonApache-2.031500

nginx

The official NGINX Open Source repository.

Language:CBSD-2-Clause2467600

powerful-benchmarker

A library for ML benchmarking. It's powerful.

Language:Jupyter Notebook42700

mdetr

Language:PythonApache-2.096700

imgaug

Image augmentation for machine learning experiments.

Language:PythonMIT1435700

PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Language:PythonApache-2.01264800

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

injector

Python dependency injection framework, inspired by Guice

Language:PythonBSD-3-Clause129900

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)

Language:PythonNOASSERTION134100

pytorchvideo

A deep learning library for video understanding research.

Language:PythonApache-2.0329300

SegFormer

Official PyTorch implementation of SegFormer

Language:PythonNOASSERTION250400

CoCosNet-v2

CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

Language:PythonMIT33800

merlot

MERLOT: Multimodal Neural Script Knowledge Models

Language:PythonMIT22300

HC-STVG

The HC-STVG Dataset

Language:Python5300

deit

Official DeiT repository

Language:PythonApache-2.0402400

Lite-HRNet

This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.

Language:PythonApache-2.082600

StyleCLIP

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

Language:HTMLMIT397100

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonMIT1368200

PanopticFCN

Fully Convolutional Networks for Panoptic Segmentation (CVPR2021 Oral)

Language:PythonApache-2.039300

involution

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator

Language:PythonMIT130900

CenterNet2

Two-stage CenterNet

Language:PythonApache-2.0120400

TPN

[CVPR 2020] Temporal Pyramid Network for Action Recognition

Language:PythonApache-2.039300

VisTR

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Language:PythonApache-2.073900

DeepFashion_Try_On

Official code for "Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content"，CVPR‘20 https://arxiv.org/abs/2003.05863

Language:Python81900