bearcatt

bearcatt

Geek Repo

Github PK Tool:Github PK Tool


Organizations
HRNet

bearcatt's starred repositories

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:31291Issues:309Issues:902

nginx

The official NGINX Open Source repository.

Language:CLicense:BSD-2-ClauseStargazers:21176Issues:991Issues:1

imgaug

Image augmentation for machine learning experiments.

Language:PythonLicense:MITStargazers:14325Issues:231Issues:515

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:13525Issues:127Issues:309

PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Language:PythonLicense:Apache-2.0Stargazers:12548Issues:197Issues:5384

deit

Official DeiT repository

Language:PythonLicense:Apache-2.0Stargazers:3990Issues:48Issues:197

StyleCLIP

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

Language:HTMLLicense:MITStargazers:3949Issues:75Issues:118

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

pytorchvideo

A deep learning library for video understanding research.

Language:PythonLicense:Apache-2.0Stargazers:3268Issues:157Issues:180

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

SegFormer

Official PyTorch implementation of SegFormer

Language:PythonLicense:NOASSERTIONStargazers:2458Issues:31Issues:150

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonLicense:MITStargazers:2433Issues:28Issues:230

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonLicense:NOASSERTIONStargazers:1506Issues:27Issues:128

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)

Language:PythonLicense:NOASSERTIONStargazers:1329Issues:24Issues:69

involution

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator

Language:PythonLicense:MITStargazers:1307Issues:15Issues:58

injector

Python dependency injection framework, inspired by Guice

Language:PythonLicense:BSD-3-ClauseStargazers:1287Issues:14Issues:144

CenterNet2

Two-stage CenterNet

Language:PythonLicense:Apache-2.0Stargazers:1201Issues:20Issues:87
Language:PythonLicense:Apache-2.0Stargazers:960Issues:19Issues:97

Lite-HRNet

This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.

Language:PythonLicense:Apache-2.0Stargazers:817Issues:19Issues:91

DeepFashion_Try_On

Official code for "Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content",CVPR‘20 https://arxiv.org/abs/2003.05863

VisTR

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Language:PythonLicense:Apache-2.0Stargazers:738Issues:11Issues:78

powerful-benchmarker

A library for ML benchmarking. It's powerful.

Language:Jupyter NotebookStargazers:426Issues:10Issues:96

TPN

[CVPR 2020] Temporal Pyramid Network for Action Recognition

Language:PythonLicense:Apache-2.0Stargazers:392Issues:15Issues:42

PanopticFCN

Fully Convolutional Networks for Panoptic Segmentation (CVPR2021 Oral)

Language:PythonLicense:Apache-2.0Stargazers:392Issues:8Issues:50

METER

METER: A Multimodal End-to-end TransformER Framework

Language:PythonLicense:MITStargazers:358Issues:6Issues:36

SeqFormer

SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)

Language:PythonLicense:NOASSERTIONStargazers:341Issues:7Issues:24

CoCosNet-v2

CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

Language:PythonLicense:MITStargazers:337Issues:18Issues:27

TOOD

TOOD: Task-aligned One-stage Object Detection, ICCV2021 Oral

Language:PythonLicense:Apache-2.0Stargazers:314Issues:7Issues:26

merlot

MERLOT: Multimodal Neural Script Knowledge Models

Language:PythonLicense:MITStargazers:224Issues:14Issues:18

HC-STVG

The HC-STVG Dataset