hzhang57's repositories
action-recognition-pytorch
This is the pytorch implementation of some representative action recognition approaches including I3D, S3D, TSN and TAM.
Adabelief-Optimizer
Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"
Awesome-CV-Team
国内外优秀的计算机视觉团队汇总,极市团队整理
awesome-productivity-cn
绝妙的个人生产力(Awesome Productivity 中文版)
Awesome-Video-Datasets
Video datasets
Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
China_House
**买房相关资料和项目整理,方便查看,持续更新中...
CMT-Convolutional-NN-Meets-ViT
Pytorch unofficial implementation of CMT
ContactPose
Large dataset of hand-object contact, hand- and object-pose, and 2.9 M RGB-D grasp images.
convit
Code for the Convolutional Vision Transformer (ConViT)
CorrNet
Unofficial implementation of paper "Video Modeling with Correlation Networks"
CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
discourse
A platform for community discussion. Free, open, simple.
dolphins-recognition-challenge
Dolphin recognition challenge
hamburger-pytorch
Pytorch implementation of the hamburger module from the ICLR 2020 paper "Is Attention Better Than Matrix Decomposition"
ImageNet21K
Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(2021) paper
MANO
A PyTorch Implementation of MANO hand model.
MiCT-Net-PyTorch
Video Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
ResnetGPT
用Resnet101+GPT搭建一个玩王者荣耀的AI
ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
UGATIT
Official Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)
video2bvh
Extracts human motion in video and save it as bvh mocap file.
ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)