hzhang57

followers

following

stars

https://hzhang57.github.io/

hzhang57's repositories

action-recognition-pytorch

This is the pytorch implementation of some representative action recognition approaches including I3D, S3D, TSN and TAM.

Apache-2.0000

Adabelief-Optimizer

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

000

Awesome-CV-Team

国内外优秀的计算机视觉团队汇总，极市团队整理

000

awesome-productivity-cn

绝妙的个人生产力（Awesome Productivity 中文版）

CC0-1.0000

awesome-transformer-for-vision

000

Awesome-Video-Datasets

Video datasets

000

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

000

China_House

**买房相关资料和项目整理，方便查看，持续更新中...

000

CMT-Convolutional-NN-Meets-ViT

Pytorch unofficial implementation of CMT

MIT000

ContactPose

Large dataset of hand-object contact, hand- and object-pose, and 2.9 M RGB-D grasp images.

MIT000

convit

Code for the Convolutional Vision Transformer (ConViT)

NOASSERTION000

CorrNet

Unofficial implementation of paper "Video Modeling with Correlation Networks"

Apache-2.0000

CVPR_Template

000

CvT

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

MIT000

discourse

A platform for community discussion. Free, open, simple.

GPL-2.0000

dolphins-recognition-challenge

Dolphin recognition challenge

Language:Jupyter NotebookApache-2.0010

hamburger-pytorch

Pytorch implementation of the hamburger module from the ICLR 2020 paper "Is Attention Better Than Matrix Decomposition"

MIT000

ImageNet21K

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(2021) paper

MIT000

kinetics-dataset

000

MANO

A PyTorch Implementation of MANO hand model.

NOASSERTION000

MiCT-Net-PyTorch

Video Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone

Apache-2.0000

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

NOASSERTION000

ResnetGPT

用Resnet101+GPT搭建一个玩王者荣耀的AI

000

ResT

This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".

Apache-2.0000

T2T-ViT

NOASSERTION000

TokShift-Transformer

000

UGATIT

Official Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)

MIT000

video2bvh

Extracts human motion in video and save it as bvh mocap file.

MIT000

vision-longformer

MIT000

ViT-pytorch

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

MIT000