hzhang57's repositories


AS-MLP

This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

awesome-attention-mechanism-in-cv

Attention modules commonly used in CV; plug-and-play modules; ViT models. A PyTorch implementation collection of attention modules and plug-and-play modules.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

awesome-hand-pose-estimation

Awesome work on hand pose estimation/tracking

Language: Python, Stargazers: 0, Issues: 1, Issues: 0

CAA

CAA: Channelized Axial Attention for Semantic Segmentation

Stargazers: 0, Issues: 0, Issues: 0

CMT_CNN-meet-Vision-Transformer

A PyTorch implementation of CMT, based on the paper "CMT: Convolutional Neural Networks Meet Vision Transformers".

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

Compact-Transformers

[Preprint] Escaping the Big Data Paradigm with Compact Transformers, 2021

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

ConvNeXt

Code release for ConvNeXt model

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

Convolutional-MLPs

[Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

deepvecfont

[SIGGRAPH Asia 2021] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

DynamicViT

[NeurIPS 2021] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

ffcv

FFCV: Fast Forward Computer Vision (and other ML workloads!)

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

GFNet

[NeurIPS 2021] Global Filter Networks for Image Classification

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

GrabNet

GrabNet: A generative model to generate realistic 3D hands grasping unseen objects (ECCV 2020)

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

how-do-vits-work

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

LLVIP

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision

Stargazers: 0, Issues: 0, Issues: 0

MotionSqueeze

Official PyTorch Implementation of MotionSqueeze, ECCV 2020

License: BSD-2-Clause, Stargazers: 0, Issues: 0, Issues: 0

MoViNet-pytorch

MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

MSG-Transformer

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

PASS

The PASS dataset: pretrained models and how to get the data

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

poster_template

Some academic posters as references. May we have in-person poster sessions soon!

Stargazers: 0, Issues: 0, Issues: 0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

temporal-adaptive-module

TAM: Temporal Adaptive Module for Video Recognition

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

vidaug

Effective Video Augmentation Techniques for Training Convolutional Neural Networks

License: MIT, Stargazers: 0, Issues: 0, Issues: 0