bryanyzhu

Amazon AI

SF Bay Area

https://bryanyzhu.github.io/

Yi Zhu's repositories

two-stream-pytorch

PyTorch implementation of two-stream networks for video action recognition

Language: Python; License: MIT; 564 17 53

Hidden-Two-Stream

Caffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"

Language: C++; License: NOASSERTION; 194 17 45

Video-Tutorial-CVPR2020

A Comprehensive Tutorial on Video Modeling

Language: Jupyter Notebook; 65 5 1

GuidedNet

Caffe implementation for "Guided Optical Flow Learning"

Language: C++; License: MIT; 32 5 1

deepOF

TensorFlow implementation for "Guided Optical Flow Learning"

Language: Python; 25 8 6

paper-reading

Paragraph-by-paragraph close reading of classic and new deep learning papers

License: Apache-2.0; 21 20

autogluon

AutoGluon: AutoML for Image, Text, and Tabular Data

Language: Python; License: Apache-2.0; 2 10

semantic-segmentation

Improving Semantic Segmentation via Video Propagation and Label Relaxation

Language: Python; License: NOASSERTION; 1 20

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language: Python; License: Apache-2.0; 1 20

tiny-ucf101

1 10

Video-Swin-Transformer

This is an official implementation for "Video Swin Transformers".

Language: Python; License: Apache-2.0; 1 10

ViLT

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Language: Python; License: Apache-2.0; 1 10

bark

🔊 Text-Prompted Generative Audio Model

License: MIT; 000

bigdetection

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Language: Python; License: Apache-2.0; 010

blog

MXNet Blog in Chinese

Language: HTML; 010

CorrFlow

Self-supervised Learning for Video Correspondence Flow (BMVC 2019)

Language: Python; 020

deit

Official DeiT repository

Language: Python; License: Apache-2.0; 010

detectron2

Detectron2 is FAIR's next-generation platform for object detection and segmentation.

Language: Python; License: Apache-2.0; 010

Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Language: Python; License: Apache-2.0; 010

digital_video_introduction

A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding).

Language: Jupyter Notebook; License: BSD-3-Clause; 020

gluon-cv

Gluon CV Toolkit

Language: Python; License: Apache-2.0; 020

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Language: Python; License: Apache-2.0; 010

ResNeSt

ResNeSt: Split-Attention Network

Language: Python; License: Apache-2.0; 020

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Language: Python; License: Apache-2.0; 000

web-data

The repo to host all the web data including images for documents in dmlc projects.

Language: Jupyter Notebook; License: Apache-2.0; 010