Yi Zhu's repositories
two-stream-pytorch
PyTorch implementation of two-stream networks for video action recognition
Hidden-Two-Stream
Caffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"
Video-Tutorial-CVPR2020
A Comprehensive Tutorial on Video Modeling
paper-reading
深度学习经典、新论文逐段精读
semantic-segmentation
Improving Semantic Segmentation via Video Propagation and Label Relaxation
Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
bark
🔊 Text-Prompted Generative Audio Model
bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
blog
MXNet Blog in Chinese
deit
Official DeiT repository
detectron2
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
digital_video_introduction
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding).
PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
web-data
The repo to host all the web data including images for documents in dmlc projects.