Yuanhao Zhai's repositories
BMN-Boundary-Matching-Network
A pytorch-version implementation codes of paper: "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", which is accepted in ICCV 2019.
CGDL-for-Open-Set-Recognition
Code for CVPR2020 paper: Conditional Gaussian Distribution Learning for Open Set Recognition
DETAD
This repository is intended to host the diagnosis tool for analyzing temporal action localization algorithms. This tool is first presented as part of our DETAD paper.
depthstillation
Demo code for paper "Learning optical flow from still images", CVPR 2021.
detr
End-to-End Object Detection with Transformers
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
gpu-load-watcher
Simple script for watching GPU usage on both system-wide and per-user basis.
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
open_clip
An open source implementation of CLIP.
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
PoseFormerV2
The project is an official implementation of our paper "PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation".
ProdL
[Doc] Productive Deep Learner
SelfBlendedImages
[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376
SLADD
Official code for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection (CVPR 2022 oral)
sleek-beamer
LaTeX sleek beamer template
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
video-generation-survey
A reading list of video generation
video-to-pose3D
Convert video to 3D pose in one-key.
video_features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as I3D, R(2+1)D, VGGish, ResNet features.