yhZhai

User data from Github https://github.com/yhZhai

followers

following

stars

State University of New York at Buffalo

New York

Yuanhao Zhai's repositories

mcm

[NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

Language:PythonApache-2.060 4 12

idol

[ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Apache-2.053 11 2

WSCL

[ICCV 2023] Official implementation of paper "Towards Generic Image Manipulation Detection with Weakly-Supervised Self-Consistency Learning".

Language:PythonMIT29 2 14

ATOM

[ACM MM 2023] Official implementation of paper "Language-guided Human Motion Synthesis with Atomic Actions".

Language:PythonMIT28 1 1

SOAR

[ICCV 2023] Official implementation of paper "SOAR: Scene-debiasing Open-set Action Recognition".

Language:PythonApache-2.010 2 1

BMN-Boundary-Matching-Network

A pytorch-version implementation codes of paper: "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", which is accepted in ICCV 2019.

Language:PythonMIT000

CGDL-for-Open-Set-Recognition

Code for CVPR2020 paper: Conditional Gaussian Distribution Learning for Open Set Recognition

Language:Python000

DETAD

This repository is intended to host the diagnosis tool for analyzing temporal action localization algorithms. This tool is first presented as part of our DETAD paper.

Language:PythonMIT000

depthstillation

Demo code for paper "Learning optical flow from still images", CVPR 2021.

Language:PythonMIT000

detr

End-to-End Object Detection with Transformers

Language:PythonApache-2.0000

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonApache-2.0000

dotfiles

Language:Vim ScriptMIT000

EndoAssistant

010

gpu-load-watcher

Simple script for watching GPU usage on both system-wide and per-user basis.

Language:PythonMIT000

ICT_DeepFake

Language:Python000

mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language:PythonApache-2.0000

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookNOASSERTION000

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.0000

PoseFormerV2

The project is an official implementation of our paper "PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation".

Language:Python000

ProdL

[Doc] Productive Deep Learner

Language:TeX000

RAFT

Language:PythonBSD-3-Clause000

SelfBlendedImages

[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376

Language:PythonNOASSERTION000

SLADD

Official code for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection (CVPR 2022 oral)

Language:PythonApache-2.0000

sleek-beamer

LaTeX sleek beamer template

Language:TeX000

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonMIT000

video-generation-survey

A reading list of video generation

000

video-to-pose3D

Convert video to 3D pose in one-key.

Language:PythonMIT000

video_features

Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as I3D, R(2+1)D, VGGish, ResNet features.

Language:PythonGPL-3.0000

VideoUtils

Language:PythonMIT010

yhZhai.github.io

Language:HTML010