Yang Di (YangDi666)

Company: @INRIA

Location: France

Yang Di's starred repositories

SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Language: Python | License: Apache-2.0 | Stargazers: 861 | Issues: 24 | Issues: 28

LaVie

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Language: Python | License: Apache-2.0 | Stargazers: 790 | Issues: 26 | Issues: 23

LIA

[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Language: Python | License: NOASSERTION | Stargazers: 578 | Issues: 28 | Issues: 23

awesome-uncertainty-deeplearning

This repository contains a collection of surveys, datasets, papers, and code for predictive uncertainty estimation in deep learning models.

License: MIT | Stargazers: 459 | Issues: 29 | Issues: 0

2D-Motion-Retargeting

PyTorch implementation for our paper Learning Character-Agnostic Motion for Motion Retargeting in 2D, SIGGRAPH 2019

Language: Python | License: MIT | Stargazers: 433 | Issues: 19 | Issues: 34

transmomo.pytorch

This is the official PyTorch implementation of the CVPR 2020 paper "TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting".

UniPose

We propose UniPose, a unified framework for human pose estimation, based on our “Waterfall” Atrous Spatial Pooling architecture, that achieves state-of-the-art results on several pose estimation metrics. Current pose estimation methods utilizing standard CNN architectures heavily rely on statistical postprocessing or predefined anchor poses for joint localization. UniPose incorporates contextual segmentation and joint localization to estimate the human pose in a single stage, with high accuracy, without relying on statistical postprocessing methods. The Waterfall module in UniPose leverages the efficiency of progressive filtering in the cascade architecture, while maintaining multi-scale fields-of-view comparable to spatial pyramid configurations. Additionally, our method is extended to UniPose-LSTM for multi-frame processing and achieves state-of-the-art results for temporal pose estimation in video. Our results on multiple datasets demonstrate that UniPose, with a ResNet backbone and Waterfall module, is a robust and efficient architecture for pose estimation, obtaining state-of-the-art results in single-person pose detection for both single images and videos.

Language: Python | License: NOASSERTION | Stargazers: 209 | Issues: 10 | Issues: 44

MutualGuide

Localize to Classify and Classify to Localize: Mutual Guidance in Object Detection

Language: Python | License: MIT | Stargazers: 111 | Issues: 4 | Issues: 7

GCL

Implementation for CVPR2021 paper "Joint Generative and Contrastive Learning for Unsupervised Person Re-identification"

Language: Python | License: MIT | Stargazers: 46 | Issues: 6 | Issues: 17

ICE

Implementation for ICCV 2021 paper "ICE: Inter-instance Contrastive Encoding for Unsupervised Person Re-identification"

Language: Python | License: MIT | Stargazers: 45 | Issues: 2 | Issues: 20

UNIK

[BMVC 2021 Oral] Official implementation of our paper "A Unified Framework for Real-world Skeleton-based Action Recognition" on Toyota Smarthome/Penn Action/NTU-RGB+D/Posetics datasets

Language: Python | License: NOASSERTION | Stargazers: 44 | Issues: 4 | Issues: 2

LDU

Latent Discriminant deterministic Uncertainty [ECCV2022]

Language: Python | License: GPL-3.0 | Stargazers: 38 | Issues: 3 | Issues: 10

g3an-project

[CVPR 20] G3AN: Disentangling Appearance and Motion for Video Generation

Language: Python | License: MIT | Stargazers: 38 | Issues: 7 | Issues: 11

MUAD-Dataset

MUAD: Multiple Uncertainties for Autonomous Driving, a benchmark for multiple uncertainty types and tasks [BMVC 2022]

Language: Python | License: NOASSERTION | Stargazers: 27 | Issues: 2 | Issues: 4

GAFF

[WACV 2021] "Guided Attentive Feature Fusion for Multispectral Pedestrian Detection"

ABMT

Implementation for WACV2021 paper "Enhancing Diversity in Teacher-Student Networks via Asymmetric branches for Unsupervised Person Re-identification"

Language: Python | License: MIT | Stargazers: 17 | Issues: 1 | Issues: 9

vpnplusplus

This repo contains an extension of VPN for the recognition of Activities of Daily Living

Latent_Action_Composition

[ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation

2s-AGCN-For-Daily-Living

2s-AGCN on Smarthome (dataset for daily living)

Video_3D_Pose_Estimation

3D pose estimation from video: LCRNet2D + VideoPose3D

SSTA-PRS

[WACV 2021] Selective Spatio-Temporal Aggregation based Pose Refinement System: Towards understanding human activities in real-world videos.

Language: Python | License: MIT | Stargazers: 11 | Issues: 2 | Issues: 0

SLURP_uncertainty_estimate

The implementations for SLURP: Side Learning Uncertainty for Regression Problems (BMVC 2021)

Language: Python | Stargazers: 10 | Issues: 1 | Issues: 0

walking-on-the-edge-fast-low-distortion-adversarial-examples

Walking on the Edge: Fast, Low-Distortion Adversarial Examples

Language: Python | Stargazers: 7 | Issues: 0 | Issues: 0

improved_HAR_on_Toyota

Improved action recognition with Separable spatio-temporal attention using alternative Skeletal and Video pre-processing

Language: Python | Stargazers: 4 | Issues: 3 | Issues: 0

CNN-structure-analyse-function

2018.02-03 internship in Hubert Curien Laboratory

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

Human-knee-angles-estimation

Knee flexion/extension angles estimation for a person who walks towards an RGB-D camera

Language: Python | Stargazers: 2 | Issues: 2 | Issues: 0