Wuchuq

Wuchuq

Geek Repo

Github PK Tool:Github PK Tool

Wuchuq's starred repositories

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral

Language:PythonLicense:Apache-2.0Stargazers:431Issues:0Issues:0

VirtualMarker

[CVPR 2023] Offical Pytorch implementation of "3D Human Mesh Estimation from Virtual Markers"

Language:PythonLicense:Apache-2.0Stargazers:249Issues:0Issues:0

Diffpose

[CVPR 2023] DiffPose: Toward More Reliable 3D Pose Estimation

Language:PythonLicense:MITStargazers:142Issues:0Issues:0

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

Stargazers:1033Issues:0Issues:0

PIDM

Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)

Language:Jupyter NotebookLicense:MITStargazers:476Issues:0Issues:0

MPS-Net_release

Official implementation of CVPR2022 paper "Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video"

Language:PythonLicense:MITStargazers:99Issues:0Issues:0

SemGCN

The Pytorch implementation for "Semantic Graph Convolutional Networks for 3D Human Pose Regression" (CVPR 2019).

Language:PythonLicense:Apache-2.0Stargazers:463Issues:0Issues:0

video-diffusion-pytorch

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Language:PythonLicense:MITStargazers:1210Issues:0Issues:0

diffusion

Denoising Diffusion Probabilistic Models

Language:PythonStargazers:3554Issues:0Issues:0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:13452Issues:0Issues:0

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonLicense:NOASSERTIONStargazers:1497Issues:0Issues:0

TDAN-VSR-CVPR-2020

TDAN: Temporally-Deformable Alignment Network for Video Super-Resolution, CVPR 2020

Language:PythonLicense:MITStargazers:400Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:3159Issues:0Issues:0

spynet

Spatial Pyramid Network for Optical Flow

Language:LuaLicense:NOASSERTIONStargazers:229Issues:0Issues:0

pytorch-spynet

a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch

Language:PythonLicense:GPL-3.0Stargazers:306Issues:0Issues:0

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6800Issues:0Issues:0

RealBasicVSR

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Language:PythonLicense:Apache-2.0Stargazers:892Issues:0Issues:0

ddpm-segmentation

Label-Efficient Semantic Segmentation with Diffusion Models (ICLR'2022)

Language:PythonLicense:MITStargazers:649Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:139Issues:0Issues:0

h36m-fetch

Human 3.6M 3D human pose dataset fetcher

Language:PythonLicense:Apache-2.0Stargazers:361Issues:0Issues:0

noah-research

Noah Research

Language:PythonStargazers:851Issues:0Issues:0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:31084Issues:0Issues:0

maed

[ICCV 2021] Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

Language:PythonLicense:MITStargazers:207Issues:0Issues:0

homogenus

Human Image Gender Classifier for Expressive Body Capture

Language:PythonLicense:NOASSERTIONStargazers:113Issues:0Issues:0

MEVA

Official implementation of ACCV 2020 paper "3D Human Motion Estimation via Motion Compression and Refinement" (Identical repo to https://github.com/KlabCMU/MEVA, will be kept in sync)

Language:PythonLicense:NOASSERTIONStargazers:104Issues:0Issues:0

human_dynamics

Project for paper "Learning 3D Human Dynamics from Video"

Language:PythonLicense:BSD-2-ClauseStargazers:631Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:447Issues:0Issues:0

eft

visualization code for 3D human body annotation by EFT (Exemplar Fine-tuning)

Language:PythonLicense:NOASSERTIONStargazers:372Issues:0Issues:0

smplx

SMPL-X

Language:PythonLicense:NOASSERTIONStargazers:1746Issues:0Issues:0

openpose

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

Language:C++License:NOASSERTIONStargazers:30608Issues:0Issues:0