Beast code in Giters

Wuchuq's starred repositories

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral

Language:PythonApache-2.043100

VirtualMarker

[CVPR 2023] Offical Pytorch implementation of "3D Human Mesh Estimation from Virtual Markers"

Language:PythonApache-2.024900

Diffpose

[CVPR 2023] DiffPose: Toward More Reliable 3D Pose Estimation

Language:PythonMIT14200

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

103300

PIDM

Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)

Language:Jupyter NotebookMIT47600

MPS-Net_release

Official implementation of CVPR2022 paper "Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video"

Language:PythonMIT9900

SemGCN

The Pytorch implementation for "Semantic Graph Convolutional Networks for 3D Human Pose Regression" (CVPR 2019).

Language:PythonApache-2.046300

video-diffusion-pytorch

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Language:PythonMIT121000

diffusion

Denoising Diffusion Probabilistic Models

Language:Python355400

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonMIT1345200

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonNOASSERTION149700

TDAN-VSR-CVPR-2020

TDAN: Temporally-Deformable Alignment Network for Video Super-Resolution, CVPR 2020

Language:PythonMIT40000

spynet

Spatial Pyramid Network for Optical Flow

Language:LuaNOASSERTION22900

pytorch-spynet

a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch

Language:PythonGPL-3.030600

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookApache-2.0680000

RealBasicVSR

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Language:PythonApache-2.089200

ddpm-segmentation

Label-Efficient Semantic Segmentation with Diffusion Models (ICLR'2022)

Language:PythonMIT64900

H36M-Toolbox

Language:PythonApache-2.013900

h36m-fetch

Human 3.6M 3D human pose dataset fetcher

Language:PythonApache-2.036100

noah-research

Noah Research

Language:Python85100

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.03108400

Wuchuq