Shuolin (Xushuolin)

Company: Bournemouth University

Location: UK

Shuolin's starred repositories

Train_SD_VAE

Finetune your VAE on private datasets!

Language: Python · Stargazers: 13 · Issues: 0

HAC

🏠 [ECCV 2024] PyTorch implementation of 'HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression'

Language: Python · License: NOASSERTION · Stargazers: 174 · Issues: 0

TokenHMR

[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation

Language: Python · License: NOASSERTION · Stargazers: 181 · Issues: 0

ComfyUI-ToonCrafter

This project enables ToonCrafter to be used in ComfyUI.

Language: Python · License: Apache-2.0 · Stargazers: 287 · Issues: 0

VideoTetris

VideoTetris: Towards Compositional Text-To-Video Generation

Language: Python · Stargazers: 190 · Issues: 0
Stargazers: 1 · Issues: 0
Language: Python · License: NOASSERTION · Stargazers: 109 · Issues: 0
Language: Python · License: Apache-2.0 · Stargazers: 643 · Issues: 0

PCDMs

Implementation code: Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 141 · Issues: 0

CFLD

[CVPR 2024 Highlight] Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis

Language: Jupyter Notebook · License: MIT · Stargazers: 155 · Issues: 0

control-a-video

Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"

Language: Python · License: GPL-3.0 · Stargazers: 357 · Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python · License: Apache-2.0 · Stargazers: 8189 · Issues: 0

DeepSpeech

DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.

Language: C++ · License: MPL-2.0 · Stargazers: 24889 · Issues: 0

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language: Python · License: GPL-3.0 · Stargazers: 46513 · Issues: 0

ToonCrafter

A research paper for generative cartoon interpolation

Language: Python · License: Apache-2.0 · Stargazers: 4966 · Issues: 0

CAMDM

(SIGGRAPH 2024) Official repository for "Taming Diffusion Probabilistic Models for Character Control"

Language: C# · Stargazers: 130 · Issues: 0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python · License: MIT · Stargazers: 8008 · Issues: 0
Language: Python · License: Apache-2.0 · Stargazers: 7 · Issues: 0

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language: Python · License: Apache-2.0 · Stargazers: 973 · Issues: 0

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language: Python · License: MIT · Stargazers: 1625 · Issues: 0

PIRender

The source code of the ICCV 2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

Language: Python · License: NOASSERTION · Stargazers: 511 · Issues: 0
Language: Python · Stargazers: 502 · Issues: 0

DeformingThings4D

[ICCV 2021] A dataset of non-rigidly deforming objects.

Language: Python · Stargazers: 293 · Issues: 0

EMOPortraits

Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Stargazers: 199 · Issues: 0

PantoMatrix

PantoMatrix: Co-Speech Talking Head and Gestures Generation

Language: Python · License: NOASSERTION · Stargazers: 887 · Issues: 0

LAMP

Official implementation code of LAMP: Learn a Motion Pattern by Few-Shot Tuning a Text-to-Image Diffusion Model (few-shot text-to-video diffusion)

Language: Python · License: NOASSERTION · Stargazers: 245 · Issues: 0

PAE

[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation

Language: Python · Stargazers: 51 · Issues: 0

AnimeSR

Code for "AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos"

Language: Python · License: NOASSERTION · Stargazers: 324 · Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python · License: NOASSERTION · Stargazers: 11400 · Issues: 0

AnimateDiff

Official implementation of AnimateDiff.

Language: Python · License: Apache-2.0 · Stargazers: 10021 · Issues: 0