Beast code in Giters

Bingliang Li's repositories

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language:PythonMIT100

wav2lip_vq

wav2lip in a Vector Quantized (VQ) space

Language:Python100

audiocaps-download

This package aims at simplifying the download of the AudioCaps dataset.

Language:Python000

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonNOASSERTION000

audioset-download

This package aims at simplifying the download of the AudioSet dataset.

Language:Python000

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookMIT000

controlled-motion-latent-diffusion

Language:PythonMIT000

CUHKSZ-Radiance

Apache-2.0000

ED-Pose

[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation "

Language:PythonNOASSERTION000

CoCap

[ICCV 2023] Accurate and Fast Compressed Video Captioning

MIT000

crawlers

000

detr

End-to-End Object Detection with Transformers

Language:PythonApache-2.0000

Diff-Foley

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Apache-2.0000

DWPose

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Language:PythonApache-2.0000

EDGE

Official PyTorch Implementation of EDGE (CVPR 2023)

MIT000

guided-motion-diffusion

Language:PythonNOASSERTION000

HumanML3D

HumanML3D: A large and diverse 3d human motion-language dataset.

Language:PythonMIT000

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonMIT000

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Apache-2.0000

motion-latent-diffusion

[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model

MIT000

MP-HOI.github.io

Language:JavaScript000

OmniControl

OmniControl: Control Any Joint at Any Time for Human Motion Generation, arXiv 2023

Language:PythonMIT000

OneTrainer

OneTrainer is a one-stop solution for all your stable diffusion training needs.

AGPL-3.0000

pcpnet

Pytorch implementation of PCPNet

Language:PythonNOASSERTION000

R2-Talker-code

R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning

Language:PythonMIT000

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

MIT000

sd-scripts

Apache-2.0000

stable-audio-tools

Generative models for conditional audio generation

MIT000

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

MIT000

videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

MIT000