Bingliang Li (BingliangLi)

Company: CUHK(SZ)

Bingliang Li's repositories

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

wav2lip_vq

Wav2Lip in a vector-quantized (VQ) space

Language: Python | Stargazers: 1 | Issues: 0

audiocaps-download

This package aims at simplifying the download of the AudioCaps dataset.

Language: Python | Stargazers: 0 | Issues: 0

AudioLDM2

Text-to-Audio/Music Generation

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

audioset-download

This package aims at simplifying the download of the AudioSet dataset.

Language: Python | Stargazers: 0 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0

ED-Pose

[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation"

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

CoCap

[ICCV 2023] Accurate and Fast Compressed Video Captioning

License: MIT | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

detr

End-to-End Object Detection with Transformers

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

Diff-Foley

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

License: Apache-2.0 | Stargazers: 0 | Issues: 0

DWPose

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

EDGE

Official PyTorch Implementation of EDGE (CVPR 2023)

License: MIT | Stargazers: 0 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

HumanML3D

HumanML3D: A large and diverse 3D human motion-language dataset.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

motion-latent-diffusion

[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model

License: MIT | Stargazers: 0 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 0

OmniControl

OmniControl: Control Any Joint at Any Time for Human Motion Generation, arXiv 2023

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

OneTrainer

OneTrainer is a one-stop solution for all your stable diffusion training needs.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

pcpnet

PyTorch implementation of PCPNet

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

R2-Talker-code

R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

License: MIT | Stargazers: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0

stable-audio-tools

Generative models for conditional audio generation

License: MIT | Stargazers: 0 | Issues: 0

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

License: MIT | Stargazers: 0 | Issues: 0

videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

License: MIT | Stargazers: 0 | Issues: 0