Shiyan Li's starred repositories
face.evoLVe
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
FastDeploy
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.
CVPR2022-DaGAN
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
awesome-RK3588
Useful resources for developing with the RK3588. :rocket:
lookwhostalking
Look Who’s Talking: Active Speaker Detection in the Wild
RKNN-RealESRGAN
Deploy super resolution (RealESRGAN) to RK3588S with single python script and rknn model.