Shiyan Li (Leeviber)

Leeviber

Geek Repo

Github PK Tool:Github PK Tool

Shiyan Li's starred repositories

FairMOT

[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking

Language:PythonLicense:MITStargazers:4010Issues:84Issues:526

face.evoLVe

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Language:PythonLicense:MITStargazers:3444Issues:112Issues:188

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Language:C++License:Apache-2.0Stargazers:2986Issues:54Issues:1183

CVPR2022-DaGAN

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

Language:PythonLicense:NOASSERTIONStargazers:963Issues:26Issues:79
Language:PythonLicense:BSD-3-ClauseStargazers:818Issues:12Issues:448

tinydiarize

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

Language:PythonLicense:MITStargazers:441Issues:26Issues:15

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language:PythonLicense:MITStargazers:312Issues:8Issues:69

awesome-RK3588

Useful resources for developing with the RK3588. :rocket:

lookwhostalking

Look Who’s Talking: Active Speaker Detection in the Wild

Language:PythonLicense:MITStargazers:72Issues:10Issues:7

paroli

Streaming TTS based on Piper with optional RK3588 NPU support

Language:C++License:MITStargazers:43Issues:4Issues:7

RKNN-RealESRGAN

Deploy super resolution (RealESRGAN) to RK3588S with single python script and rknn model.

Language:PythonLicense:MITStargazers:4Issues:1Issues:2