eehuahua's starred repositories
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
chineseocr
YOLOv3 + OCR
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
MVSNet_pytorch
PyTorch Implementation of MVSNet
CasMVSNet_pl
Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching using pytorch-lightning
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
FACEGOOD-Audio2Face
http://www.facegood.cc
non_rigid_icp
Modified version of non-rigid Iterative closest point algorithm for fitting to noisy point clouds
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 on Vox1_O when trained only on Vox2)
ColossalAI
Making large AI models cheaper, faster and more accessible
DualStyleGAN
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
face-alignment
2D and 3D face alignment library built using PyTorch
Deep3DFaceReconstruction
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
asv-subtools
Open-source tools for speaker recognition
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.