Yuhang's repositories
AdaptiveFilterandActiveNoiseCancellation
Adaptive Filter and Active Noise Cancellation: LMS, NLMS, RLS
Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
awesome
😎 Awesome lists about all kinds of interesting topics
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Awesome-GPT-Store
A collection of major GPTs available to the public
CaoYuhang.github.io
blog website
ChatWaifu-marai
Combines ChatGPT with Moegoe TTS to create a chatting waifu for Marai
CyberWaifu
GPT + Tacotron2/VITS + Live2D = CyberWaifu
DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
Free-Certifications
A curated list of free courses & certifications.
GPT-vup
GPT-vup: BiliBili | Douyin | AI | virtual streamer
hackingtool
ALL IN ONE Hacking Tool For Hackers
IP_LAP
CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
megatts2
Unofficial implementation of Megatts2
rnnt-speech-recognition
End-to-end speech recognition using RNN Transducers in TensorFlow 2.0
roop
one-click face swap
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Speech-Separation-Paper-Tutorial
Must-read papers for speech separation based on neural networks
speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
SpEx_Plus
SpEx+(tied) source code
tcnse
TCN-based Speech Enhancement
Teacher-free-Knowledge-Distillation
Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization
tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
unified2021
A Unified Speech Enhancement Front-End for Online Dereverberation, Acoustic Echo Cancellation, and Source Separation
voice_activity_detection
PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
w2v2-how-to
How to use our public wav2vec2 dimensional emotion model