Yuhang's repositories

AdaptiveFilterandActiveNoiseCancellation

Adaptive Filter and Active Noise Cancellation —— LMS, NLMS, RLS

Language:MATLABStargazers:0Issues:0Issues:0

Auto-Tuning-Spectral-Clustering

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome

😎 Awesome lists about all kinds of interesting topics

License:CC0-1.0Stargazers:0Issues:0Issues:0

Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Language:MATLABLicense:MITStargazers:0Issues:0Issues:0

Awesome-GPT-Store

A collection of major GPTS available in public

License:MITStargazers:0Issues:0Issues:0

CaoYuhang.github.io

blog website

Language:JavaScriptStargazers:0Issues:0Issues:0

ChatWaifu-marai

About Combined ChatGPT with Moegoe TTS to create a Chatting Waifu for Marai

License:MITStargazers:0Issues:0Issues:0

CyberWaifu

GPT + Tacotron2/VITS + Live2D = CyberWaifu

License:MITStargazers:0Issues:0Issues:0

DNS-Challenge

This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

Free-Certifications

A curated list of free courses & certifications.

License:MITStargazers:0Issues:0Issues:0

GPT-vup

GPT-vup BIliBili | 抖音 | AI | 虚拟主播

Stargazers:0Issues:0Issues:0

hackingtool

ALL IN ONE Hacking Tool For Hackers

License:MITStargazers:0Issues:0Issues:0

IP_LAP

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

License:Apache-2.0Stargazers:0Issues:0Issues:0

julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

LiveWhisper

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

License:MITStargazers:0Issues:0Issues:0

megatts2

Unoffical implement of Megatts2

License:MITStargazers:0Issues:0Issues:0

rnnt-speech-recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

License:MITStargazers:0Issues:0Issues:0

roop

one-click face swap

License:GPL-3.0Stargazers:0Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License:MITStargazers:0Issues:0Issues:0

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

Stargazers:0Issues:0Issues:0

speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

SpEx_Plus

SpEx+(tied) source code

License:MITStargazers:0Issues:0Issues:0

tcnse

TCN-based Speech Enhancement

Stargazers:0Issues:0Issues:0

Teacher-free-Knowledge-Distillation

Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization

License:MITStargazers:0Issues:0Issues:0

tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

License:Apache-2.0Stargazers:0Issues:0Issues:0

unified2021

A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

voice_activity_detection

Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)

License:MITStargazers:0Issues:0Issues:0

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Language:PythonStargazers:0Issues:0Issues:0

w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

License:MITStargazers:0Issues:0Issues:0