wangyang199609

wangyang199609

Geek Repo

Github PK Tool:Github PK Tool

wangyang199609's repositories

audio-visual-speech-enhancement

Official Implementation of "Visual Speech Enhancement", Interspeech 2018.

Language:PythonStargazers:0Issues:0Issues:0

awesome-speech-recognition-speech-synthesis-papers

Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling

License:MITStargazers:0Issues:0Issues:0

bsseval

audio source separation evaluation metrics

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CodingInterviewChinese2

《剑指Offer》第二版源代码

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

kaldi

This is the official location of the Kaldi project.

License:NOASSERTIONStargazers:0Issues:0Issues:0

MultimodalAnalysis_SpeakerDiarization

The project tries to solve a speaker diarization problem using audio features, face recognition and video feature extraction from face image, mouth tracking.

Stargazers:0Issues:0Issues:0

phasen

A unofficial Pytorch implementation of Microsoft's PHASEN

Language:PythonStargazers:0Issues:0Issues:0

Speech-measure-SDR-SAR-STOI-PESQ

Speech quality measure of SDR、SAR、STOI、ESTOI、PESQ via MATLAB

Stargazers:0Issues:0Issues:0

SpEx

Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".

License:CC0-1.0Stargazers:0Issues:0Issues:0

SpEx_Plus

SpEx+(tied) source code

Language:PythonStargazers:0Issues:0Issues:0

VGGVox

VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets

Stargazers:0Issues:0Issues:0