Beast code in Giters

ZhaZhaFon's repositories

demo-speakerseparation

This is a demo for my bachelor thesis 'Speaker Separation and Machine Auditory Perception for Dialogue Scene'.

Language:Shell3 1 1

repo_voxcelebtrainer

说话人识别仓库-说话人表征-ResNet/VGGVox || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector

Language:PythonMIT300

sv-ssl

Collection of self-supervised learning (SSL) methods for speaker verification (SV).

Language:Jupyter Notebook300

spkh

Language:Python2 10

repo_spectralclustering

说话人分割仓库-聚类分割-谱聚类 || a ready-to-use repo for Speaker Diariazation with Spectral Clustering

Language:Jupyter Notebook100

asteroid_c

Language:PythonMIT000

beautiful-jekyll

✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com

MIT000

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

000

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0000

kaldiio

A pure python module for reading and writing kaldi ark files

NOASSERTION000

missing-semester-cn.github.io

the CS missing semester Chinese version

NOASSERTION000

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

BSD-3-Clause000

os_course_info

OS Lectures 2022 Spring in Dept. CS, Tsinghua Univ.

000

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

MIT000

pytorch-loss

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

MIT000

repo_asteroid

语音前端仓库 || a modified version of Asteroid toolkit for Speech Front-end

Language:PythonMIT000

repo_dvector

说话人识别仓库-说话人表征-dvector || a ready-to-use repo for Speaker Verification / Speaker Embedding with dvector

Language:Python000

repo_librimix

An open source dataset for source separation

Language:PythonMIT000

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

MIT000

speaker_extraction_config

target speaker extraction and verification for multi-talker speech

Language:PythonGPL-3.0000

speakerbeam_config

NOASSERTION000

speakerbrain

Language:Python000

Speech-Resources

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐（排名不分先后）

000

speechbrain_config

A PyTorch-based Speech Toolkit

Apache-2.0000

speechbrain_own

Language:PythonApache-2.0000

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

MIT000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

MPL-2.0000

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

000

voicesplit_config

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

Apache-2.0000

voxceleb_unsupervised

Augmentation adversarial training for self-supervised speaker recognition

000