ZhaZhaFon

0 followers | 0 following | 0 stars

ZhaZhaFon's repositories

demo-speakerseparation

This is a demo for my bachelor thesis 'Speaker Separation and Machine Auditory Perception for Dialogue Scene'.

repo_voxcelebtrainer

Speaker recognition repo (speaker embeddings, ResNet/VGGVox) || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector

Language: Python | License: MIT | Stargazers: 3 | Issues: 0 | Issues: 0

sv-ssl

Collection of self-supervised learning (SSL) methods for speaker verification (SV).

Language: Jupyter Notebook | Stargazers: 3 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

repo_spectralclustering

Speaker diarization repo (clustering-based segmentation, spectral clustering) || a ready-to-use repo for Speaker Diarization with Spectral Clustering

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

beautiful-jekyll

✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 on Vox1_O when trained only on Vox2)

Stargazers: 0 | Issues: 0 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

kaldiio

A pure Python module for reading and writing Kaldi ark files

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

missing-semester-cn.github.io

Chinese version of the CS "Missing Semester" course

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

os_course_info

OS Lectures 2022 Spring in Dept. CS, Tsinghua Univ.

Stargazers: 0 | Issues: 0 | Issues: 0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-loss

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

repo_asteroid

Speech front-end repo || a modified version of the Asteroid toolkit for Speech Front-end

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

repo_dvector

Speaker recognition repo (speaker embeddings, dvector) || a ready-to-use repo for Speaker Verification / Speaker Embedding with dvector

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

repo_librimix

An open source dataset for source separation

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

speaker_extraction_config

Target speaker extraction and verification for multi-talker speech

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0
License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Speech-Resources

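Labs, companies, resources, internships, and more in the speech field; recommendations and self-nominations are welcome (listed in no particular order)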

Stargazers: 0 | Issues: 0 | Issues: 0

speechbrain_config

A PyTorch-based Speech Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Stargazers: 0 | Issues: 0 | Issues: 0

voicesplit_config

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

voxceleb_unsupervised

Augmentation adversarial training for self-supervised speaker recognition

Stargazers: 0 | Issues: 0 | Issues: 0