Yamagishi and Echizen Laboratories, National Institute of Informatics's repositories
project-NN-Pytorch-scripts
see README
multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
Capsule-Forensics-v2
Implementation of the Capsule-Forensics-v2
Intelligibility-MetricGAN
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning"
Attention_Backend_for_ASV
Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances
midi-to-audio
Project for MIDI to Audio Synthesis
speaker_sex_attribute_privacy
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
downloader-DR-VCTK-complete
downloader to obtain the complete DR-VCTK dataset (250GB)
fashion_adv
Fashion-Guided Adversarial Attack on Person Segmentation
Generalization_of_CMs_regularizations
The source code for the paper Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms (interspeech2023)
speechbrain
A PyTorch-based Speech Toolkit
ddsp-guitar
DDSP-Guitar