Yamagishi and Echizen Laboratories, National Institute of Informatics (nii-yamagishilab)

Location: Tokyo, Japan

Home Page: https://nii-yamagishilab.github.io

Yamagishi and Echizen Laboratories, National Institute of Informatics's repositories

Language: Python | License: BSD-3-Clause | Stargazers: 291 | Issues: 9 | Issues: 26

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

Language: Python | License: BSD-3-Clause | Stargazers: 263 | Issues: 18 | Issues: 11

Capsule-Forensics-v2

Implementation of Capsule-Forensics-v2

Language: Python | License: BSD-3-Clause | Stargazers: 115 | Issues: 7 | Issues: 23

tacotron2

An implementation of Tacotron and Tacotron2

Language: Python | License: BSD-3-Clause | Stargazers: 81 | Issues: 19 | Issues: 1

ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

Language: C | License: BSD-3-Clause | Stargazers: 73 | Issues: 0 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 59 | Issues: 5 | Issues: 8

Intelligibility-MetricGAN

Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning"

Language: Python | License: BSD-3-Clause | Stargazers: 51 | Issues: 8 | Issues: 3

Attention_Backend_for_ASV

Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances

Language: Python | License: BSD-3-Clause | Stargazers: 46 | Issues: 3 | Issues: 9
Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 20 | Issues: 2 | Issues: 4

midi-to-audio

Project for MIDI to Audio Synthesis

Language: Shell | License: Apache-2.0 | Stargazers: 17 | Issues: 2 | Issues: 0

NELE-GAN

Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement

Language: Python | License: BSD-3-Clause | Stargazers: 17 | Issues: 3 | Issues: 1

speaker_sex_attribute_privacy

Project for Hiding Speaker’s Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline

Language: Python | Stargazers: 14 | Issues: 3 | Issues: 0

SSL-SAS

Language-independent SSL-based speaker anonymization system

Language: Python | License: NOASSERTION | Stargazers: 12 | Issues: 2 | Issues: 1

downloader-DR-VCTK-complete

Downloader to obtain the complete DR-VCTK dataset (250 GB)

Language: Python | License: BSD-3-Clause | Stargazers: 4 | Issues: 3 | Issues: 1

fashion_adv

Fashion-Guided Adversarial Attack on Person Segmentation

mla

A Multi-Level Attention Model for Evidence-Based Fact Checking

Language: Python | License: BSD-3-Clause | Stargazers: 4 | Issues: 4 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 2 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 2 | Issues: 0 | Issues: 0

Generalization_of_CMs_regularizations

Source code for the paper "Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms" (Interspeech 2023)

Language: Python | License: BSD-3-Clause | Stargazers: 1 | Issues: 1 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Shell | License: BSD-3-Clause | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Shell | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

ddsp-guitar

DDSP-Guitar

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Shell | License: BSD-3-Clause | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 3 | Issues: 0