sarulab-speech

sarulab-speech

Geek Repo

Speech group, Saruwatari-Koyama Lab, the University of Tokyo, Japan.

Location:Tokyo, Japan

Home Page:http://www.sp.ipc.i.u-tokyo.ac.jp/index-en

Github PK Tool:Github PK Tool

sarulab-speech's repositories

Language:PythonLicense:Apache-2.0Stargazers:208Issues:11Issues:8

UTMOS22

UT-Sarulab MOS prediction system using SSL models

Language:PythonLicense:MITStargazers:157Issues:7Issues:10

jsut-label

context labels and pronunciation data for JSUT corpus

xvector_jtubespeech

xvector model on jtubespeech

Language:PythonLicense:MITStargazers:38Issues:4Issues:2

lightweight_spkr_anon

Lightweight speaker anonymization [IEEE SLT2021]

Language:PythonLicense:MITStargazers:24Issues:5Issues:0

multi-speaker-dgp

Official implementation of DGP-based multi-speaker speech synthesis with PyTorch

Language:PythonLicense:MITStargazers:24Issues:4Issues:0

tdmelodic_openjtalk

tdmelodic for open-jtalk

Stargazers:22Issues:0Issues:0

Coco-Nut

Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus

spatial_voice_conversion

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

Language:PythonStargazers:9Issues:0Issues:0

ml-audiocaps

Multi-lingual AudioCaps

License:Apache-2.0Stargazers:7Issues:0Issues:0

visual-onoma-to-wave

Visual onoma-to-wave official implementation

Language:PythonLicense:MITStargazers:5Issues:0Issues:0
License:MITStargazers:3Issues:0Issues:0
Language:PythonLicense:MITStargazers:2Issues:0Issues:0

demo_CALLS_corpus

CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center (INTERSPEECH2023)

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

demo_ChatGPT_EDSS

ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings (INTERSPEECH2023)

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bert-japanese

BERT models for Japanese text.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SaSLaW

Dialogue Speech Corpus with Audio-visual Egocentric Information, "So, what are you Speaking, Listening, and Watching?"

Stargazers:0Issues:0Issues:0