sarulab-speech

sarulab-speech's repositories

jtubespeech

Language:PythonApache-2.0208 11 8

UTMOS22

UT-Sarulab MOS prediction system using SSL models

Language:PythonMIT157 7 10

jsut-label

context labels and pronunciation data for JSUT corpus

NOASSERTION64 5 1

xvector_jtubespeech

xvector model on jtubespeech

Language:PythonMIT38 4 2

whisper-asr-finetune

Language:PythonMIT31 4 5

lightweight_spkr_anon

Lightweight speaker anonymization [IEEE SLT2021]

Language:PythonMIT24 50

multi-speaker-dgp

Official implementation of DGP-based multi-speaker speech synthesis with PyTorch

Language:PythonMIT24 40

tdmelodic_openjtalk

tdmelodic for open-jtalk

2200

Coco-Nut

Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus

19 10

spatial_voice_conversion

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

Language:Python900

ml-audiocaps

Multi-lingual AudioCaps

Apache-2.0700

visual-onoma-to-wave

Visual onoma-to-wave official implementation

Language:PythonMIT500

VMC2024-sarulab-data

MIT300

Mid-Attribute-Speaker-Generation

Language:PythonMIT200

pseudo_speech_decryption

Language:Python1 30

demo_CALLS_corpus

CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center (INTERSPEECH2023)

Language:HTMLApache-2.0000

demo_ChatGPT_EDSS

ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings (INTERSPEECH2023)

Language:HTMLApache-2.0000

bert-japanese

BERT models for Japanese text.

Language:PythonApache-2.0000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT000

SaSLaW

Dialogue Speech Corpus with Audio-visual Egocentric Information, "So, what are you Speaking, Listening, and Watching?"

000