AI-S2-Lab

Speech understanding and Speech generation (AI.S2) Lab

https://ttslr.github.io

AI-S2-Lab's repositories

EmoPP

[ICASSP2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech

Language:HTML20 2 2

FluentEditor

[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency

Language:Python20 80

MnTTS2

NCMMSC'2022

Language:Jupyter Notebook400

EMS

[InterSpeech'2024] Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge

300

M2S-ADD

[InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"

Language:Python300

english-conversation-corpus

English conversation corpus for conversational TTS.

Language:ShellGPL-3.0200

MonTTS

Journal of Chinese Information Processing (中文信息学报). 2022

Language:Python200

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

MIT100

HohhotBrain

LLM-based Hohhot AI Smart Brain

100

NCE-TTS

100

FCTalker

Submitted to ICASSP'2023

Language:Python000

IF-MMIN

[ICASSP'2023] Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities

Language:PythonMIT000

MnTTS

IALP'2022

Language:Jupyter NotebookUnlicense000

.github

000

ECSS

[AAAI'2024] Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling

Language:PythonMIT000

Noise-robust_MER

MIT000

StrengthNet

[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

NOASSERTION000

TTS_Voice_Demo_Website

这个仓库用来分享tts合成语音demo，共两个版本(仅语音demo、包括频谱图和语音的demo)

000