AI-S2-Lab

Speech Understanding and Speech Generation (AI.S2) Lab

Home Page: https://ttslr.github.io

AI-S2-Lab's repositories

EmoPP

[ICASSP'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech

FluentEditor

[InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency

Language: Python, Stargazers: 20, Issues: 8, Issues: 0

MnTTS2

NCMMSC'2022

Language: Jupyter Notebook, Stargazers: 4, Issues: 0, Issues: 0

EMS

[InterSpeech'2024] Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge

Stargazers: 3, Issues: 0, Issues: 0

M2S-ADD

[InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"

Language: Python, Stargazers: 3, Issues: 0, Issues: 0

english-conversation-corpus

English conversation corpus for conversational TTS.

Language: Shell, License: GPL-3.0, Stargazers: 2, Issues: 0, Issues: 0

MonTTS

Journal of Chinese Information Processing (中文信息学报). 2022

Language: Python, Stargazers: 2, Issues: 0, Issues: 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

License: MIT, Stargazers: 1, Issues: 0, Issues: 0

HohhotBrain

LLM-based Hohhot AI Smart Brain

Stargazers: 1, Issues: 0, Issues: 0
Stargazers: 1, Issues: 0, Issues: 0

FCTalker

Submitted to ICASSP'2023

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

IF-MMIN

[ICASSP'2023] Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

MnTTS

IALP'2022

Language: Jupyter Notebook, License: Unlicense, Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 0, Issues: 0

ECSS

[AAAI'2024] Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
License: MIT, Stargazers: 0, Issues: 0, Issues: 0

StrengthNet

[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

TTS_Voice_Demo_Website

This repository shares demos of TTS-synthesized speech in two versions: a speech-only demo, and a demo that includes both spectrograms and speech.

Stargazers: 0, Issues: 0, Issues: 0