cc-cherie's starred repositories
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
generative-models
Generative Models by Stability AI
emotionally_consistent_speech
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
benchmarks
This repository contains the SpeechBrain Benchmarks
audiotext-transformer
Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features
BERT-like-is-All-You-Need
The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Speech-emotion-recognition-MCFN
This is a repository for our work: A DUAL ATTENTION-BASED MODALITY-COLLABORATIVE FUSION NETWORK FOR EMOTION RECOGNITION
data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
PraatScripts
These are praat scripts I use in my research, implemented in parselmouth for python for use in binder
leedl-tutorial
《李宏毅深度学习教程》(李宏毅老师推荐👍),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
audioset-downloader
cli to download examples of a specific class from google's AudioSet
audioset-processing
Toolkit for downloading and processing Google's AudioSet dataset.