NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech).

Language: Python, License: Apache-2.0, 1125300

Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion

This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion".

Language: Python, 1400

KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

License: MIT, 79400

speech-model-compression

A collection of papers related to speech model compression.

2300

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor/tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python, License: MIT, 2043200

jiamin1013

Jiamin Xie's starred repositories

disfluency_detection_from_audio

English-words-pronunciation-mp3-audio-download

misspell

english-words

english-fisher-annotations

awesome-disfluency-detection

relative_phoneme_analysis

peft

distil-whisper

speech-trident

g2p

whisper

icefall

NANSY

Child-ASR-Paper

NeMo

Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion

KnowledgeEditingPapers

speech-model-compression

audiocraft

awesome-large-audio-models

SGConv

tuning_playbook

adjustText

wer_are_we

VQMIVC

codec2-dev

tvdcn

espnet

speechbrain