Beast code in Giters

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language:PythonApache-2.031200

lamini

Language:PythonApache-2.0242300

VoiceActivityProjection

Voice Activity Projection Models: Self-supervised learning of Turn-taking Events

Language:PythonMIT2300

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Language:PythonMIT16200

spear-tts-pytorch

Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch

Language:PythonMIT24100

CleanUNet

Official PyTorch Implementation of CleanUNet (ICASSP 2022)

Language:PythonMIT27100

WavJourney

WavJourney: Compositional Audio Creation with LLMs

Language:PythonNOASSERTION50900

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookNOASSERTION1038400

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonMIT59800

getTV3Videos

Descarrega vídeos de TV3 (tv3.cat) // Download videos of TV3 channel (tv3.cat)

Language:JavaScript5900

PolyLangVITS

Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)

Language:PythonMIT7000

stable-diffusion.cpp

Stable Diffusion in pure C/C++

Language:C++MIT276800

langstream

Build robust LLM applications with true composability 🔗

Language:PythonMIT39800