kun song (kunsung)

kunsung

Geek Repo

Company:Northwestern Polytechnical University

Github PK Tool:Github PK Tool

kun song's starred repositories

AAR

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Language:PythonStargazers:49Issues:0Issues:0

SimpleSpeech

The open source code for SimpleSpeech series

Language:PythonStargazers:66Issues:0Issues:0

speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Language:PythonLicense:Apache-2.0Stargazers:2439Issues:0Issues:0

contrastive-predictive-coding

PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018)

Language:PythonStargazers:82Issues:0Issues:0

Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language:Jupyter NotebookLicense:MITStargazers:574Issues:0Issues:0

CPC_audio

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Language:PythonLicense:MITStargazers:346Issues:0Issues:0

LlamaVoice

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

Language:PythonStargazers:152Issues:0Issues:0

mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Language:PythonLicense:MITStargazers:651Issues:0Issues:0

llama-models

Utilities intended for use with Llama models.

Language:PythonLicense:NOASSERTIONStargazers:3576Issues:0Issues:0

e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Language:PythonLicense:MITStargazers:214Issues:0Issues:0

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonLicense:MITStargazers:1157Issues:0Issues:0

tortoise-tts-fast

Fast TorToiSe inference (5x or your money back!)

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:769Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:4261Issues:0Issues:0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11374Issues:0Issues:0

AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Language:PythonLicense:NOASSERTIONStargazers:519Issues:0Issues:0
Language:PythonStargazers:85Issues:0Issues:0

SRVQ

Spherical residual vector quantization (SRVQ)

Language:PythonLicense:MITStargazers:25Issues:0Issues:0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language:PythonLicense:MITStargazers:1142Issues:0Issues:0

LLM-Codec

The open source code for LLM-Codec

Language:PythonStargazers:105Issues:0Issues:0

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:2393Issues:0Issues:0

LibriTTS-P

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

Stargazers:104Issues:0Issues:0

valle-audiodec

Inference code for Audiodec-Valle-Wenetspeech4TTS

Language:PythonLicense:NOASSERTIONStargazers:40Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:29764Issues:0Issues:0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language:PythonLicense:MITStargazers:2342Issues:0Issues:0

fairseq2

FAIR Sequence Modeling Toolkit 2

Language:PythonLicense:MITStargazers:661Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:55253Issues:0Issues:0

TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Language:PythonLicense:Apache-2.0Stargazers:189Issues:0Issues:0

UniAudio

The Open Source Code of UniAudio

Language:PythonStargazers:502Issues:0Issues:0
Language:PythonLicense:MITStargazers:245Issues:0Issues:0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License:Apache-2.0Stargazers:3187Issues:0Issues:0