atuxhe's repositories

BitNet

Official inference framework for 1-bit LLMs

Language: C++ | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

OpenVoice

Instant voice cloning by MyShell

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

AEC-Challenge

AEC Challenge

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

agc

Audiogen Codec

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

gemmlowp

Low-precision matrix multiplication

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

GPT-SoVITS

Just 1 minute of voice data is enough to train a good TTS model (few-shot voice cloning).

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

gtcrn

An official implementation of GTCRN, an ultra-lite speech enhancement model.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

metavoice-src

AI for human-level speech intelligence

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mix-phoneme-bert

An unofficial PyTorch implementation of Mix-Phoneme-Bert

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MooER

MooER: an LLM-based Speech Recognition and Translation Model from Moore Threads

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

RSTnet

Real-time Speech-Text Foundation Model Toolkit

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

ruapu

Detect CPU ISA features with a single file

Language: C | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

SPTK

A suite of speech signal processing tools

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

tts-frontend-dataset

TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

unified2021

A Unified Speech Enhancement Front-End for Online Dereverberation, Acoustic Echo Cancellation, and Source Separation

Language: MATLAB | Stargazers: 0 | Issues: 1 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0