atuxhe

User data from Github https://github.com/atuxhe

followers

following

stars

atuxhe's repositories

BitNet

Official inference framework for 1-bit LLMs

Language:C++MIT100

OpenVoice

Instant voice cloning by MyShell

Language:PythonMIT100

AEC-Challenge

AEC Challenge

MIT000

agc

Audiogen Codec

Language:PythonMIT000

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT000

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT000

cfm-vc

Language:Jupyter NotebookAGPL-3.0000

gemmlowp

Low-precision matrix multiplication

Language:C++Apache-2.0020

FireRedASR

Language:PythonApache-2.0000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Language:PythonNOASSERTION000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT000

gtcrn

An official implementation of GTCRN, an ultra-lite speech enhancement model.

Language:PythonMIT000

ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Language:PythonApache-2.0000

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT000

metavoice-src

AI for human-level speech intelligence

Language:PythonApache-2.0000

mix-phoneme-bert

An unofficial PyTorch implementation of Mix-Phoneme-Bert

Language:Python000

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonApache-2.0000

MooER

MooER: an LLM-based Speech Recognition and Translation Model from Moore Threads

Language:PythonNOASSERTION000

moshi

Language:PythonApache-2.0000

natural_voice_assistant

Language:PythonMIT000

RSTnet

Real-time Speech-Text Foundation Model Toolkit

Language:Python000

ruapu

Detect CPU ISA features with single-file

Language:CMIT000

SPTK

A suite of speech signal processing tools

Language:C++Apache-2.0000

Step-Audio

Language:PythonApache-2.0000

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonMIT000

TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Language:PythonMIT000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.0000

tts-frontend-dataset

TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization

Language:PythonApache-2.0000

unified2021

A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION

Language:MATLAB010

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT000