xzm2004260

xzm2004's repositories

awesome-music-informatics

A curated list of awesome article, tutorial, library, webpage, etc.

100

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language:PythonCC-BY-4.0100

FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

MIT100

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

MIT000

audioFlux

A library for audio and music analysis, feature extraction.

MIT000

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

MIT000

audiowmark

Audio Watermarking

GPL-3.0000

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Apache-2.0000

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

MIT000

diffwave-sr

MIT000

DiJiang

The official implementation of "DiJiang: Efficient Large Language Models through Compact Kernelization"

000

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Apache-2.0000