runngezhang's repositories

aero

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

AP-BWE

Towards Efficient and High-Quality Bandwidth Extension with Parallel Amplitude-Phase Prediction

License:MITStargazers:0Issues:0Issues:0

APNet2

Source code of APNet2, a vocoder

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audio-transformers-course

The Hugging Face Course on Transformers for Audio

Language:MDXLicense:Apache-2.0Stargazers:0Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audiowmark

Audio Watermarking

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

Awesome-state-space-models

Collection of papers on state-space models

Stargazers:0Issues:0Issues:0

CRUSE

a lightweight network for monaural speech enhancement

Language:PythonStargazers:0Issues:0Issues:0

datasets_musicDetect

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

docs-l10n

Translations of TensorFlow documentation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++License:MITStargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

License:MITStargazers:0Issues:0Issues:0

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

scikit-image

Image Processing SciKit (Toolbox for SciPy)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

storm

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

License:MITStargazers:0Issues:0Issues:0

TCN

Sequence modeling benchmarks and temporal convolutional networks

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

tianya-docs

精心收集的天涯神贴,不带水印,方便阅读

Stargazers:0Issues:0Issues:0

Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/粘贴/批量导入图片,段落排版/排除水印,扫描/生成二维码。内置多国语言库。

License:MITStargazers:0Issues:0Issues:0

voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

voicefixer

General Speech Restoration

Language:PythonLicense:MITStargazers:0Issues:0Issues:0