runngezhang-jx's repositories
AP-BWE
Towards Efficient and High-Quality Bandwidth Extension with Parallel Amplitude-Phase Prediction
asteroid
The PyTorch-based audio source separation toolkit for researchers
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
DeepFilterNet
Noise supression using deep filtering
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
dlrover
DLRover: An Automatic Distributed Deep Learning System
DPCRN_DNS3
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
mlx-examples
Examples in the MLX framework
MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
phasen
A unofficial Pytorch implementation of Microsoft's PHASEN
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
pybind11
Seamless operability between C++11 and Python
python
Boost.org python module
pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
rnnoise
Recurrent neural network for audio noise reduction
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
silero-vad2
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
simple-cython-limiter
A simple real-time limiter implemented using Python, Cython, Numpy and PyAudio
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows