runngezhang's repositories
AEC-Challenge
AEC Challenge
Autoformer
Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
blog
Public repo for HF blog posts
CLIP
Contrastive Language-Image Pretraining
colorednoise
Python package to generate Gaussian (1/f)**beta noise (e.g. pink noise)
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
ebooks
A collection of classic e-books on history, politics, psychology, philosophy, mathematics, and computer science (about 100,000 books)
FullSubNet-plus
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
hifi-gan-bwe
Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.
INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
ipfsspec
Read-only Python fsspec implementation for IPFS
noise-suppression-for-voice
Noise suppression plugin based on Xiph's RNNoise
pocketfft
Fork of https://gitlab.mpcdf.mpg.de/mtr/pocketfft to simplify external contributions
pykaldi
A Python wrapper for Kaldi
pytorch_optimizer
Optimizer, LR scheduler, and loss function collections in PyTorch
RealTimeBWE
Unofficial PyTorch Lightning implementation of "Real-time Speech Frequency Bandwidth Extension"
RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
RWKV-Runner
An RWKV management and startup tool: fully automated and only 8 MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
Speech-Resources
Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome
TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
TRUNet
Unofficial PyTorch implementation of "Real-Time Denoising and Dereverberation with Tiny Recurrent U-Net"
vst3_public_sdk
VST 3 Implementation Helper Classes And Examples
vst3projectgenerator
VST3 Project Generator
vst3sdk
VST 3 Plug-In SDK