zcy618's repositories
speechbrain
A PyTorch-based Speech Toolkit
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
DeepFilterNet
Noise suppression using deep filtering
RapidOCR
Awesome multi-language OCR toolkits based on ONNXRuntime, OpenVINO and PaddlePaddle.
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++; more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods
asteroid
The PyTorch-based audio source separation toolkit for researchers
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Yuzukilizard
Yuzukilizard is a Small Heterogeneous & AI Powered Dev Board Based on Allwinner V851S
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
InternLM
InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.
MNBVC
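MNBVC (Massive Never-ending BT Vast Chinese corpus) is an ultra-large-scale Chinese corpus that aims to match the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.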
biquad
Collection of alterable digital biquad filters for dynamic audio effect creation
UIT_Mobile
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
tvm
Open deep learning compiler stack for CPUs, GPUs and specialized accelerators
nnabla
Neural Network Libraries
cpp_torch
A tiny-dnn-style deep learning framework based on libtorch. Header-only, with no dependencies other than libtorch.
tflite-micro
TensorFlow Lite for Microcontrollers
QRSolutionToMatrixInverse
We use the C language to implement QRSolutionToMatrixInverse.
IS2022-CVQ
Samples for Complex VQ-VAE speech enhancement - ICASSP2021
TaylorSENet
This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', which was accepted by IJCAI-ECAI2022 (Long oral)
android-cmake-sample
Android and CMake sample - learn how to compile native code inside an Android app with CMake
Speech-enhancement
Deep learning for audio denoising
DTLN
TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-lite, ONNX and real-time audio processing support.
pytorch-distributed
A quickstart and benchmark for PyTorch distributed training.
voice_activity_detection-1
Voice Activity Detection based on Deep Learning & TensorFlow