zcy618's repositories
android-cmake-sample
Android and CMake sample - learn how to compile native code inside an Android app with CMake
asteroid
The PyTorch-based audio source separation toolkit for researchers
biquad
Collection of alterable digital biquad filters for dynamic audio effect creation
ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
cpp_torch
A tiny-dnn-style deep learning framework based on libtorch. Header-only, with no dependencies other than libtorch.
DeepFilterNet
Noise suppression using deep filtering
DTLN
TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-Lite, ONNX and real-time audio processing support.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
InternLM
InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.
IS2022-CVQ
Samples for Complex VQ-VAE speech enhancement - ICASSP2021
MNBVC
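MNBVC (Massive Never-ending BT Vast Chinese corpus) is a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.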
nnabla
Neural Network Libraries
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
pytorch-distributed
A quickstart and benchmark for pytorch distributed training.
QRSolutionToMatrixInverse
We use the C language to implement QRSolutionToMatrixInverse.
RapidOCR
Awesome multi-language OCR toolkits based on ONNXRuntime, OpenVINO and PaddlePaddle.
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Speech-enhancement
Deep learning for audio denoising
speechbrain
A PyTorch-based Speech Toolkit
TaylorSENet
This is the implementation of the paper "Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement", which was accepted by IJCAI-ECAI 2022 (long oral).
tflite-micro
TensorFlow Lite for Microcontrollers
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
tvm
Open deep learning compiler stack for CPUs, GPUs and specialized accelerators
UIT_Mobile
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
voice_activity_detection-1
Voice Activity Detection based on Deep Learning & TensorFlow
VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net and CAM++. More models may be supported in the future. The project also supports the MelSpectrogram and Spectrogram data preprocessing methods.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Yuzukilizard
Yuzukilizard is a Small Heterogeneous & AI Powered Dev Board Based on Allwinner V851S