ziggy1209's repositories
Add_noise_and_rir_to_speech
The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generate far-field speech data using room impulse response data from BUT Speech@FIT Reverb Database.
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
cognitive-services-speech-sdk
Sample code for the Microsoft Cognitive Services Speech SDK
DNS-Challenge-2020
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
google-research
Google Research
honk
PyTorch implementations of neural network models for keyword spotting
kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
kaldi_egs_CGN
Kaldi recipe for creating Dutch ASR from CGN
kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
KWS_pytorch
Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM
lite.ai.toolkit
đź› A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOX, YOLOP, YOLOv6, YOLOR, MODNet, YOLOX, YOLOv7, YOLOv5. MNN, NCNN, TNN, ONNXRuntime.
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
pykaldi
A Python wrapper for Kaldi
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
shennong
A Python toolbox for speech features extraction
switch-cuda
A simple bash script for switching between installed versions of CUDA.
usingcli-book
using command line like a hacker
vision
Datasets, Transforms and Models specific to Computer Vision
WebRTC_AGC
Automatic Gain Control Module Port From WebRTC
WebRTC_NS
Noise Suppression Module Port From WebRTC
wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
wespeaker
Research and Production Oriented Speaker Recognition Toolkit
z3
The Z3 Theorem Prover