Weiji Zhuang's repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers || Current highlight : we got our WHAMR results check it out here !
Audiomer-PyTorch
A Convolutional Transformer for Keyword Spotting
av_hubert
A self-supervised learning framework for audio-visual speech
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
distribuuuu
The pure and clear PyTorch Distributed Training Framework.
face-alignment
:fire: 2D and 3D Face alignment library build using pytorch
gcommands
Speech Commands Recognition using end-to-end deep learning models in pytorch
google-research
Google Research
kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
nhc
LBNL Node Health Check
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Rasa_NLU_Chi
Turn Chinese natural language into structured data 中文自然语言理解
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
setk
Tools for Speech Enhancement integrated with Kaldi
Speech_Enhancement_DNN_NMF
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
TC-ResNet
Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
wfst-mkgraph
wfst make graph learning