zr_jin's repositories
neural-architecture-search
Basic implementation of [Neural Architecture Search with Reinforcement Learning](https://arxiv.org/abs/1611.01578).
Ambar-SwiftUI
Ambar is a macOS Menu Bar app built with SwiftUI.
conv-tasnet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
conv-tasnet-libriheavymix
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https://github.com/funcwj/aps)
lhotse
Tools for handling speech data in machine learning projects.
sherpa-ncnn
Real-time (online/streaming) speech recognition using next-gen Kaldi with ncnn. Support embedded systems
espnet
End-to-End Speech Processing Toolkit
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
GigaSpeech
Large, modern dataset for speech recognition
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
sherpa
Speech-to-text server framework with next-gen Kaldi
sherpa-onnx
Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go
Shift-Net
A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift
transformers
๐ค Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
voicefilter-libriheavymix
Unofficial PyTorch implementation of Google AI's VoiceFilter system
whisper
Robust Speech Recognition via Large-Scale Weak Supervision