AI-X-King's repositories
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Ax
Adaptive Experimentation Platform
C-Plus-Plus
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
calculator
Windows Calculator: A simple yet powerful calculator that ships with Windows
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
CPlusPlusThings
C++那些事
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
dns_mos_calculate
Code for calculate DNS_MOS.
FasterTransformer
Transformer related optimization, including BERT, GPT
lhotse
Tools for handling speech data in machine learning projects.
machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(2000+页)和视频链接
MLAPP_CN_CODE
《Machine Learning: A Probabilistic Perspective》(Kevin P. Murphy)中文翻译和书中算法的Python实现。
modern-cpp-tutorial
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
onnx-tutorials
Tutorials for creating and using ONNX models
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
PSST
Prosodic Speech Segmentation with Transformers
pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
pytorch-docker
Pure Pytorch Docker Images.
reformer-pytorch
Reformer, the efficient Transformer, in Pytorch
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
sherpa
Streaming and non-streaming ASR server for next-gen Kaldi
SpeechT5
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing (ACL'2022)
Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.