Anton Mitrofanov's starred repositories
GoodbyeDPI
GoodbyeDPI — Deep Packet Inspection circumvention utility (for Windows)
FAdam_PyTorch
an implementation of FAdam (Fisher Adam) in PyTorch
DataProcessingFramework
Framework for processing and filtering datasets
AudioBench
AudioBench: A Universal Benchmark for Audio Large Language Models
Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
rir-classifier
Recipe for training and testing RIR-Classifier
jsalt2020_simulate
Training data simulation
CTranslate2
Fast inference engine for Transformer models
C8DASR-Baseline-NeMo
NeMo: a toolkit for conversational AI
chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.