Dmitrii Mukhutdinov's repositories
uroman-python
Python wrapper around uroman tokenizer
CTranslate2
Fast inference engine for Transformer models
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
express-jwt
connect/express middleware that validates a JsonWebToken (JWT) and sets req.user with the attributes
fairseq2
FAIR Sequence Modeling Toolkit 2
faster-whisper
Faster Whisper transcription with CTranslate2
itmo-ctd-msc-thesis
MSc thesis, ITMO CTD 2019
llm.sh
LLM personal agent on your command line
marker
Convert PDF to markdown quickly with high accuracy
org-journal
A simple org-mode based journaling mode
pact-js-sdk
JavaScript SDK for Pact smart contracts
personal-finance
Monorepo for my custom personal finance management tools.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
reach-lang
Reach: The Safest and Easiest DApp Programming Language
templates
Flake templates
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.