flyingleafe

@serokell

London

Dmitrii Mukhutdinov's repositories

uroman-python

Python wrapper around uroman tokenizer

Language: Nix | 12 2 1

dqnroute

Emulation system for routing algorithms, particularly for DQN routing

Language: Jupyter Notebook | 9 3 4

vxs-vpt

Vocal percussion transcription system (UoE MSc project, WIP)

Language: Jupyter Notebook | 7 30

hooktest

whatever

1 30

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language: Python | License: MIT | 020

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language: Python | License: BSD-2-Clause | 010

CTranslate2

Fast inference engine for Transformer models

Language: C++ | License: MIT | 010

davis

Package containing helper functions for loading and evaluating DAVIS

Language: C++ | License: NOASSERTION | 020

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform which further improve model performance and its generalization abilities.

Language: Python | License: NOASSERTION | 020

express-jwt

connect/express middleware that validates a JsonWebToken (JWT) and sets req.user with the attributes

Language: TypeScript | License: MIT | 020

fairseq2

FAIR Sequence Modeling Toolkit 2

License: MIT | 000

faster-whisper

Faster Whisper transcription with CTranslate2

Language: Python | License: MIT | 010

helpers

Helper/utility functions written in TypeScript

Language: TypeScript | 010

itmo-ctd-msc-thesis

MSc thesis, ITMO CTD 2019

Language: TeX | 020

lhotse

Tools for handling speech data in machine learning projects.

Language: Python | License: Apache-2.0 | 010

llm.sh

LLM personal agent on your command line

Language: Python | 000

lootbox

Toolbox for your cool project

Language: Haskell | 020

magenta

Magenta: Music and Art Generation with Machine Intelligence

Language: Python | License: Apache-2.0 | 020

marker

Convert PDF to markdown quickly with high accuracy

License: GPL-3.0 | 000

music-inf

Practicals and experiments for UoE Music Informatics coursework

Language: Haskell | License: BSD-3-Clause | 030

nixpkgs

Nix Packages collection

Language: Nix | License: MIT | 020

org-journal

A simple org-mode based journaling mode

Language: Emacs Lisp | License: BSD-3-Clause | 020

pact-js-sdk

JavaScript SDK for Pact smart contracts

Language: TypeScript | License: MIT | 020

personal-finance

Monorepo for my custom personal finance management tools.

Language: Python | 000

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook | License: MIT | 010

reach-lang

Reach: The Safest and Easiest DApp Programming Language

Language: Haskell | License: Apache-2.0 | 020

sednn

Deep-learning-based speech enhancement using Keras or PyTorch, made easy to use

Language: Jupyter Notebook | 020

templates

Flake templates

License: MIT | 000

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | 000

wespeaker

Research and Production Oriented Speaker Recognition Toolkit

Language: Python | License: Apache-2.0 | 010