Daria Diatlova (dariadiatlova)

Company: @deepvk

Location: Saint-Petersburg

Home Page: https://www.linkedin.com/in/daria-diatlova-09b589184/


Organizations
deepvk

Daria Diatlova's repositories

Fre-GAN

Test-task for VK-research internship 2022

Language: Python, License: MIT, Stargazers: 8, Issues: 3, Issues: 0

iSTFTNet-pytorch

iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language: Python, License: Apache-2.0, Stargazers: 2, Issues: 1, Issues: 0
Language: HTML, Stargazers: 0, Issues: 1, Issues: 0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques, applied directly to the raw waveform, that further improve the model's performance and generalization.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

NeMo

NeMo: a toolkit for conversational AI

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

speech_course

YSDA course in Speech Processing.

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

dla

Deep learning for audio processing

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language: Python, License: CC-BY-4.0, Stargazers: 0, Issues: 0, Issues: 0

dsp

Digital Signal Processing course

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

DTLN

TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-lite, ONNX, and real-time audio processing support.

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language: Python, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 1, Issues: 0

Kazakh_TTS

An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.

Language: Shell, License: CC-BY-4.0, Stargazers: 0, Issues: 0, Issues: 0

LPCNet

Efficient neural speech synthesis

Language: C, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

MSP-Podcast_Challenge

MSP-Podcast Challenge Baseline Code

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

RecSys-hse-fall-2021

This repository contains homework assignments for the recommendation systems course.

Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

russian_speech_denoiser

This repository contains supporting scripts for the Master's thesis research.

Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

StarGAN-Voice-Conversion

This is a PyTorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

swift-sandbox

This repository contains several simple iOS apps.

Language: Swift, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0
Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

vits2_pytorch

Unofficial vits2-TTS implementation in PyTorch

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0