taalua

taalua

Geek Repo

0

following

0

stars

Github PK Tool:Github PK Tool

taalua's repositories

AudioStyleNet

This repository contains the code for my master thesis on Emotion-Aware Facial Animation

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Catch-A-Waveform

Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

clpcnet

Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

License:NOASSERTIONStargazers:0Issues:0Issues:0

FastSpeech2

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:

Stargazers:0Issues:0Issues:0

few-shot-transformer-tts

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

License:MITStargazers:0Issues:0Issues:0

FG-transformer-TTS

Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.

License:MITStargazers:0Issues:0Issues:0

flow_synthesizer

Universal audio synthesizer control learning with normalizing flows

License:MITStargazers:0Issues:0Issues:0

flowEQ

β-VAE for intelligent control of a five band parametric EQ

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

g2p

g2p: English Grapheme To Phoneme Conversion

License:Apache-2.0Stargazers:0Issues:0Issues:0

inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

License:MITStargazers:0Issues:0Issues:0

jax-variational-diffwave

Jax/Flax implementation of Variational-DiffWave.

License:MITStargazers:0Issues:0Issues:0

MaskCycleGAN-VC

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

License:MITStargazers:0Issues:0Issues:0

mir-svc

Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach

License:NOASSERTIONStargazers:0Issues:0Issues:0

mixture-of-experts

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

License:GPL-3.0Stargazers:0Issues:0Issues:0

msaf

Music Structure Analysis Framework

License:MITStargazers:0Issues:0Issues:0

MTL-Speaker-Embeddings

Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021

License:MITStargazers:0Issues:0Issues:0

Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

License:MITStargazers:0Issues:0Issues:0

normalizing-flows

PyTorch implementation of normalizing flow models

License:MITStargazers:0Issues:0Issues:0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

License:MITStargazers:0Issues:0Issues:0

SSL_Anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

License:MITStargazers:0Issues:0Issues:0

ssqueezepy

Synchrosqueezing, wavelet transforms, and time-frequency analysis in Python

License:MITStargazers:0Issues:0Issues:0

stereoEEG2speech

Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using the PyTorch Lightning.

Stargazers:0Issues:0Issues:0

taalua

Config files for my GitHub profile.

Stargazers:0Issues:0Issues:0

tt-vae-gan

Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

License:Apache-2.0Stargazers:0Issues:0Issues:0

WaveGrad

Implementation of Google Brain's WaveGrad vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0