Ruoho Ruotsi's starred repositories
python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
sd-webui-deforum
Deforum extension for AUTOMATIC1111's Stable Diffusion webui
audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
Practical-RIFE
We are developing a more practical approach for users based on RIFE.
Quality-Net
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)
PidginUNMT
Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence
Intelligibility-MetricGAN
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning"
stoi-vqcpc
Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"
py-intelligibility
Python implementation of a few speech intelligibility prediction algorithms
audioObfuscation
A Python function that extracts the LPC coefficients of recorded speech and replaces them with obfuscating coefficients. Based on this paper: https://www.fxpal.com/publications/audio-privacy-reducing-speech-intelligibility-while-preserving-environmental-sounds.pdf
DTW_Phoneme
Computes the Dynamic Time Warping between two phoneme sequences.
prores-proxy-generator
Create ProRes proxies using FFmpeg
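For readers unfamiliar with the technique named in the DTW_Phoneme entry above, here is a minimal sketch of dynamic time warping over two phoneme sequences. It is not the repository's code: the function name `dtw_distance`, the 0/1 local cost (0 for identical phoneme labels, 1 otherwise), and the example phoneme labels are all assumptions made for illustration.

```python
from typing import Sequence


def dtw_distance(a: Sequence[str], b: Sequence[str]) -> float:
    """Return the DTW cost between two phoneme sequences.

    Assumption: the local cost is 0 when two phoneme labels match and 1
    otherwise. A real system might use a phonetic-feature distance instead.
    """
    if not a or not b:
        raise ValueError("both sequences must be non-empty")

    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] holds the cheapest alignment of a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = 0.0 if a[i - 1] == b[j - 1] else 1.0
            # Extend the best of: step in a only, step in b only, step in both.
            cost[i][j] = local + min(
                cost[i - 1][j],
                cost[i][j - 1],
                cost[i - 1][j - 1],
            )
    return cost[n][m]


if __name__ == "__main__":
    # Hypothetical phoneme labels, chosen only to exercise the function.
    print(dtw_distance(["k", "ae", "t"], ["k", "ae", "ae", "t"]))
    print(dtw_distance(["k", "ae", "t"], ["b", "ae", "t"]))
```

Because warping lets one element align with several, a repeated phoneme in one sequence adds no cost under this local cost, while a substituted phoneme adds one unit.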