Vlad Bataev's starred repositories
spotify-downloader
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
ml-engineering
Machine Learning Engineering Open Book
wemake-python-styleguide
The strictest and most opinionated python linter ever!
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
DeepFilterNet
Noise supression using deep filtering
audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
eng-handbook
A developer's guide to management: an open-sourced handbook for leading software engineering teams.
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
audio-dataset
Audio Dataset for training CLAP and other models
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
podcasts-dataset
dataset of podcasts and episodes
sd-benchmarks
Stable Diffusion inference benchmarks