Dimitrios Bralios's starred repositories
llama_codec
A language-driven audio codec model (LLAMA-Codec).
aac-metrics
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities"
Awesome-instruction-tuning
A curated list of awesome instruction tuning datasets, models, papers and repositories.
Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
aac-datasets
Audio Captioning datasets for PyTorch.
improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
python-audio-effects
Apply audio effects such as reverb and EQ directly to audio files or NumPy ndarrays.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
computersandmusic
Notebooks for the EPFL class "Computers and Music".
easyeffects
Limiter, compressor, convolver, equalizer and auto volume and many other plugins for PipeWire applications
stable-audio-tools
Generative models for conditional audio generation
heterogeneous_separation
Code and data recipes for the paper: Heterogeneous Target Speech Separation
optimal_condition_training
Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris Smaragdis and Jonathan Le Roux
musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch