Dimitrios Bralios (dbralios)

Location: Athens, Greece

Dimitrios Bralios's starred repositories

drl-zh

Deep Reinforcement Learning: Zero to Hero!

Language: Jupyter Notebook | License: MIT | Stargazers: 1916 | Issues: 0

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language: SystemVerilog | Stargazers: 6274 | Issues: 0

llama_codec

A language-driven audio codec model (LLAMA-Codec).

Language: Python | Stargazers: 33 | Issues: 0

aac-metrics

Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.

Language: Python | License: MIT | Stargazers: 24 | Issues: 0

emuser

a painless LaTeX resume template

Language: TeX | License: GPL-3.0 | Stargazers: 2 | Issues: 0

AQG-QAV

Audio Question Generation with Question Answering Validation

Stargazers: 1 | Issues: 0

AudioFlamingo

Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities"

Language: Python | License: MIT | Stargazers: 23 | Issues: 0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language: Python | License: Apache-2.0 | Stargazers: 1763 | Issues: 0

Cacophony

Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986

Language: Python | License: MIT | Stargazers: 21 | Issues: 0
Language: Python | Stargazers: 3 | Issues: 0

Awesome-instruction-tuning

A curated list of awesome instruction tuning datasets, models, papers and repositories.

Language: Python | License: Apache-2.0 | Stargazers: 254 | Issues: 0

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Stargazers: 411 | Issues: 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4039 | Issues: 0

aac-datasets

Audio Captioning datasets for PyTorch.

Language: Python | License: MIT | Stargazers: 89 | Issues: 0

julius

Fast PyTorch based DSP for audio and 1D signals

Language: Python | License: MIT | Stargazers: 407 | Issues: 0

improved-diffusion

Release for Improved Denoising Diffusion Probabilistic Models

Language: Python | License: MIT | Stargazers: 2899 | Issues: 0

python-audio-effects

Apply audio effects such as reverb and EQ directly to audio files or NumPy ndarrays.

Language: Python | License: MIT | Stargazers: 380 | Issues: 0

fma

FMA: A Dataset For Music Analysis

Language: Jupyter Notebook | License: MIT | Stargazers: 2154 | Issues: 0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License: NOASSERTION | Stargazers: 25424 | Issues: 0

audiolazy

Expressive Digital Signal Processing (DSP) package for Python

Language: Python | License: GPL-3.0 | Stargazers: 683 | Issues: 0

computersandmusic

Notebooks for the EPFL class "Computers and Music".

Language: Jupyter Notebook | Stargazers: 20 | Issues: 0

easyeffects

Limiter, compressor, convolver, equalizer and auto volume and many other plugins for PipeWire applications

Language: C++ | License: GPL-3.0 | Stargazers: 6034 | Issues: 0

ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Language: Python | Stargazers: 306 | Issues: 0

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | Stargazers: 1500 | Issues: 0

ddc_onset

Music onset detector from Dance Dance Convolution packaged as a lightweight PyTorch module

Language: Python | License: MIT | Stargazers: 30 | Issues: 0

Pengi

An Audio Language model for Audio Tasks

Language: Python | License: MIT | Stargazers: 254 | Issues: 0

heterogeneous_separation

Code and data recipes for the paper: Heterogeneous Target Speech Separation

Language: Python | License: MIT | Stargazers: 38 | Issues: 0
Language: Jupyter Notebook | Stargazers: 47 | Issues: 0

optimal_condition_training

Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris Smaragdis and Jonathan Le Roux

Language: Python | License: MIT | Stargazers: 12 | Issues: 0

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language: Python | License: MIT | Stargazers: 3061 | Issues: 0