zelaki

Athena Research Center

Athens, Greece

Thodoris Kouzelis's repositories

DreamSound

Code for Investigating Personalization Methods in Text to Music Generation

Language: Python | 33 3 9

DisfluentFA

A Weakly Supervised Forced Alignment for disfluent speech

Language: Python | 10 10

KaldiLongAligner

Speech to Text Alignment tool implemented with Python and Kaldi

Language: Python | 8 20

awesome-LoRA

A curated list of Parameter Efficient Fine-tuning papers with a TL;DR

6 10

Reading-Diffusion

A collection of interesting papers on Diffusion Models

3 10

localdiff-demo

A repo containing a demo for "Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis"

wsac

This repository contains code for "Weakly Supervised Automated Audio Captioning via Text Only Training"

Language: Python | 2 10

local_pnp

This repo contains experiments for local editing in Diffusion Models

Language: Python | 1 20

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language: Jupyter Notebook | License: BSD-3-Clause | 000

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language: Python | License: NOASSERTION | 000

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language: Python | License: Apache-2.0 | 000

Folder-Structure-Conventions

Folder / directory structure options and naming conventions for software projects

License: MIT | 000

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | 000

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Language: Shell | License: Apache-2.0 | 000

passt_hear21

Language: Python | 000

presentations

A repo for storing Marp presentations

Language: HTML | 010

sail_align

SailAlign is an open-source software toolkit for robust long speech-text alignment. It implements an adaptive, iterative speech recognition and text alignment scheme that can process very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a Perl library, but its functionality also depends on freely available software, namely HTK, srilm and sclite.

Language: Perl | 000

secretsanta

Host a secret santa without leaking your guests' information 🎄

Language: HTML | 000

SEffCaps

Automated Audio Captioning of Sound Effects in Movies and Videos

Language: Python | 010

user_study_templates

Language: JavaScript | 000

wavetransformer

Code base for WaveTransformer: A novel architecture for automated audio captioning

Language: Python | License: NOASSERTION | 000

zelaki.github.io

Language: HTML | 010