Thodoris Kouzelis (zelaki)

zelaki

Geek Repo

Company:Athena Research Center

Location:Athens, Greece

Github PK Tool:Github PK Tool

Thodoris Kouzelis's repositories

DreamSound

Code for Investigating Personalization Methods in Text to Music Generation

DisfluentFA

A Weakly Supervised Forced Alignment for disluent speech

Language:PythonStargazers:9Issues:1Issues:0

KaldiLongAligner

Speech to Text Alignment tool implemented with Python and Kaldi

Language:PythonStargazers:8Issues:0Issues:0

Reading-Diffusion

A collection of interesting papers on Diffusion Models

awesome-LoRA

A curated list of Parameter Efficient Fine-tuning papers

Stargazers:1Issues:0Issues:0

local_pnp

This repo contains experiments for local editing in Diffusion Models

Language:PythonStargazers:1Issues:2Issues:0

wsac

This reporsitory code form Weakly Supervised Automaed Audio Captioning via Text Only Training

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Folder-Structure-Conventions

Folder / directory structure options and naming conventions for software projects

License:MITStargazers:0Issues:0Issues:0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:0Issues:0

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

presentations

This is a repo where to save Marp presentations

Language:HTMLStargazers:0Issues:1Issues:0

sail_align

SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a perl library but its functionality also depends on freely available software, namely HTK, srilm and sclite.

Language:PerlStargazers:0Issues:0Issues:0

secretsanta

Host secret santa without leaking your guests' informations 🎄

Language:HTMLStargazers:0Issues:0Issues:0

SEffCaps

Automated Audio Captioning of Sound Effects in Movies and Videos

Language:PythonStargazers:0Issues:1Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0

wavetransformer

Code base for WaveTransformer: A novel architecture for automated audio captioning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0