Irene Martín Morató (marmoi)

marmoi

Geek Repo

Company:Tampere university

Location:Tampere

Home Page:https://marmoi.github.io

Github PK Tool:Github PK Tool

Irene Martín Morató's starred repositories

Language:PythonStargazers:1Issues:0Issues:0

ssl4birdsounds

Self-supervised representation learning for bird sounds (ICASSPW SASB 2024)

Language:PythonStargazers:9Issues:0Issues:0

ATST-SED

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

Language:Jupyter NotebookLicense:MITStargazers:60Issues:0Issues:0

HTS-Audio-Transformer

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Language:PythonLicense:MITStargazers:327Issues:0Issues:0

DCASE2021_task6_v2

Code for CVSSP submission to DCASE 2021 Task 6

Language:Jupyter NotebookStargazers:34Issues:0Issues:0

TextToAudioGrounding

The dataset and baseline code for Text-to-Audio Grounding (TAG)

Language:PythonLicense:MITStargazers:34Issues:0Issues:0

whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Language:PythonLicense:BSD-2-ClauseStargazers:286Issues:0Issues:0

aac-datasets

Audio Captioning datasets for PyTorch.

Language:PythonLicense:MITStargazers:89Issues:0Issues:0

transformer_workshop

Code for the Transformer workshop

Language:Jupyter NotebookStargazers:4Issues:0Issues:0

audio-and-speech-tech-2022

Audio and Speech Technologies Workshop 2022, code examples

Language:ShellLicense:MITStargazers:4Issues:0Issues:0

kapre

kapre: Keras Audio Preprocessors

Language:PythonLicense:MITStargazers:916Issues:0Issues:0

netron

Visualizer for neural network, deep learning and machine learning models

Language:JavaScriptLicense:MITStargazers:26743Issues:0Issues:0

interpretable_predictions

Interpretable Neural Predictions with Differentiable Binary Variables

Language:PythonLicense:MITStargazers:85Issues:0Issues:0

byol-a

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

Language:PythonLicense:NOASSERTIONStargazers:202Issues:0Issues:0

wiki

This repo contains the source code for the deployment of the unofficial crowdsourced wiki for the Faculty of Information Technology and Communication Sciences at Tampere University.

License:UnlicenseStargazers:3Issues:0Issues:0

sed_eval

Evaluation toolbox for Sound Event Detection

Language:PythonLicense:MITStargazers:136Issues:0Issues:0

dcase_util

A collection of utilities for Detection and Classification of Acoustic Scenes and Events

Language:PythonLicense:MITStargazers:130Issues:0Issues:0

sed_vis

Visualization toolbox for Sound Event Detection

Language:PythonLicense:MITStargazers:106Issues:0Issues:0

fense

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

Language:PythonStargazers:16Issues:0Issues:0

pytorchforaudio

Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.

Language:PythonLicense:MITStargazers:227Issues:0Issues:0

soundata

Python library for downloading, loading & working with sound datasets

Language:PythonLicense:BSD-3-ClauseStargazers:280Issues:0Issues:0

dcase_datalist

Collection of DCASE related datasets

Language:HTMLLicense:MITStargazers:13Issues:0Issues:0