Benjamin Elizalde (bmartin1)

bmartin1

Geek Repo

Company:Microsoft

Location:Redmond

Home Page:https://bmartin1.github.io/

Github PK Tool:Github PK Tool

Benjamin Elizalde's starred repositories

Language:PythonLicense:Apache-2.0Stargazers:629Issues:0Issues:0

PAM

PAM is a no-reference audio quality metric for audio generation tasks

Language:PythonLicense:MITStargazers:37Issues:0Issues:0
Stargazers:12Issues:0Issues:0

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonLicense:MITStargazers:1786Issues:0Issues:0

fadtk

A simple library for Fréchet Audio Distance (FAD) calculation

Language:PythonLicense:MITStargazers:132Issues:0Issues:0

NoAudioCaptioning

Repository for "Training Audio Captioning Models without Audio"

License:NOASSERTIONStargazers:9Issues:0Issues:0

Pengi

An Audio Language model for Audio Tasks

Language:PythonLicense:MITStargazers:274Issues:0Issues:0

CLAP

Learning audio concepts from natural language supervision

Language:PythonLicense:MITStargazers:446Issues:0Issues:0

frechet-audio-distance

A lightweight library for Frechet Audio Distance calculation.

Language:PythonLicense:MITStargazers:224Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19374Issues:0Issues:0

WavText5K

Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"

Language:PythonLicense:MITStargazers:49Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:66090Issues:0Issues:0

audio-dataset

Audio Dataset for training CLAP and other models

Language:PythonStargazers:610Issues:0Issues:0

ai-audio-startups

Community list of startups working with AI in audio and music technology

License:Apache-2.0Stargazers:1509Issues:0Issues:0
Language:HTMLLicense:MITStargazers:3Issues:0Issues:0

muscaps

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:76Issues:0Issues:0

bmartin1.github.io

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language:JavaScriptLicense:MITStargazers:1Issues:0Issues:0

strfnet-IS2020

Official repo for the STRFNet system appeared in INTERSPEECH2020

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

audio-captioning-resources

A list of resources that can help in research for automated audio captioning

Stargazers:33Issues:0Issues:0

audio-captioning-papers

A list of papers about audio captioning

Stargazers:77Issues:0Issues:0

PlotNeuralNet

Latex code for making neural networks diagrams

Language:TeXLicense:MITStargazers:21706Issues:0Issues:0

aed-demo

Tiny Acoustic Event Detection Demo

Language:PythonStargazers:1Issues:0Issues:0

NELQlearning

nel subcommittee code

Language:Jupyter NotebookStargazers:4Issues:0Issues:0

sed_vis

Visualization toolbox for Sound Event Detection

Language:PythonLicense:MITStargazers:111Issues:0Issues:0