Alison Bernice Ma (alisonbma)

Location: United States

Home Page: https://www.alisonma.com/

Alison Bernice Ma's repositories

aiSFX

Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.

Language: Python | License: CC-BY-4.0 | Stargazers: 41 | Issues: 4 | Issues: 2

ai-audio-startups

Community list of startups working with AI in audio and music technology

License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

astronify

Astronomical data sonification.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

auraloss

Collection of audio-focused loss functions in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CLAP

Contrastive Language-Audio Pretraining

Language: Python | License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CLAP-microsoft

Learning audio concepts from natural language supervision

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ddsp

DDSP: Differentiable Digital Signal Processing

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1/24 kHz mono/stereo audio.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

jukebox

Code for the paper "Jukebox: A Generative Model for Music"

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

micro-tcn

Efficient neural networks for analog audio effect modeling

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

MIR_ismir2018-oss-tutorial

ISMIR2018 Tutorial on Open Source and Reproducibility in MIR Research

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MIR_openl3

OpenL3: Open-source deep audio and image embeddings

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ML_embedding-playbook

You want to embed your Tableau content in lots of places. Start here.

Language: CSS | Stargazers: 0 | Issues: 0 | Issues: 0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pedalboard

🎛 🔊 A Python library for working with audio.

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

PUBLICATIONS_paperTemplates

Repository for paper templates in ISMIR Proceedings

Language: TeX | Stargazers: 0 | Issues: 0 | Issues: 0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

Resemblyzer

A python package to analyze and compare voices with deep learning

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

riffusion

Stable diffusion for real-time music generation

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

riffusion-app

Stable diffusion for real-time music generation (web app)

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

sample-generator

Tools to train a generative model on arbitrary audio samples

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

SoundStream

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Stargazers: 0 | Issues: 0 | Issues: 0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

tiny-audio-diffusion

A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

visqol

Perceptual Quality Estimator for speech and audio

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

VoiceLab

Automated Reproducible Acoustical Analysis

Stargazers: 0 | Issues: 0 | Issues: 0

WaveRNN

WaveRNN Vocoder + TTS

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0