ys1998 / maso

Self-supervised representation learning for raw audio

maso [WIP]

This repository contains the code for our ASR project on self-supervised representation learning for raw audio.

Introduction

We hypothesize that problem-agnostic features learnt from raw audio benefit downstream tasks when they capture two important aspects of the underlying audio signal: context and order. To enforce these constraints on the learnt features, we suggest two relatively simple tasks that borrow ideas from prior work in other domains (images, videos, text).
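
The two tasks themselves are not specified in this section. As a rough illustration only, the sketch below shows one way self-supervised training examples for "order" and "context" could be generated from a raw waveform, in the spirit of segment-shuffling and masked-prediction tasks used for images, videos and text. The function names (`make_order_example`, `make_context_example`), the segment count and the labelling scheme are all assumptions made for this example, not the repository's actual implementation.

```python
import itertools

import numpy as np


def _split(waveform, num_segments):
    """Cut a 1-D waveform into equal-length segments, dropping any remainder."""
    seg_len = len(waveform) // num_segments
    return waveform[: seg_len * num_segments].reshape(num_segments, seg_len)


def make_order_example(waveform, num_segments=3, rng=None):
    """Hypothetical 'order' task: shuffle the segments and use the index of
    the applied permutation as the classification label."""
    if rng is None:
        rng = np.random.default_rng()
    segments = _split(waveform, num_segments)
    perms = list(itertools.permutations(range(num_segments)))
    label = int(rng.integers(len(perms)))
    shuffled = segments[list(perms[label])]
    return shuffled, label


def make_context_example(waveform, num_segments=3):
    """Hypothetical 'context' task: hide the middle segment and use it as the
    prediction target given the surrounding segments."""
    segments = _split(waveform, num_segments)
    mid = num_segments // 2
    context = np.delete(segments, mid, axis=0)
    target = segments[mid]
    return context, target


if __name__ == "__main__":
    audio = np.arange(12, dtype=np.float32)  # stand-in for raw audio samples
    shuffled, label = make_order_example(audio, rng=np.random.default_rng(0))
    context, target = make_context_example(audio)
    print(shuffled.shape, label, context.shape, target.shape)
```

In both cases the supervision signal comes from the audio itself, so no transcriptions or other manual labels are needed; a model trained on such examples would have to encode segment ordering or surrounding context in its features.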

Members

About

License: MIT License


Languages

Python 98.1%, PHP 1.5%, Shell 0.3%, C++ 0.1%, Assembly 0.1%