maso [WIP]

This repository contains the code for our ASR project on self-supervised representation learning for raw audio.

Introduction

We hypothesize that problem-agnostic features learnt from raw audio can be beneficial for downstream tasks if they capture two important aspects of the underlying audio signal - context and order. We suggest two relatively simple tasks for enforcing these constraints on the learnt features by borrowing some ideas from prior work in other domains (images, videos, text).

Members

About

Self-supervised representation learning for raw audio

MIT License

Languages

Language:Python 98.1%Language:PHP 1.5%Language:Shell 0.3%Language:C++ 0.1%Language:Assembly 0.1%