bubing / ml-audio-start

Suggestions for those interested in developing audio applications of machine learning

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Getting Started in 'ML-Audio'

Suggestions for students.

About

Audio and acoustics students sometimes ask "How do I get started learning machine learning?" Not everyone gets their start in a major research environment.

This page began after @drscotthawley felt sufficiently embarassed about not having a coherent answer. Until someone creates a "ML for Audio" online course -- update 1/7/20: See Valerio Velardo's "Deep Learning for Audio"! -- this page may prove helpful.

Notes:

  • This is a collaborative page. Please suggest additions, re-organizations, edits, updates, etc., either via Issues or Pull Requests. (In addition, @drscotthawley may gladly cede control of this content to whichever student or group wants to Wiki-fy it!)

Introductory Remarks

"Read all the tutorials and papers you can, watch videos of all the talks you can, try out and modify whatever code you can get your hands on, take whatever courses you can find, go to whatever conferences you can. Try to build your own system, and spend all your nights and weekends improving it."

This was the best advice some of us could give, because it was the path we took. Some such stories are shared below. This page is an attempt to offer something more "direct" for newcomers.

Essays / Reflections / Autobiographical Sketches

Many practicioners took very different interdisciplinary paths, learning from a hodgepodge of information, in order to complement their existing strengths and fill in gaps in their knowledge. Here are some stories.

(For submissions: Either link to elsewhere on the web, or add a file to the repo via PR. Try to make submissions conclude with a section on what you would say to new students.)

  • How __[someone]__ got started
  • __[a young person]'s__ story
  • ...your name(s) here!...Chris Donahue, Christian Steinmetz, Jordi Pons, Keunwoo Choi, Faro, Justin Salomon,...?

Active Practictioners to Follow

Many of us learn about and contribue to news of new developments, papers, conferences, grants, and networking opportunities via Twitter.

Quick Quotes

  • Justin Salomon: "Anyone working in ML, anyone, should be obliged to curate a dataset before they're allowed to train a single model. The lessons learnt in the process are invaluable, and the dangers of skipping said lessons are manifold (see what I did there?)"

General Reference Information

Online Courses

Tutorials

(I'm often underwhelmed with audio-specific tutorials, actually. No offense! Feel free to suggest some. Here are a couple on related topics that I've found inspiring)

Talks (at conferences)

that we found helpful/inspiring (and are hopefully still relevant)

Key Papers / Codes

(Let's try to list "representative" or "landmark" papers, not just our latest tweak, unless it includes a really good intro/review section. ;-) )

Demos

(Not sure if this only means "deployed models you can play with in your browser," or if other things should count as demos)

Packages & Libraries

Tools / GUIs / Gists

Books

Computer-Related Topics

Python:

Signal Processing Topics

Statistics / Math Topics

Datasets (raw audio)

One finds that many supposed "audio datasets" are really only features or even just metadata! Here are some "raw audio" datasets:

"Major" ML-Audio Research/Development Groups

Universities:

(or, "Where should I apply for grad school?")

  • QMUL (London)
  • UPF (Barcelona)
  • CRRMA (Stanford, San Francisco)
  • IRCAM (Paris)
  • NYU (New York)

Industry:

("Where can I get an internship/job"?)

Conferences

("Which conference(s) should I go to?" -- asked by student on the day this doc began)

Audio-Specific

**Long list of Music Technology specific conferences https://conferences.smcnetwork.org/ - which is references from here https://github.com/MTG/conferences

  • Audio Engineering Society (AES)
  • ASA
  • Digital Audio Effects (DAFx)
  • ICASSP
  • ISMIR
  • SANE
  • Web Audio Conference (WAC)
  • SMC
  • LVA/ICA
  • Audio Mostly
  • WIMP
  • DCASE
  • CSMC
  • MuMe
  • ICMC
  • CMMR
  • IBAC
  • MLSP
  • Interspeech
  • FMA

General ML

  • ICLR
  • ICML
  • NeurIPS
  • IJCNN

Journals

("Where can I get published?")

In addition, in machine learning specifically, the tendency is for conference papers to be peer-reviewed and to "count" as journal publications.

Competitions / Benchmarks

Some are yearly, some may be defunct but still interesting.

Contributors

Ryan Miller

If you want your name listed here, you may. ;-)

About

Suggestions for those interested in developing audio applications of machine learning