eonglints

Dan Lyth's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT65282 5450

foam

A personal knowledge management and sharing system for VSCode

Language:TypeScriptNOASSERTION15100 121 689

ml-engineering

Machine Learning Engineering Open Book

Language:PythonCC-BY-SA-4.010317 107 18

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language:Jupyter NotebookMIT3625 73 96

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonMIT3331 58 70

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language:PythonMIT3101 99 53

pytorch-optimizer

torch-optimizer -- collection of optimizers for Pytorch

Language:PythonApache-2.02996 33 63

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonMIT2335 60 167

s4

Structured state space sequence models

Language:Jupyter NotebookApache-2.02287 52 132

notero

A Zotero plugin for syncing items and notes into Notion

Language:TypeScriptMIT2152 26 227

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonMIT1888 40 43

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonMIT1775 21 180

mt3

MT3: Multi-Task Multitrack Music Transcription

Language:PythonApache-2.01379 26 89

NeuralSpeech

Language:PythonMIT1346 34 124

Notion-to-Obsidian-Converter

Converts exported Notion notes to work with Obsidian.

Language:JavaScriptMIT969 8 25

auraloss

Collection of audio-focused loss functions in PyTorch

Language:PythonApache-2.0695 18 35

WavAugment

A library for speech data augmentation in time-domain

Language:PythonMIT631 26 17

GigaSpeech

Large, modern dataset for speech recognition

Language:ShellApache-2.0617 19 60

textlesslib

Library for Textless Spoken Language Processing

Language:PythonMIT513 16 23

torchlibrosa

Language:PythonMIT453 6 6

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language:PythonMIT314 5 16

ocotillo

Performant and accurate speech recognition built on Pytorch

Language:PythonNOASSERTION240 9 4

penn

Pitch Estimating Neural Networks (PENN)

Language:PythonMIT213 10 10

shennong

A Python toolbox for speech features extraction

Language:PythonGPL-3.0157 24 7

diffwave-sashimi

Implementation of DiffWave and SaShiMi audio generation models

Language:PythonMIT112 5 11

notion-zotero

Create a Notion collection, synced with Zotero.

Language:PythonMIT76 2 1

notion

notion hosts a library of interactive widgets for @makenotion pages

Language:JavaScript72 10

Voice-conversion-evaluation

An evaluation toolkit for voice conversion models.

Language:Python39 2 3

audb

Manage audio and video databases

Language:PythonNOASSERTION23 4 151

musicgen_trainer

simple trainer for musicgen/audiocraft

Language:PythonAGPL-3.01500