Dan Lyth (eonglints)

eonglints

Geek Repo

Company:Stability AI

Twitter:@danlyth

Github PK Tool:Github PK Tool

Dan Lyth's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:65282Issues:545Issues:0

foam

A personal knowledge management and sharing system for VSCode

Language:TypeScriptLicense:NOASSERTIONStargazers:15100Issues:121Issues:689

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10317Issues:107Issues:18

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language:Jupyter NotebookLicense:MITStargazers:3625Issues:73Issues:96

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonLicense:MITStargazers:3331Issues:58Issues:70

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language:PythonLicense:MITStargazers:3101Issues:99Issues:53

pytorch-optimizer

torch-optimizer -- collection of optimizers for Pytorch

Language:PythonLicense:Apache-2.0Stargazers:2996Issues:33Issues:63

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:2335Issues:60Issues:167

s4

Structured state space sequence models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2287Issues:52Issues:132

notero

A Zotero plugin for syncing items and notes into Notion

Language:TypeScriptLicense:MITStargazers:2152Issues:26Issues:227

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonLicense:MITStargazers:1888Issues:40Issues:43

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonLicense:MITStargazers:1775Issues:21Issues:180

mt3

MT3: Multi-Task Multitrack Music Transcription

Language:PythonLicense:Apache-2.0Stargazers:1379Issues:26Issues:89

Notion-to-Obsidian-Converter

Converts exported Notion notes to work with Obsidian.

Language:JavaScriptLicense:MITStargazers:969Issues:8Issues:25

auraloss

Collection of audio-focused loss functions in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:695Issues:18Issues:35

WavAugment

A library for speech data augmentation in time-domain

Language:PythonLicense:MITStargazers:631Issues:26Issues:17

GigaSpeech

Large, modern dataset for speech recognition

Language:ShellLicense:Apache-2.0Stargazers:617Issues:19Issues:60

textlesslib

Library for Textless Spoken Language Processing

Language:PythonLicense:MITStargazers:513Issues:16Issues:23

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language:PythonLicense:MITStargazers:314Issues:5Issues:16

ocotillo

Performant and accurate speech recognition built on Pytorch

Language:PythonLicense:NOASSERTIONStargazers:240Issues:9Issues:4

penn

Pitch Estimating Neural Networks (PENN)

Language:PythonLicense:MITStargazers:213Issues:10Issues:10

shennong

A Python toolbox for speech features extraction

Language:PythonLicense:GPL-3.0Stargazers:157Issues:24Issues:7

diffwave-sashimi

Implementation of DiffWave and SaShiMi audio generation models

Language:PythonLicense:MITStargazers:112Issues:5Issues:11

notion-zotero

Create a Notion collection, synced with Zotero.

Language:PythonLicense:MITStargazers:76Issues:2Issues:1

notion

notion hosts a library of interactive widgets for @makenotion pages

Language:JavaScriptStargazers:72Issues:1Issues:0

Voice-conversion-evaluation

An evaluation toolkit for voice conversion models.

audb

Manage audio and video databases

Language:PythonLicense:NOASSERTIONStargazers:23Issues:4Issues:151

musicgen_trainer

simple trainer for musicgen/audiocraft

Language:PythonLicense:AGPL-3.0Stargazers:15Issues:0Issues:0