Robin Scheibler (fakufaku)

Company: LINE Corporation

Location: Japan

Home Page: http://www.robinscheibler.org

Twitter: @fakufakurevenge


Organizations
BioDesignRealWorld
pyroom
Safecast
TokyoHackerspace

Robin Scheibler's starred repositories

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 32823 | Issues: 348 | Issues: 295

spotify-downloader

Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).

Language: Python | License: MIT | Stargazers: 15329 | Issues: 187 | Issues: 1430

mlx

MLX: An array framework for Apple silicon

pytube

A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.

Language: Python | License: Unlicense | Stargazers: 10491 | Issues: 193 | Issues: 1277

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python | License: MIT | Stargazers: 6400 | Issues: 53 | Issues: 196

FriendsDontLetFriends

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

marimo

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

Language: Python | License: Apache-2.0 | Stargazers: 4984 | Issues: 25 | Issues: 372

Resemblyzer

A python package to analyze and compare voices with deep learning

Language: Python | License: Apache-2.0 | Stargazers: 2631 | Issues: 72 | Issues: 79

beartype

Unbearably fast near-real-time hybrid runtime-static type-checking in pure Python.

Language: Python | License: MIT | Stargazers: 2473 | Issues: 15 | Issues: 308

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language: Python | License: MIT | Stargazers: 1654 | Issues: 27 | Issues: 211

ai-audio-startups

Community list of startups working with AI in audio and music technology

fpdf2

Simple PDF generation for Python

Language: Python | License: LGPL-3.0 | Stargazers: 973 | Issues: 22 | Issues: 429

kmcuda

Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 778 | Issues: 30 | Issues: 103

melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Language: Python | License: BSD-3-Clause | Stargazers: 626 | Issues: 30 | Issues: 59

LanguageAgentTreeSearch

Official repository for ICML'24 paper "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Language: Python | License: MIT | Stargazers: 506 | Issues: 9 | Issues: 18

survey

A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf

tagainijisho

A free Japanese dictionary and learning assistant

Language: C++ | License: GPL-3.0 | Stargazers: 350 | Issues: 30 | Issues: 216

ZeroSpeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

tacotron_pytorch

PyTorch implementation of Tacotron speech synthesis model.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 305 | Issues: 16 | Issues: 21

PyTorch-Wavelet-Toolbox

Differentiable fast wavelet transforms in PyTorch with GPU support.

Language: Python | License: EUPL-1.2 | Stargazers: 252 | Issues: 7 | Issues: 22

whisper-finetuning

[WIP] Scripts for fine-tuning Whisper

Language: Python | License: MIT | Stargazers: 195 | Issues: 7 | Issues: 19

SpeechMOS

Easy-to-Use Speech MOS predictors

Language: Python | License: MIT | Stargazers: 167 | Issues: 7 | Issues: 11

Python_Simulations

Various Python Simulations

Language: Jupyter Notebook | Stargazers: 99 | Issues: 3 | Issues: 0

ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

License: NOASSERTION | Stargazers: 75 | Issues: 18 | Issues: 0

DPMTSE

A Diffusion Probabilistic Model for Target Sound Extraction

Language: Python | Stargazers: 25 | Issues: 0 | Issues: 0

WER-CER

Calculator Tool of Word Error Rate and Character Error Rate

Language: Python | License: MIT | Stargazers: 9 | Issues: 2 | Issues: 0

self-remixing

Official implementation of Self-Remixing

Language: Python | License: MIT | Stargazers: 9 | Issues: 0 | Issues: 0