akuzeee

followers

0

following

stars

Kei Akuzawa's starred repositories

code2flow

Pretty good call graphs for dynamic languages

Language:PythonMIT384300

chronos-forecasting

Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

Language:PythonApache-2.0215400

wfstpy

Weighted Finite State Transducers and algorithms

Language:PythonNOASSERTION200

open-tts-tracker

freqtrade

Free, open source crypto trading bot

Language:PythonGPL-3.02666800

rbot

Language:RustLGPL-3.01000

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookNOASSERTION1058800

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonApache-2.0396800

voice-generator-webui

A multi-speaker, multilingual speech generation tool

Language:Jupyter NotebookMIT15000

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language:PythonMIT4192000

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonApache-2.0195400

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT746200

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonMIT138500

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Language:PythonMIT174800

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT2030200

vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Language:PythonMIT46700

Awesome-LLMOps

An awesome & curated list of best LLMOps tools for developers

Language:ShellCC0-1.0349800

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonMIT2152200

yt-dlp

A feature-rich command-line audio/video downloader

Language:PythonUnlicense7784800

scrape-youtube

A lightning fast package to scrape YouTube search results

Language:JavaScriptMIT10500

metaseq

Repo for external large-scale work

Language:PythonMIT644100

awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

Apache-2.07200

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonMIT233500

espnet_onnx

Onnx wrapper for espnet infrernce model

Language:PythonMIT14900

diart

A python package to build AI-powered real-time audio applications

Language:PythonMIT92500

dr-doc-search

Converse with book - Built with GPT-3

Language:PythonMIT59800

jtubespeech

Language:PythonApache-2.020700

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonMIT90700

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0816300

multimodal-vae-public

A PyTorch implementation of "Multimodal Generative Models for Scalable Weakly-Supervised Learning" (https://arxiv.org/abs/1802.05335)

Language:PythonMIT15100