Kei Akuzawa (akuzeee)

Kei Akuzawa's starred repositories

code2flow

Pretty good call graphs for dynamic languages

Language: Python | License: MIT | Stargazers: 3843 | Issues: 0

chronos-forecasting

Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

Language: Python | License: Apache-2.0 | Stargazers: 2154 | Issues: 0

wfstpy

Weighted Finite State Transducers and algorithms

Language: Python | License: NOASSERTION | Stargazers: 2 | Issues: 0

freqtrade

Free, open source crypto trading bot

Language: Python | License: GPL-3.0 | Stargazers: 26668 | Issues: 0

Language: Rust | License: LGPL-3.0 | Stargazers: 10 | Issues: 0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10588 | Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 3968 | Issues: 0

voice-generator-webui

A multi-speaker, multilingual speech generation tool

Language: Jupyter Notebook | License: MIT | Stargazers: 150 | Issues: 0

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language: Python | License: MIT | Stargazers: 41920 | Issues: 0

vall-e

PyTorch implementation of VALL-E (zero-shot text-to-speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | Stargazers: 1954 | Issues: 0

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python | License: MIT | Stargazers: 7462 | Issues: 0

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language: Python | License: MIT | Stargazers: 1385 | Issues: 0

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Language: Python | License: MIT | Stargazers: 1748 | Issues: 0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20302 | Issues: 0

vits2_pytorch

Unofficial VITS2 TTS implementation in PyTorch

Language: Python | License: MIT | Stargazers: 467 | Issues: 0

Awesome-LLMOps

An awesome, curated list of the best LLMOps tools for developers

Language: Shell | License: CC0-1.0 | Stargazers: 3498 | Issues: 0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language: Python | License: MIT | Stargazers: 21522 | Issues: 0

yt-dlp

A feature-rich command-line audio/video downloader

Language: Python | License: Unlicense | Stargazers: 77848 | Issues: 0

scrape-youtube

A lightning fast package to scrape YouTube search results

Language: JavaScript | License: MIT | Stargazers: 105 | Issues: 0

metaseq

Repo for external large-scale work

Language: Python | License: MIT | Stargazers: 6441 | Issues: 0

awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

License: Apache-2.0 | Stargazers: 72 | Issues: 0

audiolm-pytorch

Implementation of AudioLM, a SOTA language modeling approach to audio generation out of Google Research, in PyTorch

Language: Python | License: MIT | Stargazers: 2335 | Issues: 0

espnet_onnx

ONNX wrapper for espnet inference model

Language: Python | License: MIT | Stargazers: 149 | Issues: 0

diart

A Python package to build AI-powered real-time audio applications

Language: Python | License: MIT | Stargazers: 925 | Issues: 0

dr-doc-search

Converse with a book, built with GPT-3

Language: Python | License: MIT | Stargazers: 598 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 207 | Issues: 0

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language: Python | License: MIT | Stargazers: 907 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8163 | Issues: 0

multimodal-vae-public

A PyTorch implementation of "Multimodal Generative Models for Scalable Weakly-Supervised Learning" (https://arxiv.org/abs/1802.05335)

Language: Python | License: MIT | Stargazers: 151 | Issues: 0