Tomoki Hayashi's repositories
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
PytorchWaveNetVocoder
WaveNet-Vocoder implementation with pytorch.
LibriTTSLabel
Alignment files of LibriTTS.
INTERSPEECH19_TUTORIAL
Interspeech 2019 tutorial materials
WaveNetVocoderSamples
WaveNet Vocoder Samples
NonARSeq2SeqVC
Non-autoregressive sequence-to-sequence voice conversion
asj-espnet2-tutorial
ESPnet2解説原稿付録
VCTKCorpusFullContextLabel
Full context label for VCTK Corpus.
AudioSamples
Audio samples
crank
Non-parallel Voice Conversion
denite.nvim
:dragon: Dark powered asynchronous unite all interfaces for Neovim/Vim8
DiscreTalk
Demo HP for DiscreTalk.
EfficientWord-Net
OneShot Learning-based hotword detection.
espnet_onnx
Onnx wrapper for espnet infrernce model
interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
nvim-jellybeans
Jellybeans inspired Neovim color scheme
sprocket
Voice Conversion Tool Kit
torchpack
A neural network training interface based on PyTorch, with a focus on flexibility
VideoX
VideoX: a collection of video cross-modal models