Akira Tamamori's repositories
speech_process_exercise
音声情報処理n本ノックを目指して
ace-isearch
A seamless bridge between isearch, ace-jump-mode, avy, and helm-swoop.
music_process_exercise
音楽情報処理n本ノックを目指して
onoma-to-wave_transformer
Unofficial implementations of environmental sound synthesis system with Transformer
onoma-to-wave
Unofficial implementations of Onoma-to-Wave
emotion_separator
Provides a neural network which implements a functionarity to separate the emotional component from the x-vector.
minimal-sqvae
A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony
complexPyTorch
A high-level toolbox for using complex valued neural networks in PyTorch
deep_divergence_practice
Implementations of "Deep Divergence Learning" to reproduce the experimental results
diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
nussl
A flexible source separation library in Python
pyod
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
soxbindings
Python bindings for SoX, aiming to replicate a subset of the command line sox utility.
spherecluster
Clustering routines for the unit sphere
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
wavegrad
A fast, high-quality neural vocoder.
xvector_jtubespeech
xvector model on jtubespeech