meru's repositories
DeepLearningExamples
Deep Learning Examples
fewshot-font-generation
The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).
FT-w2v2-ser
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
lhotse
Tools for handling speech data in machine learning projects.
mammoth.js
Convert Word documents (.docx files) to HTML
phrase_break_prediction
Scripts for training a phrase break prediction system
pyJuliusAlign
One-button-press forced aligner for Japanese, using Julius.
pyopenjtalk
Python wrapper for OpenJTalk
pyvcroid2
Python Library to Access to Core DLL of VOICEROID2
rvc-webui
This project is a fork of liujing04/Retrieval-based-Voice-Conversion-WebUI
soxan
Wav2Vec for speech recognition, classification, and audio classification
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
StyleTTS
Official Implementation of StyleTTS
TTSController
各種 Text-to-Speech エンジンを統一的に操作するライブラリです
vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
voiceroid_daemon
VOICEROID2のHTTPサーバーデーモン
voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
voicevox_cli_client
VOICEVOX ENGINE、COEIROINK用コマンドラインクライアント。複数のエンジンを使用した並列処理もできます