merumeru-rururu

meru's repositories

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0100

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonMIT100

additional_openjtalk_dic

Language:PythonNOASSERTION000

DeepLearningExamples

Deep Learning Examples

Language:Python000

fewshot-font-generation

The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).

Language:PythonNOASSERTION000

FT-w2v2-ser

Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition

Language:PythonMIT000

groonga-command-token-count

Language:CLGPL-2.1000

groonga-tokenizer-yangram

Language:CLGPL-2.1000

huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

Language:PythonMIT000

jtubespeech

Apache-2.0000

lhotse

Tools for handling speech data in machine learning projects.

Apache-2.0000

mammoth.js

Convert Word documents (.docx files) to HTML

Language:JavaScriptBSD-2-Clause000

phrase_break_prediction

Scripts for training a phrase break prediction system

MIT000

pyJuliusAlign

One-button-press forced aligner for Japanese, using Julius.

Language:PythonNOASSERTION000

pyopenjtalk

Python wrapper for OpenJTalk

Language:PythonNOASSERTION000

pyvcroid2

Python Library to Access to Core DLL of VOICEROID2

MIT000

rvc-webui

This project is a fork of liujing04/Retrieval-based-Voice-Conversion-WebUI

Language:Python000

soxan

Wav2Vec for speech recognition, classification, and audio classification

Language:Jupyter NotebookApache-2.0000

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

MIT000

StyleTTS

Official Implementation of StyleTTS

MIT000

TTSController

各種 Text-to-Speech エンジンを統一的に操作するライブラリです

Apache-2.0000

ttsQuestV3Voicevox

NOASSERTION000

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, WIP

MIT000

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

MIT000

voiceroid_daemon

VOICEROID2のHTTPサーバーデーモン

MIT000

voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Language:PythonApache-2.0000

voicevox_cli_client

VOICEVOX ENGINE、COEIROINK用コマンドラインクライアント。複数のエンジンを使用した並列処理もできます

MIT000