meru (merumeru-rururu)

merumeru-rururu

Geek Repo

Github PK Tool:Github PK Tool

meru's repositories

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

DeepLearningExamples

Deep Learning Examples

Language:PythonStargazers:0Issues:0Issues:0

fewshot-font-generation

The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

FT-w2v2-ser

Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:CLicense:LGPL-2.1Stargazers:0Issues:0Issues:0
Language:CLicense:LGPL-2.1Stargazers:0Issues:0Issues:0

huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

lhotse

Tools for handling speech data in machine learning projects.

License:Apache-2.0Stargazers:0Issues:0Issues:0

mammoth.js

Convert Word documents (.docx files) to HTML

Language:JavaScriptLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

phrase_break_prediction

Scripts for training a phrase break prediction system

License:MITStargazers:0Issues:0Issues:0

pyJuliusAlign

One-button-press forced aligner for Japanese, using Julius.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

pyopenjtalk

Python wrapper for OpenJTalk

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

pyvcroid2

Python Library to Access to Core DLL of VOICEROID2

License:MITStargazers:0Issues:0Issues:0

rvc-webui

This project is a fork of liujing04/Retrieval-based-Voice-Conversion-WebUI

Language:PythonStargazers:0Issues:0Issues:0

soxan

Wav2Vec for speech recognition, classification, and audio classification

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

License:MITStargazers:0Issues:0Issues:0

StyleTTS

Official Implementation of StyleTTS

License:MITStargazers:0Issues:0Issues:0

TTSController

各種 Text-to-Speech エンジンを統一的に操作するライブラリです

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, WIP

License:MITStargazers:0Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

License:MITStargazers:0Issues:0Issues:0

voiceroid_daemon

VOICEROID2のHTTPサーバーデーモン

License:MITStargazers:0Issues:0Issues:0

voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

voicevox_cli_client

VOICEVOX ENGINE、COEIROINK用コマンドラインクライアント。複数のエンジンを使用した並列処理もできます

License:MITStargazers:0Issues:0Issues:0