Beast code in Giters

Easily create text-to-speech models in any voice for rhasspy/piper. Make a text-to-speech model with your own voice recordings, or use thousands of RVC voices. Works offline on a Raspberry Pi.

Language: Shell | License: MIT | 149 0 0

ssr_eval

Evaluation and Benchmarking of Speech Super-resolution Methods

Language: Python | 125 4 11

timething

Timething is a library for aligning text transcripts with their audio recordings.

Language: Jupyter Notebook | License: MIT | 80 1 21

audio-preprocessing-scripts

Scripts for automated dataset creation

Language: Python | License: MIT | 70 3 2

Multi-Singer.github.io

Language: SCSS | License: NOASSERTION | 63 2 1

UnitySpeechToText

A native Unity plugin to convert speech to text on Android & iOS

Language: C# | License: MIT | 61 3 3

bandit

BandIt: Cinematic Audio Source Separation

Language: Python | License: Apache-2.0 | 52 0 0

StyleTalk

Official release of StyleTalk dataset.

License: MIT | 42 0 0

FlashSpeech

FlashSpeech: Efficient Zero-Shot Speech Synthesis

29 0 0

audio_diarization_annotation

Audio Diarization Annotation tool

Language: JavaScript | License: Apache-2.0 | 21 0 0

gazelle-train

Joint speech-language model - respond directly to audio!

Language: Python | License: Apache-2.0 | 21 0 0

nlp-rus-zaliz

Processing the grammar dictionary of A. A. Zaliznyak for morphological inflection

Language: Ada | License: GPL-3.0 | 16 0 0

Lightvoc

LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform

Language: Jupyter Notebook | 15 0 0

MINETrans-IWSLT23

Official implementation of our IWSLT 2023 paper "The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks"

Language: Python | 14 3 0