Beast code in Giters

Helper script for cross compiling some media tools for windows, like customizable ffmpeg.exe (with or without non-free components, etc), and some other bonuses like mplayer, mp4box, mxf, etc.

GPL-3.0000

glew

The OpenGL Extension Wrangler Library

Language:CNOASSERTION000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT000

libssh

000

NativeSpeaker

make your Speaker talking as Native style with own voice！

Apache-2.0000

OpenVoice

Instant voice cloning by MyShell

NOASSERTION000

pingora

A library for building fast, reliable and evolvable network services.

Language:RustApache-2.0000

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

MIT000

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Apache-2.0000

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonMIT000

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Apache-2.0000

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookApache-2.0000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.0000

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT000

whisper.cpp

Port of OpenAI's Whisper model in C/C++

MIT000

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

BSD-4-Clause000