RidiculousDeath's starred repositories

AppManager

A full-featured package manager and viewer for Android

Language:JavaLicense:NOASSERTIONStargazers:4519Issues:0Issues:0

Microsoft-Activation-Scripts

A Windows and Office activator using HWID / Ohook / KMS38 / Online KMS activation methods, with a focus on open-source code and fewer antivirus detections.

Language:BatchfileLicense:GPL-3.0Stargazers:87023Issues:0Issues:0

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

Language:TypeScriptLicense:MITStargazers:1497Issues:0Issues:0

Whisper-WebUI

A Web UI for easy subtitle using whisper model.

Language:PythonLicense:Apache-2.0Stargazers:807Issues:0Issues:0

subsync

Subtitle Speech Synchronizer

Language:C++License:GPL-3.0Stargazers:1240Issues:0Issues:0

subgen

Autogenerate subtitles using OpenAI Whisper Model via Jellyfin, Plex, Emby, Tautulli, or Bazarr

Language:PythonLicense:MITStargazers:450Issues:0Issues:0

yt-dlp

A feature-rich command-line audio/video downloader

Language:PythonLicense:UnlicenseStargazers:77086Issues:0Issues:0

voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer

Language:PythonLicense:NOASSERTIONStargazers:15401Issues:0Issues:0

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonLicense:MITStargazers:16668Issues:0Issues:0

whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Language:SvelteLicense:AGPL-3.0Stargazers:1177Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20235Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:64705Issues:0Issues:0

Pandrator

Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with subtitles and more) using local models (XTTS, Silero or VoiceCraft), plus voice cloning, LLM pre-processing, RVC enhancement, and automatic evaluation

Language:PythonLicense:AGPL-3.0Stargazers:124Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:459Issues:0Issues:0

xtts-webui

Webui for using XTTS and for finetuning it

Language:PythonLicense:MITStargazers:459Issues:0Issues:0

alltalk_tts

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.

Language:HTMLLicense:AGPL-3.0Stargazers:685Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27321Issues:0Issues:0

buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Language:PythonLicense:MITStargazers:10987Issues:0Issues:0

InvokeAI

InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.

Language:TypeScriptLicense:Apache-2.0Stargazers:22349Issues:0Issues:0

MagiskOnWSALocal

Integrate Magisk root and Google Apps into WSA (Windows Subsystem for Android)

Language:ShellLicense:AGPL-3.0Stargazers:9299Issues:0Issues:0

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2974Issues:0Issues:0

piper

A fast, local neural text to speech system

Language:C++License:MITStargazers:5181Issues:0Issues:0

audio-webui

A webui for different audio related Neural Networks

Language:PythonLicense:MITStargazers:968Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:21152Issues:0Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12479Issues:0Issues:0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonLicense:MITStargazers:4451Issues:0Issues:0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7247Issues:0Issues:0

sd-forge-couple

An Extension for Forge Webui that implements Attention Couple

Language:PythonLicense:GPL-3.0Stargazers:167Issues:0Issues:0

ComfyUI-Manager

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.

Language:JavaScriptLicense:GPL-3.0Stargazers:4812Issues:0Issues:0

superprompter

Supercharge your AI/LLM prompts

Language:PythonLicense:MITStargazers:67Issues:0Issues:0