Mrc2023's repositories

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python, License: MIT, Stargazers: 0, Issues: 0

bark

🔊 Text-prompted Generative Audio Model

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language: Python, License: GPL-3.0, Stargazers: 0, Issues: 0

fish-diffusion

An easy to understand TTS / SVS / SVC framework

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

Stable-Diffusion

Best Stable Diffusion and AI Tutorials, Guides, News, Tips and Tricks

Language: Jupyter Notebook, License: GPL-3.0, Stargazers: 0, Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

Language: Python, License: AGPL-3.0, Stargazers: 0, Issues: 0

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language: Roff, License: BSD-3-Clause, Stargazers: 0, Issues: 0

tortoise-tts-fast

Fast TorToiSe inference (5x or your money back!)

Language: Python, License: AGPL-3.0, Stargazers: 0, Issues: 0

wav2lip-hq-updated-ESRGAN

Updated fork of wav2lip-hq allowing for the use of current ESRGAN models

Language: Python, Stargazers: 0, Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0