wonwizard's repositories
SpeechRecognition
Speech Recognition
ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
dalle-mini
DALL·E Mini - Generate images from a text prompt
DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
DualStyleGAN
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
facefusion
Next generation face swapper and enhancer
GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Linux-StableDiffusion-Script
A simple script to automate the installation and running of the hlky Stable Diffusion fork for Linux users. Please see my guide for running this on Linux: https://rentry.org/linux-sd
magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
NeRF-Factory
An awesome PyTorch NeRF library
nerf-pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
rembg
Rembg is a tool to remove image backgrounds.
roop
One-click face swap
ShortGPT
🚀🎬 ShortGPT - Experimental AI framework for automated short/video content creation.
stable-diffusion-webui
Stable Diffusion web UI
stable-diffusion-webui-1
Stable Diffusion web UI
Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Thin-Plate-Spline-Motion-Model
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
upscayl
🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, macOS and Windows built with Linux-First philosophy.
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech