natlamir's repositories
Wav2Lip-WebUI
A wav2lip Web UI using Gradio
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
LLaVA-Windows
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
tortoise-WebUI
A multi-voice TTS system trained with an emphasis on quality
ProjectFiles
Where I store miscellaneous files, with details and links used during the installation process
magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
OnlySpeakTTS
Slight modifications to the Tortoise API, plus 3 additional scripts that make Tortoise easier to use. Less focus on cloning makes speech generation much faster by default.
sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
vid2densepose
Convert your videos to densepose and use it on MagicAnimate
zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
audio-webui
A web UI for various audio-related neural networks
bitsandbytes-windows
8-bit CUDA functions for PyTorch in Windows 10
StabilityMatrix
Multi-Platform Package Manager for Stable Diffusion