natlamir

natlamir

Geek Repo

Github PK Tool:Github PK Tool

natlamir's repositories

Wav2Lip-WebUI

A wav2lip Web UI using Gradio

Language:PythonStargazers:49Issues:5Issues:0

PiperUI

A UI for the Piper TTS

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Language:PythonStargazers:35Issues:1Issues:0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:33Issues:1Issues:0

DINet-UI

Windows Forms user interface for making lip sync videos with DINet and OpenFace

LLaVA-Windows

[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.

Language:PythonLicense:Apache-2.0Stargazers:21Issues:0Issues:0

tortoise-WebUI

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:21Issues:2Issues:0

ProjectFiles

Where I will be storing misc files with details / links used during the installation process, etc

Language:Jupyter NotebookStargazers:11Issues:2Issues:0

magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:7Issues:0Issues:0

OnlySpeakTTS

Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.

Language:PythonLicense:Apache-2.0Stargazers:7Issues:0Issues:0

AudioSep

implementation of "Separate Anything You Describe"

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

sd-wav2lip-uhq

Wav2Lip UHQ extension for Automatic1111

Language:PythonLicense:Apache-2.0Stargazers:6Issues:0Issues:0

tpsm

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Language:Jupyter NotebookLicense:MITStargazers:5Issues:1Issues:0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

dream

Generative Gaussian Splatting for Efficient 3D Content Creation

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

PixArt-alpha

Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:3Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:3Issues:0Issues:0

SdPaint

Stable Diffusion Painting

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

vid2densepose

Convert your videos to densepose and use it on MagicAnimate

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language:PythonLicense:Apache-2.0Stargazers:3Issues:0Issues:0

a11

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:2Issues:0Issues:0

audio-webui

A webui for different audio related Neural Networks

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

bitsandbytes-windows

8-bit CUDA functions for PyTorch in Windows 10

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

OogaBooga

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:0Issues:0

OpenFace

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

Language:MATLABLicense:NOASSERTIONStargazers:1Issues:0Issues:0

piper

A fast, local neural text to speech system

Language:C++License:MITStargazers:1Issues:0Issues:0

Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

StabilityMatrix

Multi-Platform Package Manager for Stable Diffusion

Language:C#License:AGPL-3.0Stargazers:1Issues:0Issues:0