Beast code in Giters

Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.

Language:PythonApache-2.0700

AudioSep

implementation of "Separate Anything You Describe"

Language:PythonMIT600

sd-wav2lip-uhq

Wav2Lip UHQ extension for Automatic1111

Language:PythonApache-2.0600

tpsm

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Language:Jupyter NotebookMIT5 10

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.0400

dream

Generative Gaussian Splatting for Efficient 3D Content Creation

Language:PythonMIT300

PixArt-alpha

Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonAGPL-3.0300

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonNOASSERTION300

SdPaint

Stable Diffusion Painting

Language:PythonMIT300

vid2densepose

Convert your videos to densepose and use it on MagicAnimate

Language:PythonMIT300

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language:PythonApache-2.0300

a11

Stable Diffusion web UI

Language:PythonAGPL-3.0200

audio-webui

A webui for different audio related Neural Networks

Language:PythonMIT100

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT100

bitsandbytes-windows

8-bit CUDA functions for PyTorch in Windows 10

Language:PythonMIT100

OogaBooga

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

Language:PythonAGPL-3.0100

OpenFace

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

Language:MATLABNOASSERTION100

piper

A fast, local neural text to speech system

Language:C++MIT100

Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Language:PythonNOASSERTION100

StabilityMatrix

Multi-Platform Package Manager for Stable Diffusion

Language:C#AGPL-3.0100