highstakes's starred repositories

vid2vid

PyTorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

Language: Python | License: NOASSERTION | Stargazers: 8540 | Issues: 0

3d-photo-inpainting

[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting

Language: Python | License: NOASSERTION | Stargazers: 6883 | Issues: 0

pix2pix

Image-to-image translation with conditional adversarial nets

Language: Lua | License: NOASSERTION | Stargazers: 10027 | Issues: 0

DragGAN

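Unofficial implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment for trial use; code and models are fully open source; supports Windows, macOS, and Linux)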

Language: Python | Stargazers: 4989 | Issues: 0

text-to-video-synthesis-colab

Text To Video Synthesis Colab

Language: Jupyter Notebook | License: Unlicense | Stargazers: 1426 | Issues: 0

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language: Python | License: MIT | Stargazers: 51514 | Issues: 0

sd-webui-segment-anything

Segment Anything for Stable Diffusion WebUI

Language: Python | Stargazers: 3331 | Issues: 0

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language: Python | License: Apache-2.0 | Stargazers: 8053 | Issues: 0

Face2FaceRHO

The Official PyTorch Implementation for Face2Face^ρ (ECCV2022)

Language: Python | License: BSD-3-Clause | Stargazers: 212 | Issues: 0

chaiNNer

A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and powerful programmatic image processing application.

Language: Python | License: GPL-3.0 | Stargazers: 4334 | Issues: 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12552 | Issues: 0

stable-diffusion-webui-rembg

Removes backgrounds from pictures. Extension for webui.

Language: Python | License: MIT | Stargazers: 1142 | Issues: 0

Auto-Photoshop-StableDiffusion-Plugin

A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.

Language: TypeScript | License: MIT | Stargazers: 6588 | Issues: 0

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python | License: AGPL-3.0 | Stargazers: 24941 | Issues: 0

lossless-cut

The swiss army knife of lossless video/audio editing

Language: TypeScript | License: GPL-2.0 | Stargazers: 25072 | Issues: 0

Stable-Diffusion

Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 1919 | Issues: 0

silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 4756 | Issues: 0

roop

one-click face swap

Language: Python | License: GPL-3.0 | Stargazers: 25835 | Issues: 0

flowframes

Flowframes Windows GUI for video interpolation using DAIN (NCNN) or RIFE (CUDA/NCNN)

Language: Python | License: GPL-3.0 | Stargazers: 1404 | Issues: 0

Pallaidium

PALLAIDIUM - a generative AI movie studio integrated in the Blender video editor.

Language: Python | License: GPL-3.0 | Stargazers: 871 | Issues: 0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python | License: Apache-2.0 | Stargazers: 6576 | Issues: 0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers: 2916 | Issues: 0

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language: Python | License: NOASSERTION | Stargazers: 4378 | Issues: 0

CogVideo

Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

Language: Python | License: Apache-2.0 | Stargazers: 3576 | Issues: 0

QualityScaler

QualityScaler - image/video AI upscaler app

Language: Python | License: MIT | Stargazers: 1879 | Issues: 0

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language: Python | License: NOASSERTION | Stargazers: 3936 | Issues: 0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language: Python | License: MIT | Stargazers: 21620 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | Stargazers: 11327 | Issues: 0

video2x

A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley II, 2018.

Language: Python | License: AGPL-3.0 | Stargazers: 9031 | Issues: 0

sd-webui-text2video

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Language: Python | License: NOASSERTION | Stargazers: 1274 | Issues: 0