v0xie's repositories
sd-webui-incantations
Enhance Stable Diffusion image quality, prompt following, and more through multiple implementations of novel algorithms for Automatic1111 WebUI.
sd-webui-cads
Greatly increase the diversity of your generated images in Automatic1111 WebUI through Condition-Annealed Sampling.
sd-webui-semantic-guidance
Unofficial implementation of "SEGA: Instructing Text-to-Image Models using Semantic Guidance". Semantic Guidance gives you more control over the semantics of an image given an additional text prompt. An extension for Automatic1111 WebUI.
sd-webui-agentattention
Speed up image generation and improve image quality using Agent Attention.
BakeActionsToShapekeys
Blender script to bake armature actions to shape keys
CharacteristicGuidanceWebUI
Provide large guidance scale correction for Stable Diffusion web UI (AUTOMATIC1111)
efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
sd-webui-RK-Sampler
Batched Runge-Kutta Samplers for Automatic1111 WebUI
stable-diffusion-webui
Stable Diffusion web UI
bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
HierSpeechpp
The official implementation of HierSpeech++
LyCORIS
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
OpenVR-Tracker-Websocket-Driver
A driver to connect to SteamVR using a websocket interface and create trackers and get device data.
Poi8LTCGIAdapter
LTCGI in Poiyomi 8
Ouroboros3D
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
pywhispercpp
Python bindings for whisper.cpp
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
stable-diffusion-webui-extensions
Extension index for stable-diffusion-webui
swin2sr
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration at the Advances in Image Manipulation (AIM) workshop ECCV 2022, Tel Aviv
text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
TEXTurePaper
Official Implementation for "TEXTure: Semantic Texture Transfer using Text Tokens"
VoroMesh
Code for the VoroMesh paper
VRCPlayersOnlyMirror
A simple mirror prefab for mirrors that show players only without any background
whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.