v0xie

v0xie

Geek Repo

Home Page:www.voxie3d.com

Twitter:@voxie3d

Github PK Tool:Github PK Tool

v0xie's repositories

sd-webui-incantations

Enhance Stable Diffusion image quality, prompt following, and more through multiple implementations of novel algorithms for Automatic1111 WebUI.

Language:PythonLicense:GPL-3.0Stargazers:105Issues:1Issues:20

sd-webui-cads

Greatly increase the diversity of your generated images in Automatic1111 WebUI through Condition-Annealed Sampling.

Language:PythonLicense:GPL-3.0Stargazers:90Issues:4Issues:15

sd-webui-semantic-guidance

Unofficial implementation of "SEGA: Instructing Text-to-Image Models using Semantic Guidance". Semantic Guidance gives you more control over the semantics of an image given an additional text prompt. An extension for Automatic1111 WebUI.

Language:PythonLicense:GPL-3.0Stargazers:64Issues:1Issues:11

sd-webui-agentattention

Speed up image generation and improve image quality using Agent Attention.

Language:PythonLicense:GPL-3.0Stargazers:38Issues:2Issues:5

BakeActionsToShapekeys

Blender script to bake armature actions to shape keys

Language:PythonLicense:GPL-3.0Stargazers:1Issues:1Issues:1

CharacteristicGuidanceWebUI

Provide large guidance scale correction for Stable Diffusion web UI (AUTOMATIC1111)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

efficientspeech

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LyCORIS

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

OpenVR-Tracker-Websocket-Driver

A driver to connect to SteamVR using a websocket interface and create trackers and get device data.

Language:C++Stargazers:0Issues:0Issues:0

Poi8LTCGIAdapter

LTCGI in Poiyomi 8

License:MITStargazers:0Issues:1Issues:2

OSCmooth

Create smooth parameters that mimic IK Sync for OSC or general use.

Language:C#Stargazers:0Issues:0Issues:0

pywhispercpp

Python bindings for whisper.cpp

Language:C++License:MITStargazers:0Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

sptlrx

Timesynced lyrics in your terminal

Language:GoLicense:MITStargazers:0Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

stable-diffusion-webui-extensions

Extension index for stable-diffusion-webui

Stargazers:0Issues:0Issues:0

swin2sr

Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration at the Advances in Image Manipulation (AIM) workshop ECCV 2022, Tel Aviv

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

text-generation-webui

A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

TEXTurePaper

Official Implementation for "TEXTure: Semantic Texture Transfer using Text Tokens"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VoroMesh

Code for the VoroMesh paper

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VRCPlayersOnlyMirror

A simple mirror prefab for mirrors that show players only without any background

Language:ShaderLabStargazers:0Issues:0Issues:0

whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0