Hitlab Studios's repositories
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Auto-Photoshop-StableDiffusion-Plugin
A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using Automatic1111-sd-webui as a backend.
ComfyUI
A powerful and modular stable diffusion GUI with a graph/nodes interface.
ComfyUI_examples
Examples of ComfyUI workflows
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
DWPose
Official implementation of the paper "Effective Whole-body Pose Estimation with Two-stages Distillation"
facefusion
Next generation face swapper and enhancer
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
iff-chatgpt-api-tutorial
Basic set up in python for ChatGPT API
LivePortrait
Bring portraits to life!
mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
NuGetForUnity
A NuGet Package Manager for Unity
obsidian-dataview
A high-performance data index and query language over Markdown files, for https://obsidian.md/.
open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
OpenAI-API-dotnet
An unofficial C#/.NET SDK for accessing the OpenAI GPT-3 API
OpenAI-Unity
An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.
privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
rembg
Rembg is a tool to remove images background
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
roop
one-click face swap
sd-webui-panorama-viewer
Sends rendered SD_auto1111 images quickly to this panorama (hdri, equirectangular) viewer
sd-webui-txt-img-to-3d-model
A custom extension for sd-webui that allow you to generate 3D model from txt or image, basing on OpenAI Shap-E.
StableDiffusion-CheatSheet
A list of StableDiffusion styles and some notes for offline use. Pure HTML, CSS and a bit of JS.
Thin-Plate-Spline-Motion-Model
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
WarpFusion
WarpFusion
whisper_real_time
Real time transcription with OpenAI Whisper.