uw11's starred repositories
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
StableStudio
Community interface for generative AI
LLaMA-LoRA-Tuner
UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.
llm-foundry
LLM training code for Databricks foundation models
babyagi4all-api
BabyAGI to run with locally hosted models using the API from https://github.com/oobabooga/text-generation-webui
chatbot-ui
AI chat for every model.
Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
pix2pix-zero
Zero-shot Image-to-Image Translation [SIGGRAPH 2023]
custom-diffusion-webui
An unofficial implementation of Custom Diffusion for Automatic1111's WebUI.
chatgpt-vscode
A VSCode extension that allows you to use ChatGPT
gpt-discord-bot
Example Discord bot written in Python that uses the completions API to have conversations with the `text-davinci-003` model, and the moderations API to filter the messages.
Auto-Photoshop-StableDiffusion-Plugin
A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.
dream-factory
Multi-threaded GUI manager for mass creation of AI-generated art with support for multiple GPUs.
DiffusionToolkit
Metadata-indexer and Viewer for AI-generated images