hypereikon lab's starred repositories
Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
multidiffusion-upscaler-for-automatic1111
Tiled Diffusion and VAE optimization, licensed under CC BY-NC-SA 4.0
recognize-anything
Strong open-source foundation models for image recognition.
Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
diffusion-nbs
Getting started with diffusion
DiffusionFastForward
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
anthology-of-modern-ml
Collection of important articles to be treated as a textbook
UnpromptedControl
Remove unwanted objects and restore images without prompts, powered by ControlNet.
mixture-of-diffusers
Mixture of Diffusers for scene composition and high resolution image generation
musicgen_trainer
Simple trainer for MusicGen/Audiocraft
creative_ml
Creative Machine Learning course and notebook tutorials in JAX, PyTorch and NumPy
sample-diffusion
A Python library and CLI for generating audio samples using Harmonai Dance Diffusion models.