Lightning-Universe / stable-diffusion-deploy

Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load balancing, orchestration, pre-provisioning, dynamic batching, GPU inference, and micro-services working together via the Lightning Apps framework.

https://lightning.ai/muse

model doesn't work properly with concurrent requests

aniketmaurya opened this issue 2 years ago

Aniket Maurya commented 2 years ago

The model doesn't work properly with concurrent requests, and Gradio does not respect enable_queue=True when the REST API is accessed directly.

Luca Antiga commented 2 years ago

We should definitely add our own queue in the app, just a Python queue, nothing fancy.