How about using ModelMesh to serve thousands of Stable Diffusion models?
Jack47 opened this issue
Jack Chen commented
I want to use ModelMesh to serve thousands of Stable Diffusion models. Any advice would be appreciated~
- I'm using Triton as the serving runtime. Inference time is about 3~10s per request.
- I'm using Triton ensembles for business logic like auditing and watermarking; these may become standalone services in the future.
- Currently every model has its own Kubernetes Service and Ingress rules (see the sketch after this list).
Goals:
- achieve higher cluster resource utilization, especially of GPUs
- keep the latency of every inference request as low as possible
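
For reference, a minimal sketch of how one model could be registered with ModelMesh-Serving instead of getting its own Deployment, Service, and Ingress. The model name `sd-model-0001` and the bucket path `s3://models/sd-model-0001` are hypothetical placeholders, and it assumes the model has been exported to a Triton-supported format such as ONNX:

```yaml
# Hypothetical example: one InferenceService per model, served by ModelMesh.
# The deploymentMode annotation tells KServe to place this model on the
# shared ModelMesh pods rather than creating a dedicated deployment.
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: sd-model-0001                      # hypothetical; one CR per model
  annotations:
    serving.kserve.io/deploymentMode: ModelMesh
spec:
  predictor:
    model:
      modelFormat:
        name: onnx                         # assumes a Triton-supported export
      runtime: triton-2.x                  # built-in Triton ServingRuntime
      storageUri: s3://models/sd-model-0001  # hypothetical bucket/path
```

With this pattern, all models share the ModelMesh runtime pods and a single serving endpoint; ModelMesh loads and unloads models across those pods on demand, which is what drives up GPU utilization, and requests select a model by name rather than by per-model Service or Ingress rule.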
Christian Kadner commented
@Jack47 -- were you able to use ModelMesh-Serving for your Stable Diffusion models? Did you run into any specific issues?
Wikipedia thinks it should look like this :-)
Jack Chen commented
Currently we don't use ModelMesh. Thanks for your response.