triton-inference-server / triton_cli

Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.


Documentation suggestion

IAINATDBI opened this issue · comments

It might be worth adding a note that when serving an LLM from the CLI within a container, the `triton start` command does not return, so you need to open a new shell with `docker exec` in order to run any `infer` commands. This might be obvious, but it would help with understanding the overall process.
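As a rough sketch of the workflow being described (the container name `triton` and image tag here are placeholders, not from the original report):

```shell
# Terminal 1: start the server inside the container.
# This command blocks and does not return while the server is running.
docker exec -ti triton triton start

# Terminal 2: since the shell above is occupied, attach a second shell
# to the same container to interact with the running server.
docker exec -ti triton triton infer -m my_model --prompt "Hello"
```

The key point is simply that `triton start` is a foreground process, so any follow-up `triton` commands need their own shell session in the same container.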

I did this successfully. I really like the tensor detail that comes back from the `infer` command.

Cheers

Hi @IAINATDBI, thanks for calling this out. We'll try to improve this clarification.