clam004 / triton-ft-api

Tutorial on how to deploy a scalable autoregressive causal language model transformer using NVIDIA Triton Inference Server.
