Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
yjymickey opened this issue a year ago · comments
if i use 750 for Maximum prompt token count, the input size must less than 75 word.So how can I solve the problem.are there any other ways to use tensorrt with 500-750 word or 7500 token