Comparison with TVM
lucasjinreal opened this issue · comments
MagicSource commented
Does it run faster than PyTorch or TVM now? Or llama.cpp?
Joe Fioti commented
Currently it's faster than PyTorch for LLMs on Metal, and about 10-20% slower than llama.cpp; unsure about TVM. Proper benchmarks (#21) are a goal I want to get done soon.
MagicSource commented
That's pretty good. How about on CUDA?