./llama-server
massudy opened this issue · comments
Please build llama-server as well. The BitNet performance test is not complete without a llama-server performance test.
Official inference framework for 1-bit LLMs