Free Pascal bindings for llama.cpp. This allows running inference for Facebook's LLaMA model on a CPU with good performance using full-precision, f16, or 4-bit quantized versions of the model.
MIT License