karpathy / llm.c

LLM training in simple, raw C/CUDA

Repository from Github https://github.comkarpathy/llm.cRepository from Github https://github.comkarpathy/llm.c

llm.c for inference

ztachip opened this issue · comments

Do you have the equivalent simple C implementation of LLM but for inference of LLAMA models.
I am trying to build a FPGA accelerator for LLM and a simple reference C code would be very helpful
Thanks

Maybe this : https://github.com/karpathy/llama2.c ?
A one-file implementation for Llama2 inference in C

This is perfect. Thanks