karpathy/llama2.c
Inference Llama 2 in one file of pure C
Stargazers: 16203 · Watchers: 183 · Issues: 212 · Forks: 1852
karpathy/llama2.c Issues
Everyone, I have implemented multi-token prediction of InfiniAttention and meta. (Updated 2 days ago, 1 comment)
Training Tiny Stories: 'CUDA' -vs- 'MPS' (Updated 3 days ago, 2 comments)
Simplified llama2.c.dll (Updated 4 days ago, 4 comments)
Can this be compiled to run on Windows 10, or Windows XP? (Updated 4 days ago, 3 comments)
Not an issue: Asking for help (Closed 5 days ago, 2 comments)
Support for SIMD in matmul might increase performance (Closed 6 days ago, 1 comment)
mmap failed! ./run llama2_7b_q80.bin (Updated 6 days ago)
runomp on Mac M1 Max is slower than runfast (Updated 10 days ago, 10 comments)
-O3 does not apply auto-vectorization on X86-64 CPU (Updated 10 days ago, 1 comment)
The export model and read_checkpoint are in conflict (Updated 10 days ago, 2 comments)
Tokenizer errors out when running inference on llama2 (Updated 10 days ago, 1 comment)
malloc failed! on stories260 model (Updated 10 days ago, 1 comment)
How about Llama3? (Closed 25 days ago, 1 comment)
Plans C (compromised iOS devices, apologies not me) (Closed a month ago)
Can the Huggingface model be converted to ckpt.pt to support training? (Updated a month ago)
Could llama2.c be adapted to BitNet? (Updated 2 months ago)
RuntimeError with CUDA assertion failure when resuming model training from checkpoint (Updated 2 months ago, 1 comment)
Add feature: export (quantize) from Llama2.c format (Updated 2 months ago)
Once upon a time, there was a little girl named Lily (Updated 2 months ago, 5 comments)
Running llama2.c on a microcontroller (Updated 2 months ago, 1 comment)
Can you make a sora (diffusion transformer) tutorial similar to llama2.c? (Updated 2 months ago, 1 comment)
[Suggestion] Enable Discussion (Updated 3 months ago, 1 comment)
I'm doing an experiment with image generation, but my script outputs a binary file; how can I train a model using llama2.c? (Closed 3 months ago)
Could anyone port deepseek-moe to llama2.c? (Updated 3 months ago)
Please implement a project (Closed 3 months ago)
New Visual Walkthrough of Llama2.c (Updated 3 months ago)
Mobile React Native Support Ported (Updated 3 months ago)
Understanding "multiple_of" (Updated 3 months ago)
Train/val split (Updated 3 months ago)
Code/script to reproduce val loss using the shared models (Updated 3 months ago, 3 comments)
How to quantize stories15M.bin (Closed 4 months ago, 1 comment)
Can I train on CPU? (Updated 4 months ago, 5 comments)
Significant Quality Degradation with q8 Quantization in Small Models (Updated 4 months ago, 4 comments)
How to add a different corpus? (Updated 4 months ago, 4 comments)
Keras-based tiny llama implementations (Updated 4 months ago)
Causal attention implementation (Closed 4 months ago, 3 comments)
How do we reproduce your stories with a more practical Q&A chat model? (Updated 4 months ago, 1 comment)
Llama-shepherd-cli, a small tool to keep track of implementations in various languages (Updated 4 months ago)
NanoGPT in C for inference (Closed 5 months ago)
Export does not seem to work? (Closed 5 months ago, 2 comments)
MFU calculation (Updated 5 months ago)
Is it possible to use Orca2 with this code? (Updated 5 months ago, 1 comment)
NumPy llama2 for fun and learning (Closed 6 months ago)
ld: warning: ignoring duplicate libraries: '-lgcc' (Updated 6 months ago)
How to train a chat model (Updated 6 months ago)
Prefill Processing (Updated 6 months ago, 2 comments)
Llama transformer walkthrough (Updated 6 months ago)
How to run inference on GPU (Updated 7 months ago)
Can the custom model in llama2.c format be exported to HF format? (Updated 7 months ago, 1 comment)