brappier's starred repositories

llama.cpp

LLM inference in C/C++

metal-flash-attention

FlashAttention (Metal Port)

Language:SwiftLicense:MITStargazers:339Issues:16Issues:12