Robert Henryk Zawiasa's starred repositories
gguf-tools
GGUF implementation in C as a library and a tools CLI program
flash-attention
Fast and memory-efficient exact attention
metal-flash-attention
FlashAttention (Metal Port)
applegpuinfo
Print all known information about the GPU on Apple-designed chips
metal-benchmarks
Apple GPU microarchitecture
metal-without-xcode
A command-line-compilable example of Metal.
emscripten
Emscripten: An LLVM-to-WebAssembly Compiler
awesome-prompt-injection
Learn about a type of vulnerability that specifically targets machine learning models
kompute
General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
metal_tutorials_youtube
Apple Metal Programming Tutorials
text-generation-webui
A Gradio web UI for Large Language Models.
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
matmulfreellm
Implementation for MatMul-free LM.