vineetp6 / minbpe.c

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, written in pure C.
