kexi's repositories
llm.c
LLM training in simple, raw C/CUDA
Language:CudaMIT000
MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Language:PythonMIT000
x-stable-diffusion
Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention.
Language:Jupyter NotebookApache-2.0000