Stephen Panaro's repositories
coreml-llm-cli
CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.
more-ane-transformers
Run transformers (incl. LLMs) on the Apple Neural Engine.
CoreMLInspect
See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.
apple-silicon-4bit-quant
Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"
norm-tweaking
Post post-training-quantization (PTQ) method for improving LLMs. Unofficial implementation of https://arxiv.org/abs/2309.02784
mlx-squeezellm-gradients
SqueezeLLM-style gradients/Fisher Information collection in MLX
swift-transformers
Swift Package to implement a transformers-like API in Swift
WhisperKit
Swift native on-device speech recognition with Whisper for Apple Silicon
whisperkittools
Python tools for WhisperKit: Model conversion, optimization and evaluation
CLIP-Finder2
CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on Apple's MobileCLIP-S0 architecture, it ensures optimal performance and accurate media retrieval.
coremltools
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
FloatingPanel
A clean and easy-to-use floating panel UI component for iOS
powerlevel10k
A Zsh theme
smpanaro.github.io
My personal website.
swift-chat
Mac app to demonstrate swift-transformers
time-series-compression
Utilities for evaluating time series compression techniques. Companion to blog post.
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
tree-sitter-flatbuffers
tree-sitter grammar for FlatBuffers
zed-extensions
Extensions for the Zed editor
zed-flatbuffers
zed.dev extension with language support for FlatBuffers