ivanfioravanti

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

000

macos-core-to-core-latency

Core-to-core latency benchmark that works on MacOS without hard affinity

Language:C++MIT000

mactop

mactop - Apple Silicon Monitor Top written in pure Golang! Under 1,000 lines of code.

Language:GoMIT000

mflux

A MLX port of FLUX based on the Huggingface Diffusers implementation.

MIT000

mlx

MLX: An array framework for Apple silicon

Language:C++MIT010

mlx-benchmark

Benchmark of Apple's MLX operations on mlx gpu, cpu, torch mps and cuda.

Language:PythonMIT010

mlx-tuning-fork

Very basic framework for parameterized large language model (Q)LoRa fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture for systematic running of easily parameterized fine-tunes

Language:PythonMIT010

MLX-vs-Pytorch

Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs

Language:PythonMIT000

prompt-eng-interactive-tutorial

Anthropic's Interactive Prompt Engineering Tutorial

000

rlx

A reinforcement learning framework based on MLX.

Language:PythonMIT010

smol-vision

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Apache-2.0000

theWrongRoom

Interrogate LLMs to solve corporate mysteries.

NOASSERTION000

ivanfioravanti

Ivan Fioravanti's repositories

chatbot-ollama

prompt-eng-ollama-interactive-tutorial

mlx-examples

mlx-ui

chat-with-mlx

CrewAI

phidata

aider

chat-logger

ChatMLX

crewAI-examples

crewAI-tools

DIY-Astra

exo

f5-tts-mlx

fastmlx

huggingface.js

lightning-whisper-mlx

llama-recipes

macos-core-to-core-latency

mactop

mflux

mlx

mlx-benchmark

mlx-tuning-fork

MLX-vs-Pytorch

prompt-eng-interactive-tutorial

rlx

smol-vision

theWrongRoom