aarnphm

Aaron Pham's starred repositories

three.js

JavaScript 3D Library.

Language: JavaScript · MIT · 100798 2551 12432

zed

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Language: Rust · NOASSERTION · 42523 192 6908

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language: Jupyter Notebook · Apache-2.0 · 34760 360 65

fish-shell

The user-friendly command line shell.

Language: Rust · NOASSERTION · 25203 284 7113

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Language: Python · MIT · 19793 282 338

uv

An extremely fast Python package installer and resolver, written in Rust.

Language: Rust · Apache-2.0 · 15453 35 2155

pkl

A configuration as code language with rich validation and tooling.

Language: Java · Apache-2.0 · 9953 54 191

OpenLLM

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

Language: Python · Apache-2.0 · 9434 54 258

livekit

End-to-end stack for WebRTC. SFU media server and SDKs.

Language: Go · Apache-2.0 · 9097 119 486

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python · MIT · 8808 82 36

ZLUDA

CUDA on AMD GPUs

Language: Rust · Apache-2.0 · 8491 120 156

llrt

LLRT (Low Latency Runtime) is an experimental, lightweight JavaScript runtime designed to address the growing demand for fast and efficient Serverless applications.

Language: JavaScript · Apache-2.0 · 7816 50 130

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ · MIT · 7695 75 152

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language: C++ · Apache-2.0 · 5823 38 77

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python · Apache-2.0 · 5191 38 37

moondream

tiny vision language model

Language: Jupyter Notebook · Apache-2.0 · 4613 54 98

sql.js-httpvfs

Hosting read-only SQLite databases on static file hosters like GitHub Pages

Language: TypeScript · Apache-2.0 · 3432 34 44

react-strict-dom

React Strict DOM (RSD) is a subset of React DOM, imperative DOM, and CSS that supports web and native targets

Language: JavaScript · MIT · 3039 37 57

s4

Structured state space sequence models

Language: Jupyter Notebook · Apache-2.0 · 2285 52 132

meditron

Meditron is a suite of open-source medical Large Language Models (LLMs).

Language: Python · Apache-2.0 · 1788 30 29

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python · Apache-2.0 · 1679 37 270

ipyflow

A reactive Python kernel for Jupyter notebooks.

Language: Python · BSD-3-Clause · 1104 8 102

sp1

A performant, 100% open-source, contributor-friendly zkVM.

Language: Rust · Apache-2.0 · 831 36 83

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language: Python · GPL-3.0 · 811 17 9

marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Language: Python · Apache-2.0 · 477 14 24

pubgrub

PubGrub version solving algorithm implemented in Rust

Language: Rust · MPL-2.0 · 341 13 77

micromorph

A very tiny library for diffing DOM nodes

Language: TypeScript · MIT · 335 4 10

mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

Language: C++ · MIT · 189 17 84

jiter

Fast iterable JSON parser.

Language: Rust · MIT · 138 4 13