Aaron Pham (aarnphm)

aarnphm

Geek Repo

Company:@bentoml

Location:toronto, ca

Home Page:https://aarnphm.xyz

Twitter:@aarnphm_

Github PK Tool:Github PK Tool


Organizations
MLH-Fellowship
tiproad

Aaron Pham's starred repositories

three.js

JavaScript 3D Library.

Language:JavaScriptLicense:MITStargazers:100798Issues:2551Issues:12432

zed

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Language:RustLicense:NOASSERTIONStargazers:42523Issues:192Issues:6908

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:34760Issues:360Issues:65

fish-shell

The user-friendly command line shell.

Language:RustLicense:NOASSERTIONStargazers:25203Issues:284Issues:7113

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Language:PythonLicense:MITStargazers:19793Issues:282Issues:338

uv

An extremely fast Python package installer and resolver, written in Rust.

Language:RustLicense:Apache-2.0Stargazers:15453Issues:35Issues:2155

pkl

A configuration as code language with rich validation and tooling.

Language:JavaLicense:Apache-2.0Stargazers:9953Issues:54Issues:191

OpenLLM

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

Language:PythonLicense:Apache-2.0Stargazers:9434Issues:54Issues:258

livekit

End-to-end stack for WebRTC. SFU media server and SDKs.

Language:GoLicense:Apache-2.0Stargazers:9097Issues:119Issues:486

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8808Issues:82Issues:36

ZLUDA

CUDA on AMD GPUs

Language:RustLicense:Apache-2.0Stargazers:8491Issues:120Issues:156

llrt

LLRT (Low Latency Runtime) is an experimental, lightweight JavaScript runtime designed to address the growing demand for fast and efficient Serverless applications.

Language:JavaScriptLicense:Apache-2.0Stargazers:7816Issues:50Issues:130

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++License:MITStargazers:7695Issues:75Issues:152

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5823Issues:38Issues:77

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5191Issues:38Issues:37

moondream

tiny vision language model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4613Issues:54Issues:98
Language:PythonLicense:Apache-2.0Stargazers:3888Issues:50Issues:110

sql.js-httpvfs

Hosting read-only SQLite databases on static file hosters like Github Pages

Language:TypeScriptLicense:Apache-2.0Stargazers:3432Issues:34Issues:44

react-strict-dom

React Strict DOM (RSD) is a subset of React DOM, imperative DOM, and CSS that supports web and native targets

Language:JavaScriptLicense:MITStargazers:3039Issues:37Issues:57

s4

Structured state space sequence models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2285Issues:52Issues:132

meditron

Meditron is a suite of open-source medical Large Language Models (LLMs).

Language:PythonLicense:Apache-2.0Stargazers:1788Issues:30Issues:29

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language:PythonLicense:Apache-2.0Stargazers:1679Issues:37Issues:270

ipyflow

A reactive Python kernel for Jupyter notebooks.

Language:PythonLicense:BSD-3-ClauseStargazers:1104Issues:8Issues:102

sp1

A performant, 100% open-source, contributor-friendly zkVM.

Language:RustLicense:Apache-2.0Stargazers:831Issues:36Issues:83

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language:PythonLicense:GPL-3.0Stargazers:811Issues:17Issues:9

marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Language:PythonLicense:Apache-2.0Stargazers:477Issues:14Issues:24

pubgrub

PubGrub version solving algorithm implemented in Rust

Language:RustLicense:MPL-2.0Stargazers:341Issues:13Issues:77

micromorph

A very tiny library for diffing DOM nodes

Language:TypeScriptLicense:MITStargazers:335Issues:4Issues:10

mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

Language:C++License:MITStargazers:189Issues:17Issues:84

jiter

Fast iterable JSON parser.

Language:RustLicense:MITStargazers:138Issues:4Issues:13