KeAWang

followers

following

stars

Stanford, CA

http://keawang.github.io

Alex Wang's starred repositories

openui

OpenUI let's you describe UI using your imagination, then see it rendered live.

Language:TypeScriptApache-2.018694 124 157

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptMIT13402 99 246

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language:SystemVerilog6934 68 22

superfile

Pretty fancy and modern terminal file manager

Language:GoMIT5596 9 148

phoenix

A lightweight macOS window and app manager scriptable with JavaScript

Language:Objective-CNOASSERTION4362 56 295

timesfm

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Language:PythonApache-2.03562 37 90

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookApache-2.02212 32 87

torchtitan

A native PyTorch Library for large model training

Language:PythonBSD-3-Clause1839 36 134

ThunderKittens

Tile primitives for speedy kernels

Language:CudaMIT1488 25 26

Memary

The Open Source Memory Layer For Autonomous Agents

Language:Jupyter NotebookMIT1393 14 28

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonMIT1201 23 40

conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Language:PythonMIT1057 15 49

experts

Experts.js is the easiest way to create and deploy OpenAI's Assistants and link them together as Tools to create advanced Multi AI Agent Systems with expanded memory and attention to detail.

Language:JavaScriptMIT973 12 13

stencila

Programmable, reproducible, interactive documents

Language:RustApache-2.0796 26 747

data-to-paper

data-to-paper: Backward-traceable AI-driven scientific research

Language:PythonMIT441 4 18

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language:CudaMIT423 3 11

helix

Multi-node production AI stack. Run the best of open source AI easily on your own servers. Create your own AI by fine-tuning open source models. Integrate LLMs with APIs. Run gptscript securely on the server

Language:GoNOASSERTION321 7 169

based

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Language:PythonApache-2.0206 16 12

Annotated-ML-Papers

Annotations of the interesting ML papers I read

MIT201 190

GPUSorting

State of the art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity style compute shaders. Theoretically portable to all wave/warp/subgroup sizes.

Language:CudaNOASSERTION123 3 5

FlashAttention-PyTorch

Implementation of FlashAttention in PyTorch

Language:PythonMIT94 20

GPUPrefixSums

A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes.

Language:C++NOASSERTION73 5 6

RTF

A State-Space Model with Rational Transfer Function Representation.

Language:AssemblyApache-2.061 40

tcu_scope

Language:HTML44 40

pepin

A probabilistic approximate DNF counter

Language:C++MIT36 30

mean-field-cnns

Language:Jupyter NotebookApache-2.035 8 1

gpu-prefix-sum

CUDA implementation of exclusive prefix sum via Blelloch's algorithm

Language:Cuda25 20

Pytorch-Depthwise-Conv3d

cuda implementation of depthwise conv3d

Language:CudaMIT21 1 4

Parallel-Computing

Implementation of various Parallel Computing algorithms using CUDA C++

Language:Cuda1 10