Alex Wang (KeAWang)

Location:Stanford, CA

Home Page:http://keawang.github.io

Alex Wang's starred repositories

openui

OpenUI lets you describe UI using your imagination, then see it rendered live.

Language: TypeScript | License: Apache-2.0 | Stargazers: 18684 | Issues: 124 | Issues: 157

Perplexica

Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

Language: TypeScript | License: MIT | Stargazers: 13379 | Issues: 99 | Issues: 246

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language: SystemVerilog | Stargazers: 6934 | Issues: 68 | Issues: 22

superfile

Pretty fancy and modern terminal file manager

Language: Go | License: MIT | Stargazers: 5584 | Issues: 9 | Issues: 148

phoenix

A lightweight macOS window and app manager scriptable with JavaScript

Language: Objective-C | License: NOASSERTION | Stargazers: 4362 | Issues: 56 | Issues: 295

timesfm

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Language: Python | License: Apache-2.0 | Stargazers: 3555 | Issues: 37 | Issues: 90

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2209 | Issues: 32 | Issues: 87

torchtitan

A native PyTorch Library for large model training

Language: Python | License: BSD-3-Clause | Stargazers: 1648 | Issues: 35 | Issues: 132

ThunderKittens

Tile primitives for speedy kernels

Language: Cuda | License: MIT | Stargazers: 1488 | Issues: 25 | Issues: 26

Memary

The Open Source Memory Layer For Autonomous Agents

Language: Jupyter Notebook | License: MIT | Stargazers: 1392 | Issues: 14 | Issues: 28

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 1194 | Issues: 23 | Issues: 40

conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Language: Python | License: MIT | Stargazers: 1057 | Issues: 15 | Issues: 49

experts

Experts.js is the easiest way to create and deploy OpenAI's Assistants and link them together as Tools to create advanced Multi AI Agent Systems with expanded memory and attention to detail.

Language: JavaScript | License: MIT | Stargazers: 973 | Issues: 12 | Issues: 13

stencila

Programmable, reproducible, interactive documents

Language: Rust | License: Apache-2.0 | Stargazers: 795 | Issues: 26 | Issues: 747

data-to-paper

data-to-paper: Backward-traceable AI-driven scientific research

Language: Python | License: MIT | Stargazers: 441 | Issues: 4 | Issues: 18

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language: Cuda | License: MIT | Stargazers: 421 | Issues: 3 | Issues: 11

helix

Multi-node production AI stack. Run the best of open source AI easily on your own servers. Create your own AI by fine-tuning open source models. Integrate LLMs with APIs. Run gptscript securely on the server

Language: Go | License: NOASSERTION | Stargazers: 321 | Issues: 7 | Issues: 169

based

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Language: Python | License: Apache-2.0 | Stargazers: 206 | Issues: 16 | Issues: 12

Annotated-ML-Papers

Annotations of the interesting ML papers I read

License: MIT | Stargazers: 201 | Issues: 19 | Issues: 0

GPUSorting

State of the art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity style compute shaders. Theoretically portable to all wave/warp/subgroup sizes.

Language: Cuda | License: NOASSERTION | Stargazers: 121 | Issues: 3 | Issues: 5

FlashAttention-PyTorch

Implementation of FlashAttention in PyTorch

Language: Python | License: MIT | Stargazers: 94 | Issues: 2 | Issues: 0

GPUPrefixSums

A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes.

Language: C++ | License: NOASSERTION | Stargazers: 73 | Issues: 5 | Issues: 6

RTF

A State-Space Model with Rational Transfer Function Representation.

Language: Assembly | License: Apache-2.0 | Stargazers: 61 | Issues: 4 | Issues: 0
Language: HTML | Stargazers: 44 | Issues: 4 | Issues: 0

pepin

A probabilistic approximate DNF counter

Language: C++ | License: MIT | Stargazers: 36 | Issues: 3 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 35 | Issues: 8 | Issues: 1

gpu-prefix-sum

CUDA implementation of exclusive prefix sum via Blelloch's algorithm

Language: Cuda | Stargazers: 25 | Issues: 2 | Issues: 0

Pytorch-Depthwise-Conv3d

CUDA implementation of depthwise conv3d

Language: Cuda | License: MIT | Stargazers: 21 | Issues: 1 | Issues: 4

Parallel-Computing

Implementation of various Parallel Computing algorithms using CUDA C++

Language: Cuda | Stargazers: 1 | Issues: 1 | Issues: 0