Kevin Kiningham's starred repositories

llama_index

LlamaIndex is a data framework for your LLM applications

Language: Python | License: MIT | Stargazers: 35854 | Issues: 247 | Issues: 5223

carbon-lang

Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)

Language: C++ | License: NOASSERTION | Stargazers: 32243 | Issues: 392 | Issues: 603

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language: Python | License: Apache-2.0 | Stargazers: 30135 | Issues: 386 | Issues: 3499

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language: C++ | License: NOASSERTION | Stargazers: 15013 | Issues: 139 | Issues: 413

fauxpilot

FauxPilot - an open-source alternative to GitHub Copilot server

Language: Python | License: MIT | Stargazers: 14559 | Issues: 124 | Issues: 133

diffusionbee-stable-diffusion-ui

Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.

Language: JavaScript | License: AGPL-3.0 | Stargazers: 12452 | Issues: 109 | Issues: 456

engineeringladders

A framework for Engineering Managers

mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Language: Python | License: Apache-2.0 | Stargazers: 5209 | Issues: 61 | Issues: 1606

libunifex

Unified Executors

Language: C++ | License: NOASSERTION | Stargazers: 1461 | Issues: 58 | Issues: 156

stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language: Python | License: MIT | Stargazers: 1158 | Issues: 18 | Issues: 122

resource-stream

GPU programming related news and material links

amx

Apple AMX Instruction Set

mlir-tutorial

MLIR For Beginners tutorial

ringattention

Transformers with Arbitrarily Large Context

Language: Python | License: Apache-2.0 | Stargazers: 619 | Issues: 6 | Issues: 16

veryl

Veryl: A Modern Hardware Description Language

Language: Rust | License: NOASSERTION | Stargazers: 478 | Issues: 10 | Issues: 313

gf180mcu-pdk

PDK for GlobalFoundries' 180nm MCU bulk process technology (GF180MCU).

Language: Makefile | License: Apache-2.0 | Stargazers: 364 | Issues: 20 | Issues: 49

100DaysOfRTL

100 Days of RTL

Language: SystemVerilog | Stargazers: 327 | Issues: 26 | Issues: 5

quadrable

Authenticated multi-version database: sparse binary merkle tree with compact partial-tree proofs

Language: C++ | License: BSD-2-Clause | Stargazers: 299 | Issues: 10 | Issues: 3

commavq

commaVQ is a dataset of compressed driving video

Language: Jupyter Notebook | License: MIT | Stargazers: 290 | Issues: 20 | Issues: 13

poprc

A Compiler for the Popr Language

Language: C | License: GPL-3.0 | Stargazers: 241 | Issues: 19 | Issues: 3

halutmatmul

Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator

Language: Python | License: MIT | Stargazers: 206 | Issues: 10 | Issues: 4

concurrent_deferred_rc

Concurrent Deferred Reference Counting

Language: C++ | License: MIT | Stargazers: 145 | Issues: 19 | Issues: 4

mmperf

MatMul Performance Benchmarks for a Single CPU Core, comparing both hand-engineered and codegen kernels.

Language: C++ | License: Apache-2.0 | Stargazers: 123 | Issues: 11 | Issues: 22

gpunet

GPUnet is a native GPU networking layer that provides a socket abstraction over Infiniband to GPU programs for NVIDIA GPUs.

Language: C | License: NOASSERTION | Stargazers: 92 | Issues: 21 | Issues: 2

mlir-tv

A translation validation framework for MLIR

Language: C++ | License: Apache-2.0 | Stargazers: 71 | Issues: 7 | Issues: 51

triton-autodiff

Experiment in using Tangent to autodiff Triton

Language: Python | License: MIT | Stargazers: 68 | Issues: 5 | Issues: 0

mlir-tcp

Tensor Compute Primitives: Mid-level Intermediate Representation for Machine Learning Programs

Language: MLIR | License: NOASSERTION | Stargazers: 35 | Issues: 10 | Issues: 2

iree-torch

Torch Frontend for IREE

Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 17 | Issues: 11

rules_m4

Bazel build rules for GNU M4

Language: Starlark | License: Apache-2.0 | Stargazers: 16 | Issues: 4 | Issues: 9

iree-pjrt

PJRT plugin for interfacing IREE with JAX and TensorFlow.

Language: C++ | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0