Manan Shah (mananshah99)

Company: @kumo-ai

Location: San Francisco, CA

Manan Shah's starred repositories

imessage-exporter

Export iMessage data + run iMessage Diagnostics

Language: Rust | License: GPL-3.0 | Stargazers: 2690 | Issues: 0

card-web

The web app behind thecompendium.cards

Language: TypeScript | License: Apache-2.0 | Stargazers: 46 | Issues: 0

py-spy

Sampling profiler for Python programs

Language: Rust | License: MIT | Stargazers: 12231 | Issues: 0

diagrams

🎨 Diagram as Code for prototyping cloud system architectures

Language: Python | License: MIT | Stargazers: 35725 | Issues: 0

libbacktrace

A C library that may be linked into a C/C++ program to produce symbolic backtraces

Language: C | License: NOASSERTION | Stargazers: 923 | Issues: 0

anynp

Proof-of-concept of global switching between numpy/jax/pytorch in a library.

Language: Python | Stargazers: 16 | Issues: 0

unitycatalog

Open, Multi-modal Catalog for Data & AI

Language: Java | License: Apache-2.0 | Stargazers: 2001 | Issues: 0

kubectx

Faster way to switch between clusters and namespaces in kubectl

Language: Go | License: Apache-2.0 | Stargazers: 17310 | Issues: 0

kubernetes

Production-Grade Container Scheduling and Management

Language: Go | License: Apache-2.0 | Stargazers: 108720 | Issues: 0

Language: Go | Stargazers: 9 | Issues: 0

engineering-blogs

A curated list of engineering blogs

Language: Ruby | Stargazers: 30388 | Issues: 0

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Language: Cuda | License: Apache-2.0 | Stargazers: 766 | Issues: 0

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

License: Apache-2.0 | Stargazers: 1 | Issues: 0

llama_duo

Asynchronous/distributed speculative evaluation for llama3

Language: C++ | License: MIT | Stargazers: 33 | Issues: 0

go

The Go programming language

Language: Go | License: BSD-3-Clause | Stargazers: 121522 | Issues: 0

ucall

Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️

Language: C | License: Apache-2.0 | Stargazers: 1099 | Issues: 0

liburing

Library providing helpers for the Linux kernel io_uring support

Language: C | License: MIT | Stargazers: 2715 | Issues: 0

pt-three-ways

Path tracing, done three ways

Language: C++ | License: MIT | Stargazers: 191 | Issues: 0

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language: SystemVerilog | Stargazers: 6735 | Issues: 0

attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Language: Python | License: MIT | Stargazers: 425 | Issues: 0

gemma.cpp

Lightweight, standalone C++ inference engine for Google's Gemma models.

Language: C++ | License: Apache-2.0 | Stargazers: 5816 | Issues: 0

whispercpp

Pybind11 bindings for Whisper.cpp

Language: C++ | License: Apache-2.0 | Stargazers: 31 | Issues: 0

bark.cpp

Suno AI's Bark model in C/C++ for fast text-to-speech

Language: C++ | License: MIT | Stargazers: 642 | Issues: 0

shouldersOfGiants.rs

I have no idea what I'm doing, but llm.c in Rust

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 0

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | Stargazers: 22248 | Issues: 0

ShallowSpeed

Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

Language: Python | Stargazers: 71 | Issues: 0

Language: Cuda | License: Apache-2.0 | Stargazers: 557 | Issues: 0

PytorchBridge

Designing bridge trusses with Pytorch autograd

Language: Jupyter Notebook | Stargazers: 61 | Issues: 0

simdjson

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

Language: C++ | License: Apache-2.0 | Stargazers: 18863 | Issues: 0

awesome-distributed-system-projects

🚀 List of distributed system projects for inspiration and learning to build distributed services from real world examples

Stargazers: 629 | Issues: 0