Iman Tabrizian (Tabrizian)

Tabrizian

Geek Repo

Company:NVIDIA

Location:Toronto, Canada

Github PK Tool:Github PK Tool


Organizations
kubeflow
nuxt-community
triton-inference-server

Iman Tabrizian's starred repositories

mlx

MLX: An array framework for Apple silicon

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:16043Issues:134Issues:676

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:13083Issues:115Issues:983

triton

Development repository for the Triton language and compiler

nn-zero-to-hero

Neural Networks: Zero to Hero

Language:Jupyter NotebookLicense:MITStargazers:11384Issues:282Issues:30

the-incredible-pytorch

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookLicense:MITStargazers:9862Issues:149Issues:29

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8952Issues:82Issues:36

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7994Issues:87Issues:1739

warp

A Python framework for high performance GPU simulation and graphics

Language:PythonLicense:NOASSERTIONStargazers:4031Issues:55Issues:213

paper-qa

LLM Chain for answering questions from documents with citations

Language:PythonLicense:Apache-2.0Stargazers:3821Issues:40Issues:141

pixi

Package management made easy

Language:RustLicense:BSD-3-ClauseStargazers:2775Issues:21Issues:801

makemore

An autoregressive character-level language model for making more things

Language:PythonLicense:MITStargazers:2417Issues:33Issues:8

libcudacxx

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

Language:C++License:NOASSERTIONStargazers:2294Issues:68Issues:94

twinny

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but completely free and 100% private.

Language:TypeScriptLicense:MITStargazers:2279Issues:14Issues:154

blog

Some notes on things I find interesting and important.

marl

A hybrid thread / fiber task scheduler written in C++ 11

Language:C++License:Apache-2.0Stargazers:1838Issues:54Issues:69

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonLicense:Apache-2.0Stargazers:1672Issues:24Issues:38

stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Language:C++License:Apache-2.0Stargazers:1473Issues:55Issues:535

Essentials-of-Compilation

A book about compiling Racket and Python to x86-64 assembly

llama3.np

llama3.np is a pure NumPy implementation for Llama 3 model.

Language:PythonLicense:MITStargazers:946Issues:13Issues:4

pytriton

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Language:PythonLicense:Apache-2.0Stargazers:706Issues:18Issues:73

rmm

RAPIDS Memory Manager

Language:C++License:Apache-2.0Stargazers:461Issues:27Issues:390

onnxruntime-genai

Generative AI extensions for onnxruntime

Language:C++License:MITStargazers:392Issues:45Issues:207

extending-jax

Extending JAX with custom C++ and CUDA code

Language:PythonLicense:MITStargazers:368Issues:10Issues:6

multi-core-python

Enabling CPython multi-core parallelism via subinterpreters.

cuda-checkpoint

CUDA checkpoint and restore utility

Language:CudaLicense:NOASSERTIONStargazers:183Issues:22Issues:13

multipy

torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters in a single C++ process.

Language:C++License:NOASSERTIONStargazers:169Issues:15Issues:58

extrainterpreters

Utilities for using Python's PEP 554 subinterpreters

Language:PythonLicense:LGPL-3.0Stargazers:106Issues:12Issues:7

vscode-micromamba

A VSCode extension to generate development environments using micromamba and conda-forge package repository

Language:TypeScriptLicense:BSD-3-ClauseStargazers:81Issues:6Issues:16