Iman Tabrizian (Tabrizian)

Company: NVIDIA

Location: Toronto, Canada


Organizations
kubeflow
nuxt-community
triton-inference-server

Iman Tabrizian's starred repositories

ant-design

An enterprise-class UI design language and React UI library

Language: TypeScript | License: MIT | Stargazers: 90696 | Issues: 235 | Issues: 29443

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

ninja

a small build system with a focus on speed

Language: C++ | License: Apache-2.0 | Stargazers: 10701 | Issues: 266 | Issues: 1007

rr

Record and Replay Framework

Language: C++ | License: NOASSERTION | Stargazers: 8736 | Issues: 158 | Issues: 1974

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language: Jupyter Notebook | License: MIT | Stargazers: 8652 | Issues: 143 | Issues: 25

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7093 | Issues: 82 | Issues: 1441

cp-algorithms

Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)

Language: C++ | License: CC-BY-SA-4.0 | Stargazers: 6753 | Issues: 95 | Issues: 332

paper-qa

LLM Chain for answering questions from documents with citations

Language: Python | License: Apache-2.0 | Stargazers: 3718 | Issues: 41 | Issues: 132

libcudacxx

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

Language: C++ | License: NOASSERTION | Stargazers: 2294 | Issues: 67 | Issues: 94

nanobind

nanobind: tiny and efficient C++/Python bindings

Language: C++ | License: BSD-3-Clause | Stargazers: 2109 | Issues: 34 | Issues: 141

blog

Some notes on things I find interesting and important.

marl

A hybrid thread / fiber task scheduler written in C++ 11

Language: C++ | License: Apache-2.0 | Stargazers: 1801 | Issues: 53 | Issues: 68

awesome-model-quantization

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously being improved. PRs adding works (papers, repositories) missing from the repo are welcome.

cpp-ipc

C++ IPC Library: A high-performance inter-process communication using shared memory on Linux/Windows.

Language: C++ | License: NOASSERTION | Stargazers: 1649 | Issues: 53 | Issues: 88

atomic_queue

C++ lockless queue.

Language: C++ | License: MIT | Stargazers: 1384 | Issues: 42 | Issues: 45

stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Language: C++ | License: Apache-2.0 | Stargazers: 1326 | Issues: 49 | Issues: 518

High-Performance-Organizations-Reading-List

Ideas for creating and sustaining high performance organizations

ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

Language: C | License: NOASSERTION | Stargazers: 1056 | Issues: 78 | Issues: 2025

pytriton

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Language: Python | License: Apache-2.0 | Stargazers: 676 | Issues: 18 | Issues: 68

python_backend

Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python.

Language: C++ | License: BSD-3-Clause | Stargazers: 493 | Issues: 11 | Issues: 0

rmm

RAPIDS Memory Manager

Language: C++ | License: Apache-2.0 | Stargazers: 441 | Issues: 27 | Issues: 360

Language: C++ | License: Apache-2.0 | Stargazers: 379 | Issues: 13 | Issues: 11

Language: Python | License: NOASSERTION | Stargazers: 377 | Issues: 9 | Issues: 218

wizardzines

Sorted zines collected from Julia Evans's (@b0rk) Twitter

pltalk

Schedule for the #PLTalk livestream. On Twitch here: https://www.twitch.tv/jeanqasaur

MathsFromExamples

Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample

Language: Python | License: NOASSERTION | Stargazers: 175 | Issues: 10 | Issues: 0

multipy

torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters in a single C++ process.

Language: C++ | License: NOASSERTION | Stargazers: 166 | Issues: 15 | Issues: 58

extrainterpreters

Utilities for using Python's PEP 554 subinterpreters

Language: Python | License: LGPL-3.0 | Stargazers: 102 | Issues: 13 | Issues: 7

shmemq-blog

Shared memory queue benchmarks and tracing for blog

Language: C | License: MIT | Stargazers: 68 | Issues: 5 | Issues: 0