Iman Tabrizian's starred repositories
ant-design
An enterprise-class UI design language and React UI library
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
cp-algorithms
Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)
libcudacxx
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
awesome-model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
atomic_queue
C++ lockless queue.
High-Performance-Organizations-Reading-List
Ideas for creating and sustaining high performance organizations
python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
wizardzines
sorted zines that collected from Julia Evans @b0rk twitter
MathsFromExamples
Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample
extrainterpreters
Utilities for using Python's PEP 554 subinterpreters
shmemq-blog
Shared memory queue benchmarks and tracing for blog