Dmitry Kozlov's starred repositories
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
llm-foundry
LLM training code for Databricks foundation models
Dragonfly2
Dragonfly is an open-source, P2P-based file distribution and image acceleration system. It is hosted by the Cloud Native Computing Foundation (CNCF) as an Incubating Level Project.
instill-core
🔮 Instill Core is a full-stack AI infrastructure tool for data, model, and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications.
Stable-Diffusion-WebUI-TensorRT
TensorRT Extension for Stable Diffusion Web UI
stargz-snapshotter
Fast container image distribution plugin with lazy pulling
soci-snapshotter
A containerd snapshotter plugin which enables standard OCI images to be lazily loaded without requiring a build-time conversion step.
versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism