Santosh Bhavani's starred repositories
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
RFdiffusion
Code for running RFdiffusion
nucleotide-transformer
🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics
putting-nerf-on-a-diet
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis Implementation
zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
JAX-Toolbox
JAX-Toolbox
flash-attention-jax
Implementation of Flash Attention in JAX
github-trending
Track GitHub trending projects using GitHub Actions.
microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats