NVIDIA Corporation's repositories
Megatron-LM
Ongoing research training transformer models at scale
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
gpu-operator
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
NeMo-Aligner
Scalable toolkit for efficient model alignment
cuda-quantum
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
NeMo-Framework-Launcher
Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
NeMo-Curator
Scalable toolkit for data curation
JAX-Toolbox
JAX-Toolbox
RTX-AI-Toolkit
The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCs and cloud.
earth2studio
Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
spark-rapids-tools
User tools for Spark RAPIDS
knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
k8s-driver-manager
The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.
spark-rapids-jni
RAPIDS Accelerator JNI For Apache Spark
NV-Kernels
Ubuntu kernels which are optimized for NVIDIA server systems
edk2-platforms
NVIDIA fork of tianocore/edk2-platforms
doca-sosreport
A unified tool for collecting system logs and other debug information