NVIDIA Corporation's repositories
Megatron-LM
Ongoing research training transformer models at scale
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
gpu-operator
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes clusters
NeMo-Aligner
Scalable toolkit for efficient model alignment
cuda-quantum
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
NeMo-Curator
Scalable data pre-processing and curation toolkit for LLMs
NeMo-text-processing
NeMo text processing for ASR and TTS
JAX-Toolbox
CI and container tooling for running JAX on NVIDIA GPUs
metropolis-nim-workflows
Collection of reference workflows for building intelligent agents with NIMs
TensorRT-Incubator
Experimental projects related to TensorRT
spark-rapids-ml
Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
spark-rapids-jni
RAPIDS Accelerator JNI For Apache Spark
k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
cloud-native-docs
Documentation repository for NVIDIA Cloud Native Technologies
PLDM-unpack
Tool to unpack or parse PLDM (Platform Level Data Model v1.0.1) firmware update files.