kunal-vaishnavi's repositories
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
optimum
🏎️ Accelerate training and inference of 🤗 Transformers with easy to use hardware optimization tools
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision