sanchitintel's repositories
accelerate
š A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
alband_subclass_zoo
fork of AlbanD'S subclass zoo
ArchBenchSuite
low level kernels to benchmark peak compute, cache bandwidth on various levels, memory bandwidth, and some basic compute routines
ClassyVision
An end-to-end PyTorch framework for image and video classification
diffusers
š¤ Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
glibc
Unofficial mirror of sourceware glibc repository. Updated daily.
ideep
IntelĀ® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.
gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
intel-extension-for-transformers
ā” Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsā”
intel-xpu-backend-for-triton
OpenAI Triton backend for IntelĀ® GPUs
llama2.so
Bert Maher's llama2.so
metaseq
FBResearch Metaseq fork
mmperf
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
netron-fork
Visualizer for neural network, deep learning, and machine learning models
neural-compressor
IntelĀ® Neural Compressor provides unified APIs for SOTA model compression techniques, such as low precision (INT8/INT4/FP4/NF4) quantization, sparsity, pruning, and knowledge distillation on mainstream AI frameworks such as TensorFlow, PyTorch, and ONNX Runtime.
nogil
Multithreaded Python without the GIL
opentuner-fork
An extensible framework for program autotuning authored by jansel
parlooper-fork
PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolutions and Fused Deep Learning Primitives
Pillow
Python Imaging Library (Fork)
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
pytorch.github.io
The website for PyTorch
Robin
Just-in-time (JIT) compiler utilities
torchtune
A Native-PyTorch Library for LLM Fine-tuning
tutorials
PyTorch tutorials.
vision
Datasets, Transforms and Models specific to Computer Vision