Chaitanya Sri Krishna Lolla's starred repositories
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
awesome-robotic-tooling
Tooling for professional robotic development in C++ and Python with a touch of ROS, autonomous driving and aerospace.
pytorchviz
A small package to create visualizations of PyTorch execution graphs
ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
flops-counter.pytorch
Flops counter for convolutional networks in pytorch framework
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
open-gpu-doc
Documentation of NVIDIA chip/hardware interfaces
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
resource-stream
CUDA related news and material links
pytorch_memlab
Profiling and inspecting memory in pytorch
multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
cuda-quantum
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
contiguous_pytorch_params
Accelerate training by storing parameters in one contiguous chunk of memory.
pytorch-docker-armv7
pytorch for RaspberryPi