Andrei Panferov's repositories
tensor_parallel
Automatically split your PyTorch models across multiple GPUs for training & inference
nlp_course
YSDA course in Natural Language Processing
quaified_impalas
NMA 2021 group project
raytracer22
2022 iteration of my annual raytracer project.
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
bitsandbytes
8-bit CUDA functions for PyTorch
efficient-dl-systems
Efficient Deep Learning Systems course (HSE, YSDA)
raytracer21
An almost pure C++ raytracer program
langchain
🦜🔗 Build context-aware reasoning applications
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
peft-rosa
A fork of the PEFT library, supporting Robust Adaptation (RoSA)
PFL-DocVQA-Competition
https://benchmarks.elsa-ai.eu/?ch=2&com=introduction
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.