Vlad's repositories
multi-agent-pathfinding
Heuristic Search vs. Learning. "Distributed Heuristic Multi-Agent Path Finding with Communication" reproduced, trained, and benchmarked against M*
dhc-robust-mapf
Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and benchmarked in out-of-distribution (OOD) scenarios. A new robust training loop to handle communication failures is introduced.
Cointegrated-Pairs-Trading
Algo trading strategy, entrance task to CMF, Quantitative Analytics program, 2021
ppl-kaggle-titanic
Titanic Kaggle contest
Decision-Tree
Decision Tree implementation, part of my ML homework @ SPbU
Kaggle-In-house-classification
Kaggle classification contest report (in Russian)
LeetCode-solutions
LeetCode solutions
ppl-railway-station
Railway modelling
ppl-text-index
Text file processing & index creation
Gradient-Descent-Homework
Gradient Descent Homework for the ML Course @ SPbU
transformer
PyTorch implementation of the original transformer, from scratch
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
optax
Optax is a gradient processing and optimization library for JAX.
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
CodeTF
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Class.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
introtodeeplearning
Lab Materials for MIT 6.S191: Introduction to Deep Learning
starter-hugo-academic
🎓 Hugo Academic Theme: create an academic website. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
tau
Pipeline Parallelism for PyTorch
text-generation-inference
Large Language Model Text Generation Inference
trlx
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)