acforvs

followers

following

stars

Yandex

Vlad's repositories

multi-agent-pathfinding

Heuristic Search vs. Learning. "Distributed Heuristic Multi-Agent Path Finding with Communication" reproduced, trained & benchmarked with M*

Language:PythonMIT14 2 1

dhc-robust-mapf

Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and benchmarked in out-of-distribution (OOD) scenarios. A new robust training loop to handle communication failures is introduced.

Language:PythonMIT11 2 1

Cointegrated-Pairs-Trading

Algo trading strategy, entrance task to CMF, Quantitative Analytics program, 2021

Language:Python7 10

talks

My public talks are presented here

600

ppl-kaggle-titanic

Titanic Kaggle contest

Language:Jupyter Notebook500

Decision-Tree

Decision Tree Implementation as a part of my ML hw @ SPbU

Language:Python4 10

JB-Nucleon-Configurations

Language:Jupyter Notebook400

Kaggle-In-house-classification

Kaggle classification contest report (in Russian)

Language:Jupyter Notebook4 10

LeetCode-solutions

LeetCode solutions

Language:C++400

ppl-railway-station

Railway modelling

Language:Python4 10

ppl-text-index

Text file processing & index creation

Language:Python4 10

rhyme-bot

Language:Python4 10

tiktok

Entrance task for the "Tiktok for drivers" project, interactive map

Language:Python400

Gradient-Descent-Homework

Gradient Descent Homework for the ML Course @ SPbU

Language:Jupyter Notebook300

DHC

Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021

Language:Python100

transformer

PyTorch implementation of the original transformer, from scratch

Language:PythonMIT1 20

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++Apache-2.0000

optax

Optax is a gradient processing and optimization library for JAX.

Language:PythonApache-2.0000

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonApache-2.0000

awac_iql

Offline to Online RL: AWAC & IQL PyTorch Implementation

Language:Jupyter NotebookMIT020

CodeTF

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

Apache-2.0000

deep-rl-class

This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.

Language:Jupyter Notebook000

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.0000

introtodeeplearning

Lab Materials for MIT 6.S191: Introduction to Deep Learning

Language:Jupyter NotebookMIT000

starter-hugo-academic

🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.

MIT000

tau

Pipeline Parallelism for PyTorch

Language:PythonBSD-3-Clause000

tests

Language:Python000

text-generation-inference

Large Language Model Text Generation Inference

Apache-2.0000

TransPath

Language:Jupyter Notebook000

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonMIT000