Vlad (acforvs)

Company: Yandex

Vlad's repositories

multi-agent-pathfinding

Heuristic Search vs. Learning. "Distributed Heuristic Multi-Agent Path Finding with Communication" reproduced, trained & benchmarked with M*

Language: Python, License: MIT, Stargazers: 15, Issues: 2, Issues: 1

dhc-robust-mapf

Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and benchmarked in out-of-distribution (OOD) scenarios. A new robust training loop to handle communication failures is introduced.

Language: Python, License: MIT, Stargazers: 10, Issues: 2, Issues: 1

Cointegrated-Pairs-Trading

Algorithmic trading strategy; entrance task for the CMF Quantitative Analytics program, 2021

Language: Python, Stargazers: 7, Issues: 1, Issues: 0

talks

My public talks are presented here

Stargazers: 6, Issues: 0, Issues: 0

ppl-kaggle-titanic

Titanic Kaggle contest

Language: Jupyter Notebook, Stargazers: 5, Issues: 0, Issues: 0

Decision-Tree

Decision tree implementation, part of my ML homework @ SPbU

Language: Python, Stargazers: 4, Issues: 0, Issues: 0

Kaggle-In-house-classification

Kaggle classification contest report (in Russian)

Language: Jupyter Notebook, Stargazers: 4, Issues: 0, Issues: 0

LeetCode-solutions

LeetCode solutions

Language: C++, Stargazers: 4, Issues: 0, Issues: 0

ppl-railway-station

Railway modelling

Language: Python, Stargazers: 4, Issues: 0, Issues: 0

ppl-text-index

Text file processing & index creation

Language: Python, Stargazers: 4, Issues: 1, Issues: 0

tiktok

Entrance task for the "Tiktok for drivers" project, interactive map

Language: Python, Stargazers: 4, Issues: 0, Issues: 0

Gradient-Descent-Homework

Gradient Descent Homework for the ML Course @ SPbU

Language: Jupyter Notebook, Stargazers: 3, Issues: 0, Issues: 0

DHC

Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021

Language: Python, Stargazers: 1, Issues: 0, Issues: 0

transformer

PyTorch implementation of the original transformer, from scratch

Language: Python, License: MIT, Stargazers: 1, Issues: 2, Issues: 0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

optax

Optax is a gradient processing and optimization library for JAX.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

awac_iql

Offline to Online RL: AWAC & IQL PyTorch Implementation

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 2, Issues: 0

CodeTF

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

deep-rl-class

This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Class.

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

introtodeeplearning

Lab Materials for MIT 6.S191: Introduction to Deep Learning

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

starter-hugo-academic

🎓 Hugo Academic Theme: create an academic website. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

tau

Pipeline Parallelism for PyTorch

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

text-generation-inference

Large Language Model Text Generation Inference

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0