Tinkoff.AI

Tinkoff.AI's repositories

CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Language:PythonApache-2.01016 16 28

etna

ETNA – Time-Series Library

Language:PythonApache-2.0853 8 550

katakomba

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Language:PythonNOASSERTION63 30

ReBRAC

Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC

Language:Jupyter NotebookApache-2.050 20

sac-rnd

Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023

Language:PythonApache-2.048 30

hifi_vc

Language:Jupyter NotebookApache-2.039 4 2

palbert

Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight

Language:PythonApache-2.037 2 1

eop

Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022

Language:Jupyter NotebookMIT28 20

open-tlab

Примеры пропозалов для подачи заявки в Open.TLab

26 20

probabilistic-embeddings

"Probabilistic Embeddings Revisited" paper official repository

Language:PythonApache-2.025 30

lb-sac

Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop

Language:PythonApache-2.019 4 1

cnf

Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, Offline RL Workshop

Language:PythonApache-2.012 30

exact

The original PyTorch implementation of the "EXACT: How Train Your Accuracy"

Language:PythonApache-2.010 30

use_rs

Language:Jupyter NotebookMIT7 20

pycon-chit-chat

Language:Jupyter NotebookMIT6 1 1

dl-course

Language:Jupyter NotebookApache-2.05 30

sigir-2021

4th place solution for the SIGIR 2021 challenge.

Language:PythonApache-2.0400

d4rl

A benchmark for offline reinforcement learning.

Language:PythonApache-2.0100

.github

000