Tinkoff.AI (tinkoff-ai)

Tinkoff.AI

tinkoff-ai

Geek Repo

Tinkoff AI Center

Location:Russian Federation

Home Page:https://www.tinkoff.ru/career/it/ml/

Github PK Tool:Github PK Tool

Tinkoff.AI's repositories

CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Language:PythonLicense:Apache-2.0Stargazers:962Issues:15Issues:28

etna

ETNA – Time-Series Library

Language:PythonLicense:Apache-2.0Stargazers:836Issues:8Issues:550

katakomba

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Language:PythonLicense:NOASSERTIONStargazers:61Issues:3Issues:0

ReBRAC

Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:50Issues:2Issues:0

sac-rnd

Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023

Language:PythonLicense:Apache-2.0Stargazers:47Issues:3Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:38Issues:4Issues:2

palbert

Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight

Language:PythonLicense:Apache-2.0Stargazers:37Issues:2Issues:1

eop

Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022

Language:Jupyter NotebookLicense:MITStargazers:28Issues:2Issues:0

open-tlab

Примеры пропозалов для подачи заявки в Open.TLab

probabilistic-embeddings

"Probabilistic Embeddings Revisited" paper official repository

Language:PythonLicense:Apache-2.0Stargazers:23Issues:3Issues:0

lb-sac

Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop

Language:PythonLicense:Apache-2.0Stargazers:18Issues:4Issues:1

cnf

Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, Offline RL Workshop

Language:PythonLicense:Apache-2.0Stargazers:11Issues:3Issues:0

exact

The original PyTorch implementation of the "EXACT: How Train Your Accuracy"

Language:PythonLicense:Apache-2.0Stargazers:9Issues:3Issues:0
Language:Jupyter NotebookLicense:MITStargazers:7Issues:2Issues:0
Language:Jupyter NotebookLicense:MITStargazers:6Issues:1Issues:1
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5Issues:3Issues:0

sigir-2021

4th place solution for the SIGIR 2021 challenge.

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

d4rl

A benchmark for offline reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Stargazers:0Issues:0Issues:0