Timo Klein (timoklein)

Company: University of Vienna

Location: Vienna, Austria

Timo Klein's repositories

alphazero-gym

AlphaZero for continuous control tasks

Language: Python | License: MIT | Stargazers: 22 | Issues: 3 | Issues: 7

redo

ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)

car_racer

Deep reinforcement learning in autonomous driving

Language: Python | Stargazers: 7 | Issues: 0 | Issues: 0

infer

InFeR: Understanding and Preventing Capacity Loss in Reinforcement Learning (pytorch)

Language: Python | Stargazers: 4 | Issues: 0 | Issues: 0

neural_citation

Context aware citation recommendation

Language: Jupyter Notebook | Stargazers: 4 | Issues: 0 | Issues: 0

crelu-pytorch

CReLU activation function from the paper "Loss of Plasticity in Continual Deep Reinforcement Learning"

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

implicit_underparameterization

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning (pytorch)

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

ma_thesis

Combining Reinforcement Learning and Search for Cooperative Trajectory Planning

Language: TeX | Stargazers: 1 | Issues: 2 | Issues: 0

bandit_algos

Some common algorithms for multi-armed bandit problems

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

ClustPy

A Python library for advanced clustering algorithms

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

cpp_optim

Nonlinear optimization examples in C++

Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

garage

A toolkit for reproducible reinforcement learning research.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

markov-abstractions-ablations

DM control Markov component ablations

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

outlier_detection

Class-based Python implementations of outlier detection algorithms.

Language: Scilab | Stargazers: 0 | Issues: 0 | Issues: 0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

rl_graph_breaks

A demonstration of torch.compile graph breaks in RL code, using SAC-discrete as an example

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

udemy_cpp

C++ Course

Language: Makefile | Stargazers: 0 | Issues: 2 | Issues: 0

wandb_tutorial

Code example for some basic wandb functionality

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0