Markus Semmler's repositories

burrolib

Burrolib provides a library for multi-agent Markov games for researchers. It considers Markov games from an economical perspective. The modular agent design allows different agent implementations for a single game. The user can choose whether to design an expert system or a free learning approach.

Language:PythonLicense:MITStargazers:4Issues:1Issues:0

abstract_rl

A modular python implementation of various policy gradient algorithms for use in control problems on experimental quanser robots. This repository includes implementations of Maximum A Posteriori Policy Optimization, Trust Region Policy Optimization and a draft for Soft Actor Critic.

Language:PythonLicense:MITStargazers:2Issues:2Issues:0

boed-pytorch

A simple project, which explores the variational estimators of Foster (https://arxiv.org/abs/1903.05480) in a Bayesian linear regression setting. Using nested Monte Carlo estimators the exact (convex) information gain is calculated for the regression.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

bootstrapped-dqn

An implementation of boostrapped DQN (https://arxiv.org/abs/1602.04621). It was created during my bachelor thesis at TU Darmstadt, and you can find thesis at http://www.ias.tu-darmstadt.de/uploads/Theses/Abschlussarbeiten/markus_semmler_bsc.pdf.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

fmpsdk

SDK for Financial Modeling Prep's (FMP) API

Language:PythonLicense:BSD-3-ClauseStargazers:1Issues:1Issues:0

gitignore

A collection of useful .gitignore templates

License:CC0-1.0Stargazers:1Issues:0Issues:0

pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Language:PythonLicense:BSD-3-ClauseStargazers:1Issues:0Issues:0

sticky-hdp-slds-hmm

An implementation of a hierarchical Dirichlet process (HDP) combined with a switching linear dynamical systems (SLDS) from https://arxiv.org/abs/1003.3829. It is a rather complex model and thus computationally expensive. Note the hyper parameters have to be adjusted.

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

approximate-signal-cancellation

This is a small framework to simulate algorithms in the area of signal processing. A GUI was implemented and simple transformations like direct inversion, Fourier transformation techniques over regression techniques are available.

Language:C++License:GPL-3.0Stargazers:0Issues:2Issues:0

qlearn

This repository contains scripts for executing QLearning algorithms onto different environments. It uses Tensorflow and features different discrete state/action environments. One can display different plots, like the value function or a comparison between different agents. The focus lies on the exploration efficiency.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

rnn-tetherball-dynamics

Bachelor project by using recurrent neural networks to predict the 3-dimensional dynamics of a tetherball. It implements highway networks and also gated recurrent units. This implementation is highly modular, based on Tensorflow and can be adapted accordingly.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

sampleproject

A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

univariate-distributions

This repository contains several continuous and discrete univariate distributions. It uses MRG32k3a generator to create uniform samples. These uniform samples are then transformed to yield a sample from any supported distribution. A space system is used to represent the domain of the samples.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0