RunzheStat

followers

following

stars

RunzheStat's repositories

D2OPE

Language:PythonMIT9 20

TestMDP

Implementation of "Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making”(ICML 2020) in Python

Language:PythonMIT9 2 1

CausalMARL

Language:Python5 10

aaai2021

This is the repo for the source code of the AAAI2021 paper ``Near-Optimal MNL Bandits Under Risk Criteria"

Language:Python000

bandits

Public repository for the work on bandit problems

Language:Python000

Causal-Decision-Making

Language:Jupyter Notebook000

CausalDM

A Tutorial on Causal Decision Making with an Accompanying Python Package

000

causalml

Uplift modeling and causal inference with machine learning algorithms

Language:PythonNOASSERTION000

ContextDistributions

Code accompanying the Neurips 2019 paper "Stochastic Bandits with Context Distributions"

Language:Python000

continuous-policy-learning

Language:Jupyter Notebook000

COVID

Language:R010

Deep-Bayesian-Bandits-Showdown

Models and examples built with TensorFlow

NOASSERTION000

DeepBeerInventory-RL

The code for the SRDQN algorithm to train an agent for the beer game problem

Language:PythonBSD-3-Clause000

DoubleReinforcementLearningMDP

Language:Python000

EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

NOASSERTION000

ESPRM

Code for Efficient Policy Learning from Surrogate-Loss Classification Reductions paper

000

google-research

Google Research

Apache-2.0000

Movie-Recommendation-using-Cascading-Bandits

Movie Recommendation using Cascading Bandits namely CascadeLinTS and CascadeLinUCB

000

PCMC-Net

PCMC-Net: Feature-based Pairwise Choice Markov Chains

Language:PythonMIT000

PolicyLearning

Language:Jupyter NotebookGPL-3.0000

RetargetedPolicyLearning

Language:R000

rl-baselines-zoo-1

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

MIT000

StochOptForest

Replication Code for Paper "Stochastic Optimization Forests".

Language:HTML000

Synthetic-Control

MIT000