RunzheStat's repositories

Language:PythonLicense:MITStargazers:9Issues:2Issues:0

TestMDP

Implementation of "Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making”(ICML 2020) in Python

Language:PythonLicense:MITStargazers:9Issues:2Issues:1
Language:PythonStargazers:5Issues:1Issues:0

aaai2021

This is the repo for the source code of the AAAI2021 paper ``Near-Optimal MNL Bandits Under Risk Criteria"

Language:PythonStargazers:0Issues:0Issues:0

bandits

Public repository for the work on bandit problems

Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

CausalDM

A Tutorial on Causal Decision Making with an Accompanying Python Package

Stargazers:0Issues:0Issues:0

causalml

Uplift modeling and causal inference with machine learning algorithms

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ContextDistributions

Code accompanying the Neurips 2019 paper "Stochastic Bandits with Context Distributions"

Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:RStargazers:0Issues:1Issues:0

Deep-Bayesian-Bandits-Showdown

Models and examples built with TensorFlow

License:NOASSERTIONStargazers:0Issues:0Issues:0

DeepBeerInventory-RL

The code for the SRDQN algorithm to train an agent for the beer game problem

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

License:NOASSERTIONStargazers:0Issues:0Issues:0

ESPRM

Code for Efficient Policy Learning from Surrogate-Loss Classification Reductions paper

Stargazers:0Issues:0Issues:0

google-research

Google Research

License:Apache-2.0Stargazers:0Issues:0Issues:0

Movie-Recommendation-using-Cascading-Bandits

Movie Recommendation using Cascading Bandits namely CascadeLinTS and CascadeLinUCB

Stargazers:0Issues:0Issues:0

PCMC-Net

PCMC-Net: Feature-based Pairwise Choice Markov Chains

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:RStargazers:0Issues:0Issues:0

rl-baselines-zoo-1

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

License:MITStargazers:0Issues:0Issues:0

StochOptForest

Replication Code for Paper "Stochastic Optimization Forests".

Language:HTMLStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0