Beast code in Giters

An independent, student-led replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" (Nature 529, 484-489, 28 Jan 2016), details of which can be found on their website https://deepmind.com/publications.html.

Language:PythonMIT000

RL-Chatbot

🤖 Deep Reinforcement Learning Chatbot

Language:PythonMIT000

multimodal_varinf

Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".

Language:PythonMIT000

Kullback-Leibler-divergences-and-kl-UCB-indexes

🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes

Language:HTMLMIT000

Udacity-Path-Planning-Project

Language:C++MIT000

Learn-Graph-Laplacian

Implementation of the paper Learning Laplacian Matrix in Smooth Graph Signal Representations

Language:Python000

detection-estimation-learning

Python notebooks for my graduate class on Detection, Estimation, and Learning. Intended for in-class demonstration. Notebooks illustrate a variety of concepts, from hypothesis testing to estimation to image denoising to Kalman filtering. Feel free to use or modify for your instruction or self-study.

Language:Jupyter Notebook000

fedorajzf

fedorajzf's repositories

temporal_abstraction

Imagination-Augmented-Agents

supervised-reptile

handful-of-trials

variance_reduced_neural_networks

IM_GreedyCELF

Machine-Learning-and-Reinforcement-Learning-in-Finance

DeepSurv

lola

quadprog

robust

smop

Simulator

coop-cut

e2e-model-learning

fisher-information-matrix

relax

OTML_DS3_2018

RocAlphaGo

RL-Chatbot

multimodal_varinf

Kullback-Leibler-divergences-and-kl-UCB-indexes

Udacity-Path-Planning-Project

Learn-Graph-Laplacian

detection-estimation-learning

qmix

dirt-t

DSR

PPO-Stein-Control-Variate

primal-dual-toolbox