fedorajzf's repositories
temporal_abstraction
Option-Critic with subgoal discovery by spectral decomposition of the successor features matrix or by clustering in successor feature space.
Imagination-Augmented-Agents
Building Agents with Imagination: pytorch step-by-step implementation
supervised-reptile
Code for the paper "On First-Order Meta-Learning Algorithms"
handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
variance_reduced_neural_networks
Implementation of SVRG and SAGA optimization algorithms for deep learning topics.
IM_GreedyCELF
Source code for blog post at https://hautahi.com/im_greedycelf
Machine-Learning-and-Reinforcement-Learning-in-Finance
Machine Learning and Reinforcement Learning in Finance, New York University Tandon School of Engineering
DeepSurv
DeepSurv is a deep learning approach to survival analysis.
lola
Code release for Learning with Opponent-Learning Awareness and variations.
quadprog
Quadratic Programming Solver
robust
Robust optimization for power markets
smop
Small Matlab to Python compiler
Simulator
Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning
coop-cut
Cooperative Cut is a Markov Random Field inference method with high-order edge potentials.
e2e-model-learning
Task-based end-to-end model learning in stochastic optimization
fisher-information-matrix
PyTorch implementation of FIM and empirical FIM
relax
Optimizing control variates for black-box gradient estimation
OTML_DS3_2018
Practical sessions for the Optimal Transport and Machine learning course at DS3 2018
RocAlphaGo
An independent, student-led replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" (Nature 529, 484-489, 28 Jan 2016), details of which can be found on their website https://deepmind.com/publications.html.
RL-Chatbot
🤖 Deep Reinforcement Learning Chatbot
multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
Kullback-Leibler-divergences-and-kl-UCB-indexes
🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes.
Learn-Graph-Laplacian
Implementation of the paper "Learning Laplacian Matrix in Smooth Graph Signal Representations"
detection-estimation-learning
Python notebooks for my graduate class on Detection, Estimation, and Learning. Intended for in-class demonstration. Notebooks illustrate a variety of concepts, from hypothesis testing to estimation to image denoising to Kalman filtering. Feel free to use or modify for your instruction or self-study.
dirt-t
A DIRT-T Approach to Unsupervised Domain Adaptation (ICLR 2018)
DSR
Deep Successor Representation
PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates
primal-dual-toolbox
GPU-based Total (Generalized) Variation implementation for various applications, with Python and Matlab wrappers.