fedorajzf's repositories

temporal_abstraction

Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space.

Language:PythonStargazers:0Issues:0Issues:0

Imagination-Augmented-Agents

Building Agents with Imagination: pytorch step-by-step implementation

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

supervised-reptile

Code for the paper "On First-Order Meta-Learning Algorithms"

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language:PythonStargazers:0Issues:0Issues:0

variance_reduced_neural_networks

Implementation of SVRG and SAGA optimization algorithms for deep learning topics.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

IM_GreedyCELF

Source code for blog post at https://hautahi.com/im_greedycelf

Language:HTMLStargazers:0Issues:0Issues:0

Machine-Learning-and-Reinforcement-Learning-in-Finance

Machine Learning and Reinforcement Learning in Finance New York University Tandon School of Engineering

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

DeepSurv

DeepSurv is a deep learning approach to survival analysis.

License:MITStargazers:0Issues:0Issues:0

lola

Code release for Learning with Opponent-Learning Awareness and variations.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

quadprog

Quadratic Programming Solver

Language:CLicense:GPL-2.0Stargazers:0Issues:0Issues:0

robust

Robust optimization for power markets

Language:PythonStargazers:0Issues:0Issues:0

smop

Small Matlab to Python compiler

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Simulator

Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

coop-cut

Cooperative Cut is a Markov Random Field inference method with high-order edge potentials.

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

e2e-model-learning

Task-based end-to-end model learning in stochastic optimization

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fisher-information-matrix

PyTorch implementation of FIM and empirical FIM

Language:PythonStargazers:0Issues:0Issues:0

relax

Optimizing control variates for black-box gradient estimation

Language:PythonStargazers:0Issues:0Issues:0

OTML_DS3_2018

Practical sessions for the Optimal Transport and Machine learning course at DS3 2018

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

RocAlphaGo

An independent, student-led replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" (Nature 529, 484-489, 28 Jan 2016), details of which can be found on their website https://deepmind.com/publications.html.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

RL-Chatbot

🤖 Deep Reinforcement Learning Chatbot

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

multimodal_varinf

Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Kullback-Leibler-divergences-and-kl-UCB-indexes

🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:0Issues:0

Learn-Graph-Laplacian

Implementation of the paper Learning Laplacian Matrix in Smooth Graph Signal Representations

Language:PythonStargazers:0Issues:0Issues:0

detection-estimation-learning

Python notebooks for my graduate class on Detection, Estimation, and Learning. Intended for in-class demonstration. Notebooks illustrate a variety of concepts, from hypothesis testing to estimation to image denoising to Kalman filtering. Feel free to use or modify for your instruction or self-study.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

dirt-t

A DIRT-T Approach to Unsupervised Domain Adaptation (ICLR 2018)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DSR

Deep Successor Representation

Language:PythonStargazers:0Issues:0Issues:0

PPO-Stein-Control-Variate

Proximal Policy Optimization with Stein Control Variates:

License:MITStargazers:0Issues:0Issues:0

primal-dual-toolbox

GPU-based Total (Generalized) Variation implementation for various applications, with Python and Matlab wrappers.

Language:C++License:LGPL-3.0Stargazers:0Issues:0Issues:0