Marco Birck's repositories
CatastrophicForgetting-EWC
WORK IN PROGRESS: PyTorch implementation of supervised and deep Q-learning EWC (Elastic Weight Consolidation), introduced in "Overcoming Catastrophic Forgetting in Neural Networks"
adaptative-dropout-pytorch
PyTorch implementation of Adaptive Dropout, a.k.a. Standout.
AttentionTRL
Attention mechanism incorporated into Asynchronous Advantage Actor-Critic (A3C/A2C, DeepMind)
Maml_Reptile_PyTorch
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks" in PyTorch
Atari-WGAN
WGAN implementation for generating Atari game images. (GAN, WGAN, ATARI, Generative)
Geo_Classifier_siamese_NN
Generic geolocation classifier built on siamese neural networks with TensorFlow and Keras.
joint_ppo_pytorch
Extension of https://github.com/ikostrikov/pytorch-a2c-ppo-acktr, making it feasible to train on multiple games simultaneously.
and-nd-firebase
Course code repository for Firebase in a Weekend by Google: Android
AndroidUdacity
Udacity course, implementations and code.
Berkeley_CS188
Repository containing solutions for the Berkeley CS188 class.
CS20SI_Tensorflow4DL_Research
Code and other material from the Stanford course on TensorFlow
CS294-DeepRL
My material for the CS294 Deep Reinforcement Learning course, taught by Sergey Levine at UC Berkeley.
DDPG-Keras-Torcs
Using Keras and Deep Deterministic Policy Gradient to play TORCS
Deep_RL_Bootcamp
Solutions for the labs in Deep RL Bootcamp.
deeplearning_tutorials
Plenty of deep learning resources with companion notebooks, for learning purposes.
Generic_Seq2Seq
A replication of the original Seq2Seq from the PyTorch tutorials, made easy to use and adapt.
joint_tf_ppo
Extension of https://github.com/openai/baselines, making it feasible to train on multiple games simultaneously.
modular_DeepRL
Attempt to implement the A2C and PPO algorithms with the modular properties of Maxout and LWTA. # UNFINISHED AND FAILED
our-daily-paper
List of papers I have read or will read, just to keep track. (I should have done this before!!!)
pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
pytorch-ewc
PyTorch implementation of DeepMind's PNAS 2017 paper "Overcoming Catastrophic Forgetting"
pytorch-lr-scheduler
Brings some LR schedulers from Keras to PyTorch.
Video_GAN_Sonic
[UNDERDEVELOPED, CHECK THE LINK BELOW] This was an early attempt to generate a Sonic frame from past frames using GANs. I am opening it up because it contains plenty of useful infrastructure code for the steps involved, even though no convergence is achieved in this repo!
rl_a3c_pytorch
Reinforcement learning with an A3C LSTM agent on Atari, in PyTorch