nmeripo / deeplearning-papernotes

Summaries and notes on Deep Learning research papers

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

2017-08

  • Deep & Cross Network for Ad Click Predictions [arXiv]
  • Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms [arXiv] [code]
  • Multi-task Self-Supervised Visual Learning [arXiv]
  • Twin Networks: Using the Future as a Regularizer [arXiv]
  • A Brief Survey of Deep Reinforcement Learning [arXiv]
  • Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation [arXiv] [code]
  • On the Effectiveness of Visible Watermarks [CVPR] [article]
  • Practical Network Blocks Design with Q-Learning [arXiv]
  • On Ensuring that Intelligent Machines Are Well-Behaved [arXiv]
  • Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control [arXiv] [code]
  • Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification [arXiv] [article]
  • Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning [arXiv]
  • Neural Expectation Maximization [arXiv] [code]
  • Google Vizier: A Service for Black-Box Optimization [Research at Google]
  • STARDATA: A StarCraft AI Research Dataset [arXiv] [code]
  • Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm [arXiv] [code] (WIP) [article]
  • Natural Language Processing with Small Feed-Forward Networks [arXiv]

2017-07

  • Photographic Image Synthesis with Cascaded Refinement Networks [arXiv] [code]
  • StarCraft II: A New Challenge for Reinforcement Learning [DeepMind Documents] [code] [article]
  • Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards [arXiv]
  • DARLA: Improving Zero-Shot Transfer in Reinforcement Learning [arXiv]
  • Voice Synthesis for in-the-Wild Speakers via a Phonological Loop [arXiv] [code] (the code will be available as soon as the authors will init the repo) [article]
  • Eyemotion: Classifying facial expressions in VR using eye-tracking cameras [arXiv] [article]
  • A Distributional Perspective on Reinforcement Learning [arXiv] [article]
  • On the State of the Art of Evaluation in Neural Language Models [arXiv]
  • Optimizing the Latent Space of Generative Networks [arXiv]
  • Neuroscience-Inspired Artificial Intelligence [Neuron] [article]
  • Learning Transferable Architectures for Scalable Image Recognition [arXiv]
  • Reverse Curriculum Generation for Reinforcement Learning [arXiv]
  • Imagination-Augmented Agents for Deep Reinforcement Learning [arXiv] [article]
  • Learning model-based planning from scratch [arXiv] [article]
  • Proximal Policy Optimization Algorithms [AWSS3] [code]
  • Automatic Recognition of Deceptive Facial Expressions of Emotion [arXiv]
  • Distral: Robust Multitask Reinforcement Learning [arXiv]
  • Creatism: A deep-learning photographer capable of creating professional work [arXiv] [article]
  • SCAN: Learning Abstract Hierarchical Compositional Visual Concepts [arXiv] [article]
  • Revisiting Unreasonable Effectiveness of Data in Deep Learning Era [arXiv] [article]
  • The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously [arXiv]
  • Deep Bilateral Learning for Real-Time Image Enhancement [arXiv] [code] [article]
  • Emergence of Locomotion Behaviours in Rich Environments [arXiv] [article]
  • Learning human behaviors from motion capture by adversarial imitation [arXiv] [article]
  • Robust Imitation of Diverse Behaviors [arXiv] [article]
  • Hindsight Experience Replay [arXiv]
  • Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks [arXiv] [article]
  • End-to-End Learning of Semantic Grasping [arXiv]
  • ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games [arXiv] [code] [article]

2017-06

  • Noisy Networks for Exploration [arXiv]
  • Do GANs actually learn the distribution? An empirical study [arXiv]
  • Gradient Episodic Memory for Continuum Learning [arXiv]
  • Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog [arXiv]
  • Deep Interest Network for Click-Through Rate Prediction [arXiv]
  • Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study [arXiv] [article]
  • Structure Learning in Motor Control: A Deep Reinforcement Learning Model [arXiv]
  • Programmable Agents [arXiv]
  • Grounded Language Learning in a Simulated 3D World [arXiv]
  • Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics [arXiv]
  • One Model To Learn Them All [arXiv] [code] [article]
  • Hybrid Reward Architecture for Reinforcement Learning [arXiv]
  • Variational Approaches for Auto-Encoding Generative Adversarial Networks [arXiv]
  • Deal or No Deal? End-to-End Learning for Negotiation Dialogues [S3AWS] [code] [article]
  • Attention Is All You Need [arXiv] [code] [article]
  • Sobolev Training for Neural Networks [arXiv]
  • YellowFin and the Art of Momentum Tuning [arXiv] [code] [article]
  • Forward Thinking: Building and Training Neural Networks One Layer at a Time [arXiv]
  • Depthwise Separable Convolutions for Neural Machine Translation [arXiv] [code]
  • Parameter Space Noise for Exploration [arXiv] [code] [article]
  • Deep Reinforcement Learning from human preferences [arXiv] [article]
  • Self-Normalizing Neural Networks [arXiv]
  • Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour [arXiv]
  • A simple neural network module for relational reasoning [arXiv] [article]
  • Visual Interaction Networks [arXiv] [article]

2017-05

  • The Cramer Distance as a Solution to Biased Wasserstein Gradients [arXiv]
  • Reinforcement Learning with a Corrupted Reward Channel [arXiv]
  • Gradient Descent Can Take Exponential Time to Escape Saddle Points [arXiv] [article]
  • ParlAI: A Dialog Research Software Platform [arXiv] [code] [article]
  • Look, Listen and Learn [arXiv]
  • Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [arXiv]
  • Convolutional Sequence to Sequence Learning [arXiv] [code] [article]
  • The Kinetics Human Action Video Dataset [arXiv] [article]
  • Safe and Nested Subgame Solving for Imperfect-Information Games [arXiv]
  • Discrete Sequential Prediction of Continuous Actions for Deep RL [arXiv]
  • Metacontrol for Adaptive Imagination-Based Optimization [arXiv]
  • Efficient Parallel Methods for Deep Reinforcement Learning [arXiv]
  • Real-Time Adaptive Image Compression [arXiv]

2017-04

  • General Video Game AI: Learning from Screen Capture [arXiv]
  • Learning to Skim Text [arXiv]
  • Get To The Point: Summarization with Pointer-Generator Networks [arXiv] [code] [article]
  • Adversarial Neural Machine Translation [arXiv]
  • Learning from Demonstrations for Real World Reinforcement Learning [arXiv]
  • A Neural Representation of Sketch Drawings [arXiv] [code] [article]
  • Automated Curriculum Learning for Neural Networks [arXiv]
  • Hierarchical Surface Prediction for 3D Object Reconstruction [arXiv] [article]
  • Neural Message Passing for Quantum Chemistry [arXiv]
  • Learning to Generate Reviews and Discovering Sentiment [arXiv]
  • Best Practices for Applying Deep Learning to Novel Applications [arXiv]

2017-03

  • Improved Training of Wasserstein GANs [arXiv]
  • Evolution Strategies as a Scalable Alternative to Reinforcement Learning [arXiv]
  • Controllable Text Generation [arXiv]
  • Neural Episodic Control [arXiv]
  • A Structured Self-attentive Sentence Embedding [arXiv]
  • Multi-step Reinforcement Learning: A Unifying Algorithm [arXiv]
  • Deep learning with convolutional neural networks for brain mapping and decoding of movement-related information from the human EEG [arXiv]
  • Massive Exploration of Neural Machine Translation Architectures [arXiv] [code]
  • Minimax Regret Bounds for Reinforcement Learning [arXiv]
  • Sharp Minima Can Generalize For Deep Nets [arXiv]
  • Parallel Multiscale Autoregressive Density Estimation [arXiv]
  • Neural Machine Translation and Sequence-to-sequence Models: A Tutorial [arXiv]
  • Large-Scale Evolution of Image Classifiers [arXiv]
  • FeUdal Networks for Hierarchical Reinforcement Learning [arXiv]
  • Evolving Deep Neural Networks [arXiv]
  • How to Escape Saddle Points Efficiently [arXiv] [article]
  • Understanding Synthetic Gradients and Decoupled Neural Interfaces [arXiv]

2017-02

  • The Shattered Gradients Problem: If resnets are the answer, then what is the question? [arXiv]
  • Neural Map: Structured Memory for Deep Reinforcement Learning [arXiv]
  • Bridging the Gap Between Value and Policy Based Reinforcement Learning [arXiv]
  • Deep Voice: Real-time Neural Text-to-Speech [arXiv]
  • Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning [arXiv]
  • The Game Imitation: Deep Supervised Convolutional Networks for Quick Video Game AI [arXiv]
  • Learning to Parse and Translate Improves Neural Machine Translation [arXiv]
  • All-but-the-Top: Simple and Effective Postprocessing for Word Representations [arXiv]
  • Deep Learning with Dynamic Computation Graphs [arXiv]
  • Skip Connections as Effective Symmetry-Breaking [arXiv]
  • odelSemi-Supervised QA with Generative Domain-Adaptive Nets [arXiv]

2017-01

  • Wasserstein GAN [arXiv]
  • Deep Reinforcement Learning: An Overview [arXiv]
  • DyNet: The Dynamic Neural Network Toolkit [arXiv]
  • DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker [arXiv]
  • NIPS 2016 Tutorial: Generative Adversarial Networks [arXiv]

2016-12

  • A recurrent neural network without Chaos [arXiv]
  • Language Modeling with Gated Convolutional Networks [arXiv]
  • Learning from Simulated and Unsupervised Images through Adversarial Training [arXiv]
  • How Grammatical is Character-level Neural Machine Translation? Assessing MT Quality with Contrastive Translation Pairs [arXiv]
  • Improving Neural Language Models with a Continuous Cache [arXiv]
  • DeepMind Lab [arXiv] [code]
  • Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning [arXiv]
  • Overcoming catastrophic forgetting in neural networks [arXiv]

2016-11 (ICLR Edition)

Reinforcement Learning:

-Learning to reinforcement learn [arXiv]

Machine Translation & Dialog

2016-10

2016-09

  • Towards Deep Symbolic Reinforcement Learning [arXiv]
  • HyperNetworks [arXiv]
  • Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation [arXiv]
  • Safe and Efficient Off-Policy Reinforcement Learning [arXiv]
  • Playing FPS Games with Deep Reinforcement Learning [arXiv]
  • SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient [arXiv]
  • Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks [arXiv]
  • Energy-based Generative Adversarial Network [arXiv]
  • Stealing Machine Learning Models via Prediction APIs [arXiv]
  • Semi-Supervised Classification with Graph Convolutional Networks [arXiv]
  • WaveNet: A Generative Model For Raw Audio [arXiv]
  • Hierarchical Multiscale Recurrent Neural Networks [arXiv]
  • End-to-End Reinforcement Learning of Dialogue Agents for Information Access [arXiv]
  • Deep Neural Networks for YouTube Recommendations [paper]

2016-08

  • Semantics derived automatically from language corpora contain human-like biases [arXiv]
  • Why does deep and cheap learning work so well? [arXiv]
  • Machine Comprehension Using Match-LSTM and Answer Pointer [arXiv]
  • Stacked Approximated Regression Machine: A Simple Deep Learning Approach [arXiv]
  • Decoupled Neural Interfaces using Synthetic Gradients [arXiv]
  • WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia [arXiv]
  • Temporal Attention Model for Neural Machine Translation [arXiv]
  • Residual Networks of Residual Networks: Multilevel Residual Networks [arXiv]
  • Learning Online Alignments with Continuous Rewards Policy Gradient [arXiv]

2016-07

2016-06

  • Sequence-to-Sequence Learning as Beam-Search Optimization [arXiv]
  • Sequence-Level Knowledge Distillation [arXiv]
  • Policy Networks with Two-Stage Training for Dialogue Systems [arXiv]
  • Towards an integration of deep learning and neuroscience [arXiv]
  • On Multiplicative Integration with Recurrent Neural Networks [arxiv]
  • Wide & Deep Learning for Recommender Systems [arXiv]
  • Online and Offline Handwritten Chinese Character Recognition [arXiv]
  • Tutorial on Variational Autoencoders [arXiv]
  • Concrete Problems in AI Safety [arXiv]
  • Deep Reinforcement Learning Discovers Internal Models [arXiv]
  • SQuAD: 100,000+ Questions for Machine Comprehension of Text [arXiv]
  • Conditional Image Generation with PixelCNN Decoders [arXiv]
  • Model-Free Episodic Control [arXiv]
  • Progressive Neural Networks [arXiv]
  • Improved Techniques for Training GANs [arXiv] [code]
  • Memory-Efficient Backpropagation Through Time [arXiv]
  • InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets [arXiv]
  • Zero-Resource Translation with Multi-Lingual Neural Machine Translation [arXiv]
  • Key-Value Memory Networks for Directly Reading Documents [arXiv]
  • Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translatin [arXiv]
  • Learning to learn by gradient descent by gradient descent [arXiv]
  • Learning Language Games through Interaction [arXiv]
  • Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations [arXiv]
  • Smart Reply: Automated Response Suggestion for Email [arXiv]
  • Virtual Adversarial Training for Semi-Supervised Text Classification [arXiv]
  • Deep Reinforcement Learning for Dialogue Generation [arXiv]
  • Very Deep Convolutional Networks for Natural Language Processing [arXiv]
  • Neural Net Models for Open-Domain Discourse Coherence [arXiv]
  • Neural Architectures for Fine-grained Entity Type Classification [arXiv]
  • Matching Networks for One Shot Learning [arXiv]
  • Cooperative Inverse Reinforcement Learning [arXiv] [article]
  • Gated-Attention Readers for Text Comprehension [arXiv]
  • End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning [arXiv]
  • Iterative Alternating Neural Attention for Machine Reading [arXiv]
  • Memory-enhanced Decoder for Neural Machine Translation [arXiv]
  • Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation [arXiv]
  • Natural Language Comprehension with the EpiReader [arXiv]
  • Conversational Contextual Cues: The Case of Personalization and History for Response Ranking [arXiv]
  • Adversarially Learned Inference [arXiv]
  • OpenAI Gym [arXiv] [code]
  • Neural Network Translation Models for Grammatical Error Correction [arXiv]

2016-05

  • Hierarchical Memory Networks [arXiv]
  • Deep API Learning [arXiv]
  • Wide Residual Networks [arXiv]
  • TensorFlow: A system for large-scale machine learning [arXiv]
  • Learning Natural Language Inference using Bidirectional LSTM model and Inner-Attention [arXiv]
  • Aspect Level Sentiment Classification with Deep Memory Network [arXiv]
  • FractalNet: Ultra-Deep Neural Networks without Residuals [arXiv]
  • Learning End-to-End Goal-Oriented Dialog [arXiv]
  • One-shot Learning with Memory-Augmented Neural Networks [arXiv]
  • Deep Learning without Poor Local Minima [arXiv]
  • AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge [arXiv]
  • Data Programming: Creating Large Training Sets, Quickly [arXiv]
  • Deeply-Fused Nets [arXiv]
  • Deep Portfolio Theory [arXiv]
  • Unsupervised Learning for Physical Interaction through Video Prediction [arXiv]
  • Movie Description [arXiv]

2016-04

2016-03

2016-02

2016-01

2015-12

NLP

Vision

2015-11

NLP

Programs

  • Neural Random-Access Machines [arxiv]
  • Neural Programmer: Inducing Latent Programs with Gradient Descent [arXiv]
  • Neural Programmer-Interpreters [arXiv]
  • Learning Simple Algorithms from Examples [arXiv]
  • Neural GPUs Learn Algorithms [arXiv] [code]
  • On Learning to Think: Algorithmic Information Theory for Novel Combinations of Reinforcement Learning Controllers and Recurrent Neural World Models [arXiv]

Vision

  • ReSeg: A Recurrent Neural Network for Object Segmentation [arXiv]
  • Deconstructing the Ladder Network Architecture [arXiv]
  • Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks [arXiv]

General

2015-10

2015-09

2015-08

2015-07

2015-06

2015-05

2015-04

  • Correlational Neural Networks [arXiv]

2015-03

2015-02

2015-01

  • Hidden Technical Debt in Machine Learning Systems [NIPS]

2014-12

2014-11

  • The Loss Surfaces of Multilayer Networks [arXiv]

2014-10

2014-09

2014-08

  • Convolutional Neural Networks for Sentence Classification [arxiv]

2014-07

2014-06

2014-05

2014-04

  • A Convolutional Neural Network for Modelling Sentences [arXiv]

2014-03

2014-02

2014-01

2013

  • Visualizing and Understanding Convolutional Networks [arXiv]
  • DeViSE: A Deep Visual-Semantic Embedding Model [pub]
  • Maxout Networks [arXiv]
  • Exploiting Similarities among Languages for Machine Translation [arXiv]
  • Efficient Estimation of Word Representations in Vector Space [arXiv]

2011

  • Natural Language Processing (almost) from Scratch [arXiv]

About

Summaries and notes on Deep Learning research papers