Back to Results
First PageMeta Content
Mathematics / Mathematical optimization / Dynamic programming / Mathematical analysis / Equations / Operations research / Systems theory / Stochastic control / Bellman equation / Markov decision process / Q-learning / Reinforcement learning


Increasing the Action Gap: New Operators for Reinforcement Learning Marc G. Bellemare and Georg Ostrovski and Arthur Guez Philip S. Thomas∗ and R´emi Munos Google DeepMind {bellemare,ostrovski,aguez,munos}@google.com;
Add to Reading List

Document Date: 2015-12-12 00:05:18


Open Document

File Size: 694,77 KB

Share Result on Facebook