<--- Back to Details
First PageDocument Content
Convex optimization / Mathematical optimization / Number theory / Topological groups / Markov decision process / Linear programming / Reinforcement learning / Representation theory / Μ operator / Mathematics / Algebra / Operations research
Date: 2007-10-21 19:53:48
Convex optimization
Mathematical optimization
Number theory
Topological groups
Markov decision process
Linear programming
Reinforcement learning
Representation theory
Μ operator
Mathematics
Algebra
Operations research

Stable Dual Dynamic Programming Tao Wang Daniel Lizotte Michael Bowling Dale Schuurmans Department of Computing Science

Add to Reading List

Source URL: webdocs.cs.ualberta.ca

Download Document from Source Website

File Size: 139,09 KB

Share Document on Facebook

Similar Documents

Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov

Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov

DocID: 1xVVh - View Document

Distributed Computing Prof. R. Wattenhofer SA/MA:  Byzantine Reinforcement Learning

Distributed Computing Prof. R. Wattenhofer SA/MA: Byzantine Reinforcement Learning

DocID: 1xVKs - View Document

Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning

Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning

DocID: 1xV3l - View Document

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

DocID: 1xUBi - View Document

Cellular	Network	Traffic	Scheduling	 using	Deep	Reinforcement	Learning Sandeep	Chinchali,	et.	al.	Marco	Pavone,	Sachin	Katti Stanford	University	 AAAI	2018

Cellular Network Traffic Scheduling using Deep Reinforcement Learning Sandeep Chinchali, et. al. Marco Pavone, Sachin Katti Stanford University AAAI 2018

DocID: 1xUAT - View Document