<--- Back to Details
First PageDocument Content
Markov processes / Probability theory / Probability / Dynamic programming / Markov decision process / Stochastic control / Reinforcement learning / Memorylessness
Date: 2015-02-02 16:39:01
Markov processes
Probability theory
Probability
Dynamic programming
Markov decision process
Stochastic control
Reinforcement learning
Memorylessness

MultiGain: A controller synthesis tool for MDPs with multiple mean-payoff objectives Tom´ aˇs Br´ azdil1, Krishnendu Chatterjee2 , Vojtˇech Forejt3 , and Anton´ın Kuˇcera1 1

Add to Reading List

Source URL: qav.comlab.ox.ac.uk

Download Document from Source Website

File Size: 370,21 KB

Share Document on Facebook

Similar Documents

Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov

Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov

DocID: 1xVVh - View Document

Distributed Computing Prof. R. Wattenhofer SA/MA:  Byzantine Reinforcement Learning

Distributed Computing Prof. R. Wattenhofer SA/MA: Byzantine Reinforcement Learning

DocID: 1xVKs - View Document

Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning

Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning

DocID: 1xV3l - View Document

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

DocID: 1xUBi - View Document

Cellular	Network	Traffic	Scheduling	 using	Deep	Reinforcement	Learning Sandeep	Chinchali,	et.	al.	Marco	Pavone,	Sachin	Katti Stanford	University	 AAAI	2018

Cellular Network Traffic Scheduling using Deep Reinforcement Learning Sandeep Chinchali, et. al. Marco Pavone, Sachin Katti Stanford University AAAI 2018

DocID: 1xUAT - View Document