First Page | Document Content | |
---|---|---|
Date: 2015-02-02 16:39:01Markov processes Probability theory Probability Dynamic programming Markov decision process Stochastic control Reinforcement learning Memorylessness | MultiGain: A controller synthesis tool for MDPs with multiple mean-payoff objectives Tom´ aˇs Br´ azdil1, Krishnendu Chatterjee2 , Vojtˇech Forejt3 , and Anton´ın Kuˇcera1 1Add to Reading ListSource URL: qav.comlab.ox.ac.ukDownload Document from Source WebsiteFile Size: 370,21 KBShare Document on Facebook |
Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim EgorovDocID: 1xVVh - View Document | |
Distributed Computing Prof. R. Wattenhofer SA/MA: Byzantine Reinforcement LearningDocID: 1xVKs - View Document | |
Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) LearningDocID: 1xV3l - View Document | |
Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017DocID: 1xUBi - View Document | |
Cellular Network Traffic Scheduling using Deep Reinforcement Learning Sandeep Chinchali, et. al. Marco Pavone, Sachin Katti Stanford University AAAI 2018DocID: 1xUAT - View Document |