Markov processes / Mathematics / Probability theory / Mathematical analysis / Dynamic programming / Markov decision process / Stochastic control / Markov chain / Mathematical optimization / Distribution


Stat 260/CS Learning in Sequential Decision Problems. Peter Bartlett.
1. Recall: MDPs.
2. Value iteration.
3. Policy iteration.

Document Date: 2014-11-25 12:45:38


File Size: 43,09 KB
