Back to Results
First PageMeta Content
Dynamic programming / Markov processes / Stochastic control / Operations research / Mathematical optimization / Markov decision process / Q-learning / Reinforcement learning / Convex hull / Mathematics / Algebra / Statistics


Learning All Optimal Policies with Multiple Criteria Leon Barrett Srini Narayanan 1947 Center St. Ste. 600, Berkeley, CA 94704
Add to Reading List

Document Date: 2008-05-22 03:19:26


Open Document

File Size: 195,70 KB

Share Result on Facebook

City

Washington / DC / Chester / Bonn / Helsinki / /

Company

Princeton University Press / Kaelbling L. P. / Cambridge University Press / MIT Press / Russell / /

Country

Germany / Guinea / Finland / /

/

IndustryTerm

iteration algorithm / dot product / temporal difference learning algorithm / maximum-hyperplane algorithm / off-policy learning algorithms / absolute utilities / value iteration algorithm / basic algorithm / learning algorithm / approximation algorithms / food / /

Organization

Cambridge University / MIT / Princeton University / /

Position

author / /

ProvinceOrState

California / Massachusetts / /

PublishedMedium

Machine Learning / Journal of Machine Learning Research / /

Technology

Convex Hull Value Iteration Algorithm / learning algorithm / main algorithm / artificial intelligence / temporal difference learning algorithm / 1 Value iteration algorithm / value iteration algorithm / off-policy learning algorithms / RL algorithms / approximation algorithms / Machine Learning / maximum-hyperplane algorithm / Complexity This algorithm / basic algorithm / /

SocialTag