Back to Results
First PageMeta Content
Markov models / Operations research / Estimation theory / Expectation–maximization algorithm / Markov decision process / Optimal control / Reinforcement learning / Dynamic programming / Symbol / Statistics / Markov processes / Mathematical optimization


Reinforcement Learning of Motor Skills in High Dimensions: A Path Integral Approach
Add to Reading List

Document Date: 2011-02-17 01:14:00


Open Document

File Size: 1,05 MB

Share Result on Facebook

City

Springer / New York / Berlin / Vancouver / Cambridge / Beijing / Anchorage / /

Company

Neural Systems / Neural Information Processing Systems / ATR Computational Neuroscience Laboratories / Neural Networks / MIT Press / Proc Natl Acad Sci U S A / Intelligent Robotics Systems / Mti / Motor Control Laboratory / /

Country

United States / /

Currency

pence / USD / /

Facility

University of Southern California / American Institute of Physics Conference Series / /

IndustryTerm

open chain / stochastic systems / dimensional control systems / viscosity solutions / learning systems / Analytical solutions / individual algorithms / highdimensional continuous state-action systems / nonlinear stochastic systems / reinforcement learning systems / monte carlo em algorithm / stochastic multi-agent systems / gradient algorithms / control address systems / probability matching algorithms / gradient algorithm / autonomous learning systems / stochastic dynamical systems / actor-critic algorithms / stochastic control systems / /

Organization

National Science Foundation / MIT / American Institute of Physics Conference Series / University of Southern California / Department of Defense / /

Person

Integral Approach Evangelos Theodorou / Yaakov Engel / Mohammad Ghavamzadeh / Mete Soner / Richard S. Sutton / Stochastic Differential / Marc P. Deisenroth / Georgios Kontes / Emanuel Todorov / Nikos Vlassis / Marc Toussaint / Jonas Buchli / Carl E. Rasmussen / Andrew G. Barto / Robert F. Stengel / Christos Dimitrakakis / Jan Peters / Michail G. Lagoudakis / Savas Piperidis / /

Position

straight forward / rt / researcher / kind rt / head / general control system / immediate cost function rt / Forward / /

Product

pseudo / PI2 / /

ProvinceOrState

Alaska / British Columbia / New York / /

PublishedMedium

Machine Learning / Journal of Artificial Intelligence Research / /

Region

Southern California / /

Technology

Neuroscience / monte carlo em algorithm / individual algorithms / PI2 algorithm / probability matching algorithms / RL algorithms / Machine learning / 2400 Algorithm / resulting algorithm / gradient algorithms / interesting algorithm / PoWER algorithm / Bayesian actor-critic algorithms / gradient algorithm / /

URL

http /

SocialTag