| Document Date: 2009-05-25 10:34:02 Open Document File Size: 1,46 MBShare Result on Facebook
Company LSTM Recurrent Neural Networks / Recurrent Neural Networks / / Country Germany / / / Facility Switzerland Planck Institute / University of Lugano / / IndustryTerm history-dependent baseline network / conventional neural networks / reinforcement learning algorithm / neural network / extended baseline network / car driving simulation / neural networks / policy gradient algorithms / large product / baseline network / / Organization Institut f¨ / Switzerland Planck Institute for Biological Cybernetics / Faculty of Informatics / University of Lugano / / Person Jan Peters / / Position return Rt / actor / rt / ot and reward rt / controller / Corresponding author / / Product RPG / / Technology policy gradient algorithms / neural network / SRV algorithm / machine learning / simulation / RL algorithm / reinforcement learning algorithm / PG algorithms / Recurrent Policy Gradient algorithm / /
SocialTag |