Neural networks / SARSA / Q-learning / Reinforcement learning / Temporal difference learning / Backpropagation / Markov decision process / Algorithm / Machine learning / Statistics / Computational neuroscience
Date: 2011-01-10 03:02:38

Applying Reinforcement Learning to Obstacle Avoidance
Josh Beitelspacher
University of Oklahoma, 308 Cate Center Drive Box 5242, Norman, OK [removed] USA [removed]

Source URL: www.netbeetle.com

File Size: 228,39 KB

Similar Documents

An Introduction to Temporal Difference Learning. Florian Kunz, Seminar on Autonomous Learning Systems, Department of Computer Science, TU Darmstadt

DocID: 1uLS3

Two-day seminar on Stochastic Dynamic Programming and Temporal Difference Reinforcement Learning Hino Campus, Tokyo Metropolitan University (首都大学東京日野キャンパス)

DocID: 1uuW0

Temporal-Difference Learning to Assist Human Decision Making during the Control of an Artificial Limb Ann L. Edwards Department of Computing Science University of Alberta

DocID: 1tEaY

Accelerated Gradient Temporal Difference Learning Yangchen Pan, Adam White, and Martha White {yangpan, adamw, martha}@indiana.edu Department of Computer Science, Indiana University Abstract

DocID: 1tnPW

Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation Richard S. Sutton,∗ Hamid Reza Maei,∗ Doina Precup,† Shalabh Bhatnagar,‡ David Silver,∗ Csaba Szepesvári,∗ E

DocID: 1sN6w