<--- Back to Details
First PageDocument Content
Machine learning algorithms / Money / Finance / Artificial intelligence / Language acquisition / Richard S. Sutton / Reinforcement learning / Bootstrapping / Andrew Barto / Barto
Date: 2018-02-06 02:47:52
Machine learning algorithms
Money
Finance
Artificial intelligence
Language acquisition
Richard S. Sutton
Reinforcement learning
Bootstrapping
Andrew Barto
Barto

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

Add to Reading List

Source URL: www.cs.ubc.ca

Download Document from Source Website

File Size: 833,84 KB

Share Document on Facebook

Similar Documents

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

DocID: 1xUBi - View Document

Artificial Intelligence–211  Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning Richard S. Sutton a,∗ , Doina Precup b , Satinder Singh a

Artificial Intelligence–211 Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning Richard S. Sutton a,∗ , Doina Precup b , Satinder Singh a

DocID: 1tkkb - View Document

Artificial Intelligence–211  Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning Richard S. Sutton a,∗ , Doina Precup b , Satinder Singh a

Artificial Intelligence–211 Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning Richard S. Sutton a,∗ , Doina Precup b , Satinder Singh a

DocID: 1tjGB - View Document

Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation Richard S. Sutton,∗ Hamid Reza Maei,∗ Doina Precup,† Shalabh Bhatnagar,‡ David Silver,∗ Csaba Szepesv´ari,∗ E

Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation Richard S. Sutton,∗ Hamid Reza Maei,∗ Doina Precup,† Shalabh Bhatnagar,‡ David Silver,∗ Csaba Szepesv´ari,∗ E

DocID: 1sN6w - View Document

CURRICULUM VITAE  Richard S. Sutton April 2015 Professor, Department of Computing Science, University of Alberta address: Athabasca Hall 2-21, University of Alberta, Edmonton, AB T6G 2E8

CURRICULUM VITAE Richard S. Sutton April 2015 Professor, Department of Computing Science, University of Alberta address: Athabasca Hall 2-21, University of Alberta, Edmonton, AB T6G 2E8

DocID: 1pmtN - View Document