<--- Back to Details
First PageDocument Content
Smooth functions / Distribution / Functional analysis / Universal property
Date: 2010-01-22 02:08:08
Smooth functions
Distribution
Functional analysis
Universal property

GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces Hamid Reza Maei and Richard S. Sutton Reinforcement Learning and Artificial Intelligence Laboratory, University of

Document is deleted from original location.
Use the Download Button below to download from the Web Archive.

Download Document from Web Archive

File Size: 149,40 KB