Back to Results
First PageMeta Content



Generalized Advantage Estimation for Policy Gradients Authors Department of Electrical Engineering and Computer Science Abstract Value functions provide an elegant solution to the delayed reward problem in reinforcement
Add to Reading List

Document Date: 2015-06-22 05:16:12


Open Document

File Size: 700,96 KB

Share Result on Facebook

Event

Movie Release / /

IndustryTerm

feedforward network / large neural networks / conjugate gradient algorithm / neural network / suboptimal solution / elegant solution / policy gradient algorithm / value function networks / approximate solution / /

Movie

Interstellar / /

Organization

Policy Gradients Authors Department of Electrical Engineering and Computer Science Abstract Value / /

Person

Jonathan Baxter / Peter L Bartlett / Richard S Sutton / Charles W Anderson / Andrew G Barto / /

Position

average Fisher information matrix / Walker / /

Product

Archos TV+ Portable Video Player (PVP) / /

Technology

conjugate gradient algorithm / policy gradient algorithm / neural network / 6.1 Policy Optimization Algorithm / ADP algorithms / /

SocialTag