Back to Results
First PageMeta Content
Markov models / Mathematical optimization / Operations research / Reinforcement learning / Expectationmaximization algorithm / Sine / Proximal gradient method / Gradient method / Markov chain / Loss function / Artificial neural network / Equation solving


Trust Region Policy Optimization arXiv:1502.05477v4 [cs.LG] 6 Jun 2016 John Schulman JOSCHU @ EECS . BERKELEY. EDU
Add to Reading List

Document Date: 2016-06-06 20:48:19


Open Document

File Size: 1.000,39 KB

Share Result on Facebook