<--- Back to Details
First PageDocument Content
Markov models / Mathematical optimization / Operations research / Reinforcement learning / Expectationmaximization algorithm / Sine / Proximal gradient method / Gradient method / Markov chain / Loss function / Artificial neural network / Equation solving
Date: 2016-06-06 20:48:19
Markov models
Mathematical optimization
Operations research
Reinforcement learning
Expectationmaximization algorithm
Sine
Proximal gradient method
Gradient method
Markov chain
Loss function
Artificial neural network
Equation solving

Trust Region Policy Optimization arXiv:1502.05477v4 [cs.LG] 6 Jun 2016 John Schulman JOSCHU @ EECS . BERKELEY. EDU

Add to Reading List

Source URL: arxiv.org

Download Document from Source Website

File Size: 1.000,39 KB

Share Document on Facebook

Similar Documents

Journal of Global Optimization manuscript No. (will be inserted by the editor) Stabilizer-based symmetry breaking constraints for mathematical programs Leo Liberti · James Ostrowski

Journal of Global Optimization manuscript No. (will be inserted by the editor) Stabilizer-based symmetry breaking constraints for mathematical programs Leo Liberti · James Ostrowski

DocID: 1v0h9 - View Document

OPTIMA 88 Mathematical Optimization Society Newsletter Philippe L. Toint  MOS Chair’s Column

OPTIMA 88 Mathematical Optimization Society Newsletter Philippe L. Toint MOS Chair’s Column

DocID: 1uTp0 - View Document

The Annals of Probability 2004, Vol. 32, No. 1B, 1030–1067 © Institute of Mathematical Statistics, 2004 A STOCHASTIC REPRESENTATION THEOREM WITH APPLICATIONS TO OPTIMIZATION AND OBSTACLE PROBLEMS

The Annals of Probability 2004, Vol. 32, No. 1B, 1030–1067 © Institute of Mathematical Statistics, 2004 A STOCHASTIC REPRESENTATION THEOREM WITH APPLICATIONS TO OPTIMIZATION AND OBSTACLE PROBLEMS

DocID: 1sOP9 - View Document

Optimization of Electrical Production The production of electricity in France is optimized everyday with the help of a mathematical software developed at Inria, in collaboration with EDF R&D. Substantial performance is a

Optimization of Electrical Production The production of electricity in France is optimized everyday with the help of a mathematical software developed at Inria, in collaboration with EDF R&D. Substantial performance is a

DocID: 1rxID - View Document

Timed-Elastic-Bands for Time-Optimal Point-To-Point Nonlinear Model Predictive Control

Timed-Elastic-Bands for Time-Optimal Point-To-Point Nonlinear Model Predictive Control

DocID: 1ru4i - View Document