<--- Back to Details
First PageDocument Content
Stochastic control / Bayesian statistics / Statistical theory / Markov decision process / Reinforcement learning / Bayesian inference / Machine learning / Q-learning / Statistics / Dynamic programming / Markov processes
Date: 2012-06-07 13:19:58
Stochastic control
Bayesian statistics
Statistical theory
Markov decision process
Reinforcement learning
Bayesian inference
Machine learning
Q-learning
Statistics
Dynamic programming
Markov processes

Near-Optimal BRL using Optimistic Local Transitions Mauricio Araya-L´ opez, Vincent Thomas, Olivier Buffet maraya|vthomas|[removed] LORIA, Campus scientifique, BP 239, 54506 Vandœuvre-ls-Nancy cedex, FRANCE

Add to Reading List

Source URL: icml.cc

Download Document from Source Website

File Size: 502,12 KB

Share Document on Facebook

Similar Documents

Machine Learning for Q&A sites: state of the art and research directions Xavier Amatriain Quora Inc. 650 Castro St #450 Mountain View, CA, US

Machine Learning for Q&A sites: state of the art and research directions Xavier Amatriain Quora Inc. 650 Castro St #450 Mountain View, CA, US

DocID: 1xVAM - View Document

MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Learning to Control Partial Differential Equations: Regularized Fitted Q-Iteration Approach Farahmand, A.-M.; Nabi, S.; Grover, P.; Nikovski, D.N.

MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Learning to Control Partial Differential Equations: Regularized Fitted Q-Iteration Approach Farahmand, A.-M.; Nabi, S.; Grover, P.; Nikovski, D.N.

DocID: 1vimT - View Document

Deep Q-learning from Demonstrations Matej Vecerik Todd Hester  Olivier Pietquin

Deep Q-learning from Demonstrations Matej Vecerik Todd Hester Olivier Pietquin

DocID: 1vgGk - View Document

Session 5 Relaxed dynamic programming and Q-learning Reading assignment Check the main results and examples of these papers.  • Lincoln/Rantzer, TAC 51:)

Session 5 Relaxed dynamic programming and Q-learning Reading assignment Check the main results and examples of these papers. • Lincoln/Rantzer, TAC 51:)

DocID: 1uUGg - View Document

L5: Relaxed dynamic programming and Q-learning  • Relaxed Dynamic Programming

L5: Relaxed dynamic programming and Q-learning • Relaxed Dynamic Programming

DocID: 1uO6O - View Document