Near-Optimal BRL using Optimistic Local Transitions Mauricio Araya-L´ opez, Vincent Thomas, Olivier Buffet maraya|vthomas|[removed] LORIA, Campus scientifique, BP 239, 54506 Vandœuvre-ls-Nancy cedex, FRANCE - Reinforcement theory - Document - PDFSEARCH.IO - Document Search Engine

Back to Results

First Page	Meta Content
	Near-Optimal BRL using Optimistic Local Transitions Mauricio Araya-L´ opez, Vincent Thomas, Olivier Buffet maraya\|vthomas\|[removed] LORIA, Campus scientifique, BP 239, 54506 Vandœuvre-ls-Nancy cedex, FRANCE Add to Reading List Document Date: 2012-06-07 13:19:58 Open Document File Size: 502,12 KB Share Result on Facebook City R. Variance / Edinburgh / / Company BP / MIT Press / K. As R / / Country United Kingdom / Scotland / / / Event Man-Made Disaster / / Facility University of Massachusetts Amherst / / IndustryTerm baseline algorithm / deterministic heuristic algorithm / belief-lookahead algorithm / good solutions / polynomial time algorithm / heuristic algorithms / exploit-like algorithm / benchmark algorithms / analytic solution / approximate solutions / learning algorithms / / Organization MIT / University of Massachusetts Amherst / U.S. Securities and Exchange Commission / / Person Thomas / V / Vincent Thomas / / Position author / model at the current time step / representative / / ProvinceOrState Alaska / Rhode Island / / PublishedMedium Machine Learning / / Technology baseline algorithm / PAC-MDP algorithms / PAC Algorithms / Optimistic BRL Algorithms / polynomial time algorithm / deterministic heuristic algorithm / good RL algorithm / benchmark algorithms / RL algorithms / Machine Learning / BRL algorithms / FDM / Bayesian RL algorithm / belief-lookahead algorithm / classic RL algorithms / Typical RL algorithms / pdf / exploit-like algorithm / / SocialTag Stochastic control Bayesian statistics Statistical theory Markov decision process Reinforcement learning Bayesian inference Machine learning Q-learning Statistics Dynamic programming