Coordinated Reinforcement Learning Carlos Guestrin Computer Science Department, Stanford University, Stanford, CA[removed]Michail Lagoudakis Ronald Parr Department of Computer Science, Duke University, Durham, NC 27708 - Reinforcement theory - Document - PDFSEARCH.IO - Document Search Engine

Back to Results

First Page	Meta Content
	Coordinated Reinforcement Learning Carlos Guestrin Computer Science Department, Stanford University, Stanford, CA[removed]Michail Lagoudakis Ronald Parr Department of Computer Science, Duke University, Durham, NC 27708 Add to Reading List Document Date: 2012-05-29 14:42:02 Open Document File Size: 339,80 KB Share Result on Facebook City Belmont / New York / / Company Siebel / / Country Jordan / / Event Product Issues / / Facility Duke University / Stanford University / / IndustryTerm statistical gradient-following algorithms / local utilities / direct policy search methods / cost network / reinforcement learning algorithm / neural network / variable elimination algorithm / policy search algorithms / policy search algorithm / present several new algorithms / policy search phase / policy search / communication protocol / Linear least-squares algorithms / je computing / learning-based multi-agent systems / policy search method / direct policy search / Actor-critic algorithms / policy search methods / learning algorithm / learning algorithms / / Organization Lilian Boudouri Foundation / Coordinated Reinforcement Learning Carlos Guestrin Computer Science Department / Stanford University / N00014-00-1-0637 / and Air Force / Michail Lagoudakis Ronald Parr Department of Computer Science / REIN FORCE / Department of Defense / U.S. Securities and Exchange Commission / Duke University / Durham / / Person Ai / Carlos Guestrin / DVF N O C OMM / / Position sales manager / manager in a warehouse / section manager / / Product B 52 / machines / dead machines / / ProvinceOrState North Carolina / New York / California / Massachusetts / / SportsLeague Stanford University / / Technology multiagent LSPI algorithm / learning algorithm / Linear least-squares algorithms / variable elimination algorithm / neural network / policy search algorithms / communication protocol / planning algorithm / Actor-critic algorithms / learning algorithms / Dom / statistical gradient-following algorithms / reinforcement learning algorithm / LSTD algorithm / policy search algorithm / / SocialTag Dynamic programming Markov processes Stochastic control Operations research Functions and mappings Reinforcement learning Q-learning Markov decision process Automated planning and scheduling Statistics