Dynamic programming / Markov processes / Stochastic control / Operations research / Functions and mappings / Reinforcement learning / Q-learning / Markov decision process / Automated planning and scheduling / Statistics / Control theory / Mathematics


Coordinated Reinforcement Learning
Carlos Guestrin, Computer Science Department, Stanford University, Stanford, CA [removed]
Michail Lagoudakis and Ronald Parr, Department of Computer Science, Duke University, Durham, NC 27708

Document Date: 2012-05-29 14:42:02


File Size: 339,80 KB

City

Belmont / New York / /

Company

Siebel / /

Country

Jordan / /

Event

Product Issues / /

Facility

Duke University / Stanford University / /

IndustryTerm

statistical gradient-following algorithms / local utilities / direct policy search methods / cost network / reinforcement learning algorithm / neural network / variable elimination algorithm / policy search algorithms / policy search algorithm / present several new algorithms / policy search phase / policy search / communication protocol / Linear least-squares algorithms / je computing / learning-based multi-agent systems / policy search method / direct policy search / Actor-critic algorithms / policy search methods / learning algorithm / learning algorithms / /

Organization

Lilian Boudouri Foundation / Coordinated Reinforcement Learning Carlos Guestrin Computer Science Department / Stanford University / N00014-00-1-0637 / and Air Force / Michail Lagoudakis Ronald Parr Department of Computer Science / REIN FORCE / Department of Defense / U.S. Securities and Exchange Commission / Duke University / Durham / /

Person

Ai / Carlos Guestrin / DVF N O C OMM / /

Position

sales manager / manager in a warehouse / section manager / /

Product

B 52 / machines / dead machines / /

ProvinceOrState

North Carolina / New York / California / Massachusetts / /

SportsLeague

Stanford University / /

Technology

multiagent LSPI algorithm / learning algorithm / Linear least-squares algorithms / variable elimination algorithm / neural network / policy search algorithms / communication protocol / planning algorithm / Actor-critic algorithms / learning algorithms / Dom / statistical gradient-following algorithms / reinforcement learning algorithm / LSTD algorithm / policy search algorithm / /

SocialTag