statistical gradient-following algorithms / local utilities / direct policy search methods / cost network / reinforcement learning algorithm / neural network / variable elimination algorithm / policy search algorithms / policy search algorithm / present several new algorithms / policy search phase / policy search / communication protocol / Linear least-squares algorithms / je computing / learning-based multi-agent systems / policy search method / direct policy search / Actor-critic algorithms / policy search methods / learning algorithm / learning algorithms / /
Organization
Lilian Boudouri Foundation / Coordinated Reinforcement Learning Carlos Guestrin Computer Science Department / Stanford University / N00014-00-1-0637 / Air Force / Michail Lagoudakis Ronald Parr Department of Computer Science / REINFORCE / Department of Defense / U.S. Securities and Exchange Commission / Duke University / Durham / /
Person
Ai / Carlos Guestrin / DVF NO COMM / /
Position
sales manager / manager in a warehouse / section manager / /
Product
B 52 / machines / dead machines / /
ProvinceOrState
North Carolina / New York / California / Massachusetts / /