Back to Results
First PageMeta Content
Estimation theory / Machine learning / Conditional random field / Theoretical computer science / Reinforcement learning / Loss function / Markov decision process / Rao–Blackwell theorem / Bayesian network / Statistics / Graphical models / Statistical theory


Conditional Random Fields for Multi-agent Reinforcement Learning Xinhua Zhang [removed] Douglas Aberdeen [removed]
Add to Reading List

Document Date: 2009-06-21 14:46:49


Open Document

File Size: 356,31 KB

Share Result on Facebook

City

Arlington / /

Company

AUAI Press / CRF / Monte Carlo / Bernstein / Baxter / /

/

Facility

Australian National University / /

IndustryTerm

statistical gradient-following algorithms / car waiting / distributed sensor network / policy-gradient algorithms / temporal learning algorithms / sensor networks / multiagent systems / energy level / road traffic network / inner product / online settings / policy search / simplest algorithms / travel time / policy-gradient solution / online variation / Online Natural / approximate sampling algorithms / search space / use batch training algorithms / sensor network / /

MarketIndex

SET 50 / /

Organization

Centre of Excellence / Cambridge Univ. / Australian Government / Australian National University / Canberra / NICTA Research School of Information Sciences & Engineering / /

Person

Douglas Aberdeen / Max Min / Mansour / /

Position

author / model the optimal joint policy / Online Natural Actor-Critic / Natural actor-critic / β=actor / log proba2 We use rt / model the proper joint policy / rT / =critic / critic / model control processes / controller / /

Product

CRFs / /

ProvinceOrState

Virginia / /

PublishedMedium

Machine Learning / /

Technology

approximate sampling algorithms / NAC algorithm / CRFs use batch training algorithms / simplest algorithms / RL algorithms / temporal learning algorithms / Machine Learning / statistical gradient-following algorithms / policy-gradient algorithms / /

SocialTag