Back to Results
First PageMeta Content
Game theory / Cybernetics / Machine learning / Search algorithms / Learning / Reinforcement learning / Markov decision process / Multi-armed bandit / Algorithm / Statistics / Mathematics / Applied mathematics


Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling
Add to Reading List

Document Date: 2008-12-01 11:15:56


Open Document

File Size: 191,27 KB

Share Result on Facebook

City

Cambridge / Bonn / /

Company

Wellman / Y. & Kaelbling L. P. / /

Country

Germany / /

/

Facility

Massachusetts Institute of Technology / /

IndustryTerm

multi-agent learning algorithms / hedged learning algorithms / pure hedging algorithms / regret-minimizing algorithm / regret-minimizing algorithms / individual learning algorithm / minimization algorithms / myopic algorithm / regret-minimization algorithm / online learning researchers / chosen learning algorithms / overall algorithm / opponent-independent learning algorithm / toplevel hedging algorithm / experts algorithms / learning algorithm / learning algorithms / /

Organization

Massachusetts Institute of Technology / PSD(Ai ) / /

Person

Switches Strategies / Reinhard Selton / /

Position

author / single player / Always-Cooperate player / player / /

ProvinceOrState

Massachusetts / /

PublishedMedium

Games and Economic Behavior / Machine Learning / Journal of Economic Dynamics and Control / /

Technology

learning algorithm / pure hedging algorithms / hedging learning algorithms / minimization algorithms / regret-minimizing algorithm / second-level hedging algorithm / 7 The Hierarchical Hedging algorithm / overall algorithm / artificial intelligence / experts algorithms / hedging algorithm / learning algorithms / regret-minimizing algorithms / two learning algorithms / hedged learning algorithms / regret-minimization algorithm / toplevel hedging algorithm / Machine Learning / individual learning algorithm / chosen learning algorithms / myopic algorithm / multi-agent learning algorithms / EXP3 algorithm / playing using some regret-minimizing algorithm / /

SocialTag