Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling - Reinforcement theory - Document - PDFSEARCH.IO - Document Search Engine

Back to Results

First Page	Meta Content
	Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling Add to Reading List Document Date: 2008-12-01 11:15:56 Open Document File Size: 191,27 KB Share Result on Facebook City Cambridge / Bonn / / Company Wellman / Y. & Kaelbling L. P. / / Country Germany / / / Facility Massachusetts Institute of Technology / / IndustryTerm multi-agent learning algorithms / hedged learning algorithms / pure hedging algorithms / regret-minimizing algorithm / regret-minimizing algorithms / individual learning algorithm / minimization algorithms / myopic algorithm / regret-minimization algorithm / online learning researchers / chosen learning algorithms / overall algorithm / opponent-independent learning algorithm / toplevel hedging algorithm / experts algorithms / learning algorithm / learning algorithms / / Organization Massachusetts Institute of Technology / PSD(Ai ) / / Person Switches Strategies / Reinhard Selton / / Position author / single player / Always-Cooperate player / player / / ProvinceOrState Massachusetts / / PublishedMedium Games and Economic Behavior / Machine Learning / Journal of Economic Dynamics and Control / / Technology learning algorithm / pure hedging algorithms / hedging learning algorithms / minimization algorithms / regret-minimizing algorithm / second-level hedging algorithm / 7 The Hierarchical Hedging algorithm / overall algorithm / artificial intelligence / experts algorithms / hedging algorithm / learning algorithms / regret-minimizing algorithms / two learning algorithms / hedged learning algorithms / regret-minimization algorithm / toplevel hedging algorithm / Machine Learning / individual learning algorithm / chosen learning algorithms / myopic algorithm / multi-agent learning algorithms / EXP3 algorithm / playing using some regret-minimizing algorithm / / SocialTag Game theory Cybernetics Machine learning Search algorithms Learning Reinforcement learning Markov decision process Multi-armed bandit Algorithm Statistics