Back to Results
First PageMeta Content
Developmental psychology / Reinforcement learning / Multi-armed bandit / Learning / Pi / Statistics / Mathematical analysis / Markov models


Learning for Contextual Bandits Alina Beygelzimer 1 John Langford
Add to Reading List

Document Date: 2010-09-23 14:42:03


Open Document

File Size: 1,02 MB

Share Result on Facebook

Company

IBM / Yahoo! / /

Person

Alina Beygelzimer / /

Position

rt / reward rt / /

Product

Research1 Yahoo! / /

SocialTag