Back to Results
First PageMeta Content
Machine learning / Artificial intelligence / Learning / Cognition / Markov models / Multi-armed bandit / Stochastic optimization / Reinforcement learning / Algorithm / Stability / Recommender system / Greedy algorithm


JMLR: Workshop and Conference Proceedings vol–36 On-line Trading of Exploration and Exploitation 2 An Unbiased Offline Evaluation of Contextual Bandit Algorithms with Generalized Linear Models
Add to Reading List

Document Date: 2012-05-02 03:57:00


Open Document

File Size: 350,06 KB

Share Result on Facebook