An Unbiased Offline Evaluation of Contextual Bandit Algorithms with Generalized Linear Models

JMLR: Workshop and Conference Proceedings vol–36, On-line Trading of Exploration and Exploitation 2

Date: 2012-05-02 03:57:00

Topics: Machine learning, Artificial intelligence, Learning, Cognition, Markov models, Multi-armed bandit, Stochastic optimization, Reinforcement learning, Algorithm, Stability, Recommender system, Greedy algorithm

Source URL: jmlr.org

File Size: 350,06 KB