First Page | Document Content | |
---|---|---|
Date: 2011-03-31 14:42:20; Machine learning, Multi-armed bandit, Stochastic optimization, Decision theory, Gittins index, Reinforcement learning, Bandit, Kullback–Leibler divergence, Probability distribution, Statistics, Design of experiments, Statistical theory | A modern Bayesian look at the multiarmed bandit. Source URL: www.economics.uci.edu. File Size: 791,23 KB |
doi:j.spa (DocID: 1qS7d) | |
Multi-Armed Bandit Models for 2D Grasp Planning with Uncertainty. Michael Laskey, Jeff Mahler, Zoe McCarthy, Florian T. Pokorny, Sachin Patil, Jur van den Berg, Danica Kragic, Pieter Abbeel, Ken Goldberg (DocID: 1nqXw) | |
Aucun titre de diapositive [No slide title] (DocID: 1akdq) | |
Optimal Policy for Multi-Class Scheduling in a Single Server Queue. Natalia Osipova, Urtzi Ayesta (DocID: 1affk) | |
Conditional sojourn time of optimal scheduling policy in a multi-class single-server queue. K.E. Avrachenkov, U. Ayesta, N. Osipova (DocID: 1abfU) | |