<--- Back to Details
First PageDocument Content
Machine learning / Multi-armed bandit / Stochastic optimization / Decision theory / Gittins index / Reinforcement learning / Bandit / Kullback–Leibler divergence / Probability distribution / Statistics / Design of experiments / Statistical theory
Date: 2011-03-31 14:42:20
Machine learning
Multi-armed bandit
Stochastic optimization
Decision theory
Gittins index
Reinforcement learning
Bandit
Kullback–Leibler divergence
Probability distribution
Statistics
Design of experiments
Statistical theory

A modern Bayesian look at the multiarmed bandit

Add to Reading List

Source URL: www.economics.uci.edu

Download Document from Source Website

File Size: 791,23 KB

Share Document on Facebook

Similar Documents

doi:j.spa

doi:j.spa

DocID: 1qS7d - View Document

Multi-Armed Bandit Models for 2D Grasp Planning with Uncertainty Michael Laskey1 , Jeff Mahler1 , Zoe McCarthy1 , Florian T. Pokorny1 , Sachin Patil1 , Jur van den Berg4 , Danica Kragic3 , Pieter Abbeel1 , Ken Goldberg2

Multi-Armed Bandit Models for 2D Grasp Planning with Uncertainty Michael Laskey1 , Jeff Mahler1 , Zoe McCarthy1 , Florian T. Pokorny1 , Sachin Patil1 , Jur van den Berg4 , Danica Kragic3 , Pieter Abbeel1 , Ken Goldberg2

DocID: 1nqXw - View Document

Aucun titre de diapositive

Aucun titre de diapositive

DocID: 1akdq - View Document

Optimal Policy for Multi-Class Scheduling in a Single Server Queue Natalia Osipova Urtzi Ayesta

Optimal Policy for Multi-Class Scheduling in a Single Server Queue Natalia Osipova Urtzi Ayesta

DocID: 1affk - View Document

Conditional sojourn time of optimal scheduling policy in a multi-class single-server queue ⋆ K.E. Avrachenkov1 , U. Ayesta2,3 , N. Osipova 1  3

Conditional sojourn time of optimal scheduling policy in a multi-class single-server queue ⋆ K.E. Avrachenkov1 , U. Ayesta2,3 , N. Osipova 1 3

DocID: 1abfU - View Document