Dynamic programming / Stochastic control / Estimation theory / Partially observable Markov decision process / Markov decision process / Apprenticeship learning / Reinforcement learning / Conjugate prior / Automated planning and scheduling / Statistics / Markov processes / Markov models


Apprenticeship Learning for Model Parameters of Partially Observable Environments
Takaki Makino [removed] Institute of Industrial Science, the University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo[removed]Ja

Document Date: 2012-06-07 13:20:26



File Size: 137,88 KB


City

Tokyo / Cambridge / Edinburgh

Company

AUAI Press / Kaelbling L. P. / Neural Information Processing Systems / Neural Networks / Autonomous Systems / MIT Press / Honda Research Institute Japan Co. Ltd. / Spiegelhalter / Thomson / ACM Press / Monte Carlo

Country

United Kingdom / Scotland

Facility

Institute of Industrial Science / NLopt library / University of Tokyo

IndustryTerm

optimization algorithm / inner product / forward algorithm / local search / search algorithms / proposed model-parameter apprenticeship learning algorithms / straightforward estimation algorithms / good solution / sequence processing / efficient algorithms / learning algorithms

Organization

Artificial Intelligence / MIT / Institute of Industrial Science / University of Tokyo / Council for Science and Technology Policy / Japan Society for the Promotion of Science

Person

Williams

Position

environment model / such as fully-observable MDPs / Teller / rt / reward rt / author / Singer / controller / controller for the task

Product

Honcho

ProgrammingLanguage

D / L

ProvinceOrState

POMDP / Massachusetts

PublishedMedium

Computational Linguistics / Machine Learning / Journal of Chemical Physics / Journal of Machine Learning Research

Technology

COBYLA algorithm / D. This algorithm / efficient algorithms / M. J. D. Direct search algorithms / proposed MAP algorithm / proposed algorithms / Machine Learning / optimization algorithm / two straightforward estimation algorithms / Metropolis algorithm / proposed model-parameter apprenticeship learning algorithms / sampling algorithm / pruning algorithm / voice recognition / artificial intelligence / end loop Algorithm / learning algorithms / BOSS algorithm / SARSOP algorithm / simulation
