Reinforcement Learning for Mapping Instructions to Actions S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology {b - Reinforcement theory - Document - PDFSEARCH.IO - Document Search Engine

Back to Results

First Page	Meta Content
	Reinforcement Learning for Mapping Instructions to Actions S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology {b Add to Reading List Document Date: 2009-05-20 11:08:30 Open Document File Size: 610,80 KB Share Result on Facebook Company MIT Press / Microsoft / VMware / Artificial Intelligence Laboratory / / / Facility University of Massachusetts Amherst / Artificial Intelligence Laboratory Massachusetts Institute of Technology / / IndustryTerm click internet options / policy gradient algorithm / dialogue management / dialogue systems / closed form solution / policy gradient algorithms / spoken dialogue systems / software tasks / distinct applications / / OperatingSystem Windows 2000 / Microsoft Windows / / Organization National Science Foundation / University of Massachusetts Amherst / Massachusetts Institute of Technology / / Person Amir Globerson / Michael S. Kearns / Joelle Pineau / Deb K. Roy / John D. Lafferty / Steve Young / Andrew G. Barto / Shankar Sastry / Terry Winograd / Marilyn A. Walker / Raymond J. Mooney / Satinder P. Singh / Andrew Y. Ng / Barbara Di Eugenio / H. Jin Kim / Michael I. Jordan / Stephen Della Pietra / Leslie Pack Kaelbling / Konrad Scheffler / David L. Chen / James Timothy Oates / Michael Fleischman / Richard S. Sutton / Kobus Barnard / J. Della Pietra / Alex P. Pentland / Michael J. Kearns / Sebastian Thrun / Dina Katabi / Fernando Pereira / Jeffrey Mark Siskind / Vincent J. Della / Chen Yu / Deb Roy / Yishay Mansour / Andrew McCallum / Della Pietra / Barnard / Nicholas Roy / Satinder Singh / David McAllester / David A. Forsyth / Martin Rinard / Diane J. Litman / Dana H. Ballard / Luke S. Zettlemoyer / Regina Barzilay / / Position log-linear model for action selection / / Product Win32 application programming interface / Win32 / / ProgrammingLanguage ML / / Technology policy gradient algorithms / virtual machine / policy gradient algorithm / simulation / operating system / operating systems / Flash / / URL support.microsoft.com / http / SocialTag Markov decision process Maximum likelihood Statistics Statistical theory Reinforcement learning