Back to Results
First PageMeta Content
Markov decision process / Maximum likelihood / Statistics / Statistical theory / Reinforcement learning


Reinforcement Learning for Mapping Instructions to Actions S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology {b
Add to Reading List

Document Date: 2009-05-20 11:08:30


Open Document

File Size: 610,80 KB

Share Result on Facebook

Company

MIT Press / Microsoft / VMware / Artificial Intelligence Laboratory / /

/

Facility

University of Massachusetts Amherst / Artificial Intelligence Laboratory Massachusetts Institute of Technology / /

IndustryTerm

click internet options / policy gradient algorithm / dialogue management / dialogue systems / closed form solution / policy gradient algorithms / spoken dialogue systems / software tasks / distinct applications / /

OperatingSystem

Windows 2000 / Microsoft Windows / /

Organization

National Science Foundation / University of Massachusetts Amherst / Massachusetts Institute of Technology / /

Person

Amir Globerson / Michael S. Kearns / Joelle Pineau / Deb K. Roy / John D. Lafferty / Steve Young / Andrew G. Barto / Shankar Sastry / Terry Winograd / Marilyn A. Walker / Raymond J. Mooney / Satinder P. Singh / Andrew Y. Ng / Barbara Di Eugenio / H. Jin Kim / Michael I. Jordan / Stephen Della Pietra / Leslie Pack Kaelbling / Konrad Scheffler / David L. Chen / James Timothy Oates / Michael Fleischman / Richard S. Sutton / Kobus Barnard / J. Della Pietra / Alex P. Pentland / Michael J. Kearns / Sebastian Thrun / Dina Katabi / Fernando Pereira / Jeffrey Mark Siskind / Vincent J. Della / Chen Yu / Deb Roy / Yishay Mansour / Andrew McCallum / Della Pietra / Barnard / Nicholas Roy / Satinder Singh / David McAllester / David A. Forsyth / Martin Rinard / Diane J. Litman / Dana H. Ballard / Luke S. Zettlemoyer / Regina Barzilay / /

Position

log-linear model for action selection / /

Product

Win32 application programming interface / Win32 / /

ProgrammingLanguage

ML / /

Technology

policy gradient algorithms / virtual machine / policy gradient algorithm / simulation / operating system / operating systems / Flash / /

URL

support.microsoft.com / http /

SocialTag