Predictive State Temporal Difference Learning Geoffrey J. Gordon Machine Learning Department Carnegie Mellon University Pittsburgh, PA 15213 - Reinforcement theory - Document - PDFSEARCH.IO - Document Search Engine

Back to Results

First Page	Meta Content
	Predictive State Temporal Difference Learning Geoffrey J. Gordon Machine Learning Department Carnegie Mellon University Pittsburgh, PA 15213 Add to Reading List Document Date: 2011-01-19 12:04:47 Open Document File Size: 517,35 KB Share Result on Facebook City San Francisco / New York / / Company Neural Information Processing Systems / Cambridge University Press / Johns Hopkins University Press / Discrete Event Dynamic Systems / / Country United States / / / Facility Byron Boots Machine Learning Department Carnegie Mellon University / Predictive State Temporal Difference Learning Geoffrey J. Gordon Machine Learning Department Carnegie Mellon University / / IndustryTerm consistent algorithm / subspace identification algorithm / predictive compression operator / Linear least-squares algorithms / dynamical systems / model-free algorithm / approximation algorithms / spectral algorithm / temporal difference algorithms / combinatorial search / learning algorithm / learning algorithms / / MarketIndex set 220 / / Organization UCLA / Cambridge University / Byron Boots Machine Learning Department Carnegie Mellon University Pittsburgh / The Johns Hopkins University / U.S. Securities and Exchange Commission / Predictive State Temporal Difference Learning Geoffrey J. Gordon Machine Learning Department Carnegie Mellon University Pittsburgh / / Person Sajid M. Siddiqi / Michael James / Harold Hotelling / Geoffrey J. Gordon / J. Zico Kolter / Michael L. Littman / Gavin Taylor / Tsitsiklis Roy / Byron Boots / Justin A. Boyan / Van Roy / Craig Boutilier / Benjamin Van Roy / Gregory C. Reinsel / LARS-TD PSTD / Matthew Rosencrantz / Steven J. Bradtke / Daniel Hsu / Pascal Poupart / Satinder Singh / Morgan Kaufmann / John N. Tsitsiklis / Benjamin Roy / Andrew G. Barto / Lihong Li / Gene H. Golub / David Choi / Tong Zhang / Michael Littman / Andrew Y. Ng / Matthew Rudary / Sebastian Thrun / Michael R. James / Michail G. Lagoudakis / P. Van Overschee / Ton Wessling / Rajabather Palani Velu / Ronald Parr / Nikos A. Vlassis / Sham Kakade / Richard Sutton / Herbert Jaeger / Christopher Painter-Wakefield / / / Position VP / Rt / continuing forward / Pk Rt / immediate rewards Rt / / ProvinceOrState British Columbia / New York / Pennsylvania / California / / PublishedMedium Machine Learning / / Technology Linear least-squares algorithms / PSTD learning algorithm / spectral algorithm / model-free algorithm / PSTD algorithm / statistically consistent algorithm / model-based algorithm / Machine Learning / learning algorithm / 3 algorithms / artificial intelligence / temporal difference algorithms / approximation algorithms / simulation / subspace identification algorithm / / SocialTag Control theory Kalman filter Robot control Singular value decomposition Reinforcement learning Dynamical system Μ operator Normal distribution Algebra Mathematics