First Page | Meta Content | |
---|---|---|
![]() | Document Date: 2010-07-15 16:29:00Open Document File Size: 7,28 MBShare Result on FacebookCityEdmonton / /FacilityUniversity of Alberta Csaba Szepesvari / University of Alberta Andrew Ng / University of South Paris / University of Alberta Jonathan Schaeffer / University of Alberta Libraries / University of Alberta / SmartGo library / University of Alberta Martin M¨uller / University of Alberta Petr Musilek / University of Alberta Reinforcement Learning / /IndustryTermtree search / temporal-difference search / tree search algorithm / simulation-based search methods / default learning algorithm / explicit search tree / search space / temporaldifference search / /NaturalFeatureFuego / /OrganizationUniversity of Alberta Reinforcement Learning and Simulation-Based Search / Examining Committee / University of Alberta Libraries / Faculty of Graduate Studies and Research / Stanford University / Philosophy Department of Computing Science / Department of Computing Science / University of Alberta / University of South Paris / /PersonGerry Tesauro / Remi Munos / Monte-Carlo Tree / Carlo Tree / Leah Hackman / Markus Enzenberger / Yizao Wang / Jessica Meserve / David Silver / Anna Koop / Olivier Teytaud / Sylvain Gelly / Richard Sutton / /Positionauthor / /TechnologyAlpha / artificial intelligence / Monte-Carlo tree search algorithm / machine learning / Simulation / 41 5.5 Learning Algorithm / default learning algorithm / /SocialTag |