Reinforcement learning / Computer Go / Cybernetics / Temporal difference learning / Monte Carlo method / Simulation / Heuristic / Algorithm / Machine learning / Applied mathematics / Mathematics / Science


Document Date: 2010-07-15 16:29:00



File Size: 7.28 MB


City

Edmonton

Facility

University of Alberta Csaba Szepesvari / University of Alberta Andrew Ng / University of South Paris / University of Alberta Jonathan Schaeffer / University of Alberta Libraries / University of Alberta / SmartGo library / University of Alberta Martin Müller / University of Alberta Petr Musilek / University of Alberta Reinforcement Learning

IndustryTerm

tree search / temporal-difference search / tree search algorithm / simulation-based search methods / default learning algorithm / explicit search tree / search space

NaturalFeature

Fuego

Organization

University of Alberta Reinforcement Learning and Simulation-Based Search / Examining Committee / University of Alberta Libraries / Faculty of Graduate Studies and Research / Stanford University / Philosophy Department of Computing Science / Department of Computing Science / University of Alberta / University of South Paris

Person

Gerry Tesauro / Remi Munos / Leah Hackman / Markus Enzenberger / Yizao Wang / Jessica Meserve / David Silver / Anna Koop / Olivier Teytaud / Sylvain Gelly / Richard Sutton

Position

author

Technology

Alpha / artificial intelligence / Monte-Carlo tree search algorithm / machine learning / Simulation / Learning Algorithm / default learning algorithm

SocialTag