First Page | Document Content | |
---|---|---|
Date: 2012-12-22 10:14:51SARSA Markov decision process Fisher information Mountain Car Statistics Reinforcement learning Q-learning | Relative Entropy Policy Search Jan Peters, Katharina M¨ ulling, Yasemin AltunAdd to Reading ListSource URL: www.ias.informatik.tu-darmstadt.deDownload Document from Source WebsiteFile Size: 440,74 KBShare Document on Facebook |