The Nuts and Bolts of Deep RL Research
John Schulman
December 9th, 2016

Outline

Date: 2016-12-09 10:01:25
Keywords: Artificial intelligence, Machine learning algorithms, Statistical theory, Probability and statistics, Reinforcement learning, Q-learning, Partially observable Markov decision process, Entropy, OpenAI, Cross entropy, DQN
Source URL: rll.berkeley.edu
File Size: 311,09 KB
