<--- Back to Details
First PageDocument Content
Artificial intelligence / Machine learning algorithms / Statistical theory / Probability and statistics / Reinforcement learning / Q-learning / Partially observable Markov decision process / Entropy / OpenAI / Cross entropy / DQN
Date: 2016-12-09 10:01:25
Artificial intelligence
Machine learning algorithms
Statistical theory
Probability and statistics
Reinforcement learning
Q-learning
Partially observable Markov decision process
Entropy
OpenAI
Cross entropy
DQN

The Nuts and Bolts of Deep RL Research John Schulman December 9th, 2016 Outline

Add to Reading List

Source URL: rll.berkeley.edu

Download Document from Source Website

File Size: 311,09 KB

Share Document on Facebook

Similar Documents

On First-Order Meta-Learning Algorithms  arXiv:1803.02999v3 [cs.LG] 22 Oct 2018 Alex Nichol and Joshua Achiam and John Schulman OpenAI

On First-Order Meta-Learning Algorithms arXiv:1803.02999v3 [cs.LG] 22 Oct 2018 Alex Nichol and Joshua Achiam and John Schulman OpenAI

DocID: 1xUe9 - View Document

Clara: Generating Polyphonic and Multi-Instrument Music Using an AWD-LSTM Architecture Christine Payne, OpenAI Scholars Program August 29, 2018  Clara is an LSTM that composes piano music and chamber music. It

Clara: Generating Polyphonic and Multi-Instrument Music Using an AWD-LSTM Architecture Christine Payne, OpenAI Scholars Program August 29, 2018 Clara is an LSTM that composes piano music and chamber music. It

DocID: 1xTDI - View Document

OpenAI Five Model ArchitecturePlayer 5 Player 4 Player 3 Player 2

OpenAI Five Model ArchitecturePlayer 5 Player 4 Player 3 Player 2

DocID: 1xTBM - View Document

Written Testimony of Jack Clark Strategy and Communications Director OpenAI House of Representatives Oversight & Government Reform Committee Subcommittee on Information Technology Hearing on

Written Testimony of Jack Clark Strategy and Communications Director OpenAI House of Representatives Oversight & Government Reform Committee Subcommittee on Information Technology Hearing on

DocID: 1uYQd - View Document

Gotta Learn Fast: A New Benchmark for Generalization in RL Alex Nichol, Vicki Pfau, Christopher Hesse, Oleg Klimov, John Schulman OpenAI {alex, vickipfau, csh, oleg, joschu}@openai.com Abstract

Gotta Learn Fast: A New Benchmark for Generalization in RL Alex Nichol, Vicki Pfau, Christopher Hesse, Oleg Klimov, John Schulman OpenAI {alex, vickipfau, csh, oleg, joschu}@openai.com Abstract

DocID: 1uAvh - View Document