<--- Back to Details
First PageDocument Content
Survey methodology / Machine learning algorithms / Sampling techniques / Artificial intelligence / Sampling / Support vector machine / Reinforcement learning / Simple random sample / Learning / Machine learning / Cognition
Date: 2016-01-04 03:15:09
Survey methodology
Machine learning algorithms
Sampling techniques
Artificial intelligence
Sampling
Support vector machine
Reinforcement learning
Simple random sample
Learning
Machine learning
Cognition

Self-Practice Imitation Learning from Weak Policy Qing Da, Yang Yu, and Zhi-Hua Zhou National Key Laboratory for Novel Software Technology Nanjing University, Nanjing, China {daq,yuy,zhouzh}@lamda.nju.edu.cn

Add to Reading List

Source URL: cs.nju.edu.cn

Download Document from Source Website

File Size: 498,17 KB

Share Document on Facebook

Similar Documents

Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov

Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov

DocID: 1xVVh - View Document

Distributed Computing Prof. R. Wattenhofer SA/MA:  Byzantine Reinforcement Learning

Distributed Computing Prof. R. Wattenhofer SA/MA: Byzantine Reinforcement Learning

DocID: 1xVKs - View Document

Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning

Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning

DocID: 1xV3l - View Document

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

DocID: 1xUBi - View Document

Cellular	Network	Traffic	Scheduling	 using	Deep	Reinforcement	Learning Sandeep	Chinchali,	et.	al.	Marco	Pavone,	Sachin	Katti Stanford	University	 AAAI	2018

Cellular Network Traffic Scheduling using Deep Reinforcement Learning Sandeep Chinchali, et. al. Marco Pavone, Sachin Katti Stanford University AAAI 2018

DocID: 1xUAT - View Document