Self-Practice Imitation Learning from Weak Policy Qing Da, Yang Yu, and Zhi-Hua Zhou National Key Laboratory for Novel Software Technology Nanjing University, Nanjing, China {daq,yuy,zhouzh - Military patrol - Document - PDFSEARCH.IO

First Page		Document Content
Date: 2016-01-04 03:15:09 Survey methodology Machine learning algorithms Sampling techniques Artificial intelligence Sampling Support vector machine Reinforcement learning Simple random sample Learning Machine learning Cognition		Self-Practice Imitation Learning from Weak Policy Qing Da, Yang Yu, and Zhi-Hua Zhou National Key Laboratory for Novel Software Technology Nanjing University, Nanjing, China {daq,yuy,zhouzh}@lamda.nju.edu.cn Add to Reading List Source URL: cs.nju.edu.cn Download Document from Source Website File Size: 498,17 KB Share Document on Facebook

	Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov DocID: 1xVVh - View Document
	Distributed Computing Prof. R. Wattenhofer SA/MA: Byzantine Reinforcement Learning DocID: 1xVKs - View Document
	Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning DocID: 1xV3l - View Document
	Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017 DocID: 1xUBi - View Document
	Cellular Network Traffic Scheduling using Deep Reinforcement Learning Sandeep Chinchali, et. al. Marco Pavone, Sachin Katti Stanford University AAAI 2018 DocID: 1xUAT - View Document