[rllib] Ape-X implementation and DQN refactor to handle replay in policy optimizer (#1604)

mirror of https://github.com/wassname/ray.git synced 2026-07-04 07:01:05 +08:00

* minimal apex checkin

* cleanup dqn options

* actor utils

* Sun Feb 25 17:39:54 PST 2018

* update

* compression refactor

* fix

* add test

* fix models

* Sun Feb 25 21:46:27 PST 2018

* Wed Feb 28 10:26:34 PST 2018

* Wed Feb 28 10:28:09 PST 2018

* Wed Feb 28 10:42:59 PST 2018

* refactor

* Wed Feb 28 11:17:19 PST 2018

* Wed Feb 28 11:42:08 PST 2018

* Wed Feb 28 11:42:13 PST 2018

* Wed Feb 28 11:59:02 PST 2018

* Wed Feb 28 11:59:58 PST 2018

* Wed Feb 28 12:00:08 PST 2018

* Wed Feb 28 12:02:19 PST 2018

* Wed Feb 28 13:44:31 PST 2018

* Wed Feb 28 17:01:20 PST 2018

* Sat Mar  3 14:55:59 PST 2018

* make optimizer construction explicit

* Sat Mar  3 18:23:08 PST 2018

* Sat Mar  3 18:24:28 PST 2018

* Sat Mar  3 18:49:28 PST 2018

* Sat Mar  3 18:50:42 PST 2018

* Sat Mar  3 18:56:10 PST 2018

This commit is contained in:

Eric Liang

2018-03-04 12:25:25 -08:00

committed by

GitHub

parent 9b33f3a7b7

commit ecb811c26e

32 changed files with 934 additions and 431 deletions

									
										python/setup.py
									
		+3
		-1
	
												View File
												
				@@ -41,7 +41,9 @@ else:

				    optional_ray_files += ray_ui_files

				extras = {

				    "rllib": ["tensorflow", "pyyaml", "gym[atari]", "opencv-python", "scipy"]

				    "rllib": [

				        "tensorflow", "pyyaml", "gym[atari]", "opencv-python",

				        "python-snappy", "scipy"]

				}