ray/python at 319c1340cb00bca4653e4557f200658908f1cbba - ray

mirror of https://github.com/wassname/ray.git synced 2026-06-28 03:02:56 +08:00

Files

T

Jones Wong 319c1340cb [rllib] Develop MARWIL (#3635 )

*  add marvil policy graph

*  fix typo

*  add offline optimizer and enable running marwil

*  fix loss function

*  add maintaining the moving average of advantage norm

*  use sync replay optimizer for unifying

*  remove offline optimizer and use sync replay optimizer

*  format by yapf

*  add imitation learning objective

*  fix according to eric's review

*  format by yapf

* revise

* add test data

* marwil

2019-01-16 19:00:43 -08:00

benchmarks

Change timeout from milliseconds to seconds in ray.wait. (#3706 )

2019-01-08 21:32:08 -08:00

ray

[rllib] Develop MARWIL (#3635 )

2019-01-16 19:00:43 -08:00

asv.conf.json

[asv] Pushing to s3 (#2246 )

2018-06-20 10:43:44 -07:00

build-wheel-macos.sh

Fix pyarrow version (#3760 )