ray/examples at 54925996caed02fbde68867db903bc3c13bcc5eb - ray

mirror of https://github.com/wassname/ray.git synced 2026-06-29 13:49:45 +08:00

Eric Liang 4374ad1453 Policy gradient example: Support multi-GPU training (#584)

* add tf metrics

* comments

* fix network scopes

* add doc

* initial work

* try with 3 virtual cpus

* clean up metrics

* use format string

* fix trace level

* back to pong

* always run summary on cpu

* plot intermediate and final sgd stats

* add back a global step

* update

* add timeline

* use staging area and reuse weights properly

* stage at cpu

* whoops, stage only the batch

* clean up a bit

* fix py flake

* wip

* create an optimizer graph per device

* print timeline on 5th batch instead

* print examples per second

* log placement for training ops

* force placement on cpu:0

* try separating weights onto different gpus

* try using nccl

* add cpu fallback

* remove space from date

* check has gpu device

* fix flag config

* checkpoint

* wip

* update

* add some timing

* trace loading

* try cpu

* revert that

* remove expensive test

* lint

* cleanups

* clean up timers

* clean it up a bit

* fix code for non-scalar action spaces

* address some nits

* fix quotes

* efficient shuffling between sgd epochs

2017-06-13 06:03:25 +00:00

a3c

Make example applications pep8 compliant. (#553)

2017-05-16 14:12:18 -07:00

evolution_strategies

Save policies for Evolution Strategies (#638)

2017-06-04 16:21:19 -07:00

hyperopt

Fix Python 2 bug in hyperopt example. (#575)

2017-05-19 16:12:13 -07:00

lbfgs

Make example applications pep8 compliant. (#553)