wassname/ray

mirror of https://github.com/wassname/ray.git synced 2026-07-02 14:14:56 +08:00

Author	SHA1	Message	Date
architkulkarni	6ae9e76b81	[RLlib] Fix seeding issue (#10589)	2020-09-04 17:17:53 -07:00
Sven Mika	ef18893fb5	[RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)	2020-09-02 14:03:01 +02:00
Sven Mika	4b10bdf8fc	[RLlib] rollout.py - Add multi-agent test case. (#9981)	2020-08-10 19:44:23 +02:00
Barak Michener	8e76796fd0	ci: Redo `format.sh --all` script & backfill lint fixes (#9956)	2020-08-07 16:49:49 -07:00
Sven Mika	e540e425e4	[RLlib] `rllib rollout` test and bug fixes. (#9779)	2020-07-30 16:17:03 +02:00
Sven Mika	e6ea33a03c	[RLlib] Enhance reward clipping test; add action_clipping tests. (#9684)	2020-07-28 10:44:54 +02:00
Sven Mika	5dc4b6686e	[RLlib] Implement DQN PyTorch distributional head. (#9589)	2020-07-25 09:29:24 +02:00
Sven Mika	617eb8f279	[RLlib] Issue 9402 MARWIL producing nan rewards. (#9429)	2020-07-14 05:07:16 +02:00
Sven Mika	fcdf410ae1	[RLlib] Tf2.x native. (#8752)	2020-07-11 22:06:35 +02:00
Sven Mika	4da0e542d5	[RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204)	2020-07-08 16:12:20 +02:00
Benjamin Black	1425cdf834	Pettingzoo environment support (#9271 ) * added pettingzoo wrapper env and example * added docs, examples for pettingzoo env support * fixed pettingzoo env flake8, added test * fixed pettingzoo env import * fixed pettingzoo env import * fixed pettingzoo import issue * fixed pettingzoo test * fixed linting problem * fixed bad quotes * future proofed pettingzoo dependency * fixed ray init in pettingzoo env * lint * manual lint Co-authored-by: Eric Liang <ekhliang@gmail.com>	2020-07-06 21:32:26 -07:00
Sven Mika	f43d934817	[RLlib] Type annotations for policy. (#9248)	2020-07-05 13:09:51 +02:00
Sven Mika	5b2a97597b	[RLlib] Retire `try_import_tree` (should be installed along with other requirements). (#9211) - Retire try_import_tree. - Stabilize test_supported_multi_agent.py.	2020-07-02 13:06:34 +02:00
Sven Mika	43043ee4d5	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 ) * WIP. * Fixes. * LINT. * WIP. * WIP. * Fixes. * Fixes. * Fixes. * Fixes. * WIP. * Fixes. * Test * Fix. * Fixes and LINT. * Fixes and LINT. * LINT.	2020-06-30 10:13:20 +02:00
Sven Mika	0d37103f84	[RLlib] Prototype: Model Trajectory View API, part 0 (#9171)	2020-06-30 05:33:19 +02:00
Sven Mika	4fd8977eaf	[RLlib] Minor cleanup in preparation to tf2.x support. (#9130 ) * WIP. * Fixes. * LINT. * Fixes. * Fixes and LINT. * WIP.	2020-06-25 19:01:32 +02:00
Sven Mika	2589309cf0	[RLlib] Make sure torch and tf behave the same wrt conv2d nets. (#8785)	2020-06-20 00:05:19 +02:00
Sven Mika	7008902cff	[RLlib] Minor `rllib.utils` cleanup. (#8932)	2020-06-16 08:52:20 +02:00
Sven Mika	0c7764b010	Issue 8919 checkpoint at end ignored (#8933)	2020-06-16 08:51:20 +02:00
Sven Mika	bdf1404a5f	[RLlib] Issue 8714: QMIX init error w/ tuple obs space. (#8936)	2020-06-16 08:50:53 +02:00
Sven Mika	4ed796a7d6	[RLlib] Add testing `Policy.compute_single_action()` for all agents. (#8903)	2020-06-13 17:51:50 +02:00
Eric Liang	34bae27ac7	[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893)	2020-06-12 20:17:27 -07:00
Sven Mika	0ba7472da9	[Testing] Fix LINT/sphinx errors. (#8874)	2020-06-10 15:41:59 +02:00
Eric Liang	be26a7b1b0	[rllib] Support for complex / variable-length observation spaces (#8393)	2020-06-06 12:22:19 +02:00
Sven Mika	25c0974543	[RLlib] Issue 8412 (Adam vars not stored in ModelV2). (#8480)	2020-06-05 21:07:02 +02:00
Sven Mika	c74dc58f8b	[RLlib] Fix `use_lstm` flag for ModelV2 (w/o ModelV1 wrapping) and add it for PyTorch. (#8734)	2020-06-05 15:40:30 +02:00
Sven Mika	97d524c075	[RLlib] Issue 8769 broken OOM tests_dir cases (R & S). (#8770)	2020-06-05 08:34:21 +02:00
Victor Le	aee01133cd	Fix dict/tuple hybrid action space for tensorflow eager execution (#8781)	2020-06-04 13:28:46 -07:00
Sven Mika	d8a081a185	[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)	2020-05-30 22:48:34 +02:00
Sven Mika	d483ed28ba	[RLlib] Fix broken tune tests in master due to framework=auto errors. (#8672)	2020-05-29 11:55:47 +02:00
Sven Mika	2746fc0476	[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)	2020-05-27 16:19:13 +02:00
Sven Mika	0422e9c5a8	[RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624)	2020-05-27 10:19:47 +02:00
Sven Mika	baa053496a	[RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414)	2020-05-26 11:10:27 +02:00
Sven Mika	8870270164	[RLlib] Add QMIX support for complex obs spaces (Issue 8523). (#8533)	2020-05-22 10:17:51 +02:00
Eric Liang	9a83908c46	[rllib] Deprecate policy optimizers (#8345)	2020-05-21 10:16:18 -07:00
Eric Liang	aa7a58e92f	[rllib] Support training intensity for dqn / apex (#8396)	2020-05-20 11:22:30 -07:00
Sven Mika	796a834c48	[RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371)	2020-05-18 17:26:40 +02:00
Sven Mika	57544b1ff9	[RLlib] Examples folder restructuring (Model examples; final part). (#8278 ) - This PR completes any previously missing PyTorch Model counterparts to TFModels in examples/models. - It also makes sure, all example scripts in the rllib/examples folder are tested for both frameworks and learn the given task (this is often currently not checked) using a --as-test flag in connection with a --stop-reward.	2020-05-12 08:23:10 +02:00
Eric Liang	9d012626e5	[rllib] Distributed exec workflow for impala (#8321)	2020-05-11 20:24:43 -07:00
Eric Liang	9f04a65922	[rllib] Add PPO+DQN two trainer multiagent workflow example (#8334)	2020-05-07 23:40:29 -07:00
Sven Mika	d7eaacb5fe	[RLlib] Issue 8319 DDPG (MA or num_envs_per_worker > 1) broken. (#8324)	2020-05-08 08:26:32 +02:00
Eric Liang	b14cc16616	[rllib] Enable functional execution workflow API by default (#8221)	2020-05-05 12:36:42 -07:00
Sven Mika	42991d723f	[RLlib] rllib/examples folder restructuring (#8250 ) Cleans up of the rllib/examples folder by moving all example Envs into rllibexamples/env (so they can be used by other scripts and tests as well).	2020-05-01 22:59:34 +02:00
Sven Mika	eea75ac623	[RLlib] Beta distribution. (#8229)	2020-04-30 11:09:33 -07:00
Eric Liang	baadbdf8d4	[rllib] Execute PPO using training workflow (#8206 ) * wip * add kl * kl * works now * doc update * reorg * add ddppo * add stats * fix fetch * comment * fix learner stat regression * test fixes * fix test	2020-04-30 01:18:09 -07:00
Sven Mika	bf25aee392	[RLlib] Deprecate all Model(v1) usage. (#8146)	2020-04-29 12:12:59 +02:00
Sven Mika	1775e89f26	[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143) Deprecate TupleActions and support arbitrarily nested action spaces. Closes issue #8143.	2020-04-28 14:59:16 +02:00
Eric Liang	2298f6fb40	[rllib] Port DQN/Ape-X to training workflow api (#8077)	2020-04-23 12:39:19 -07:00
Sven Mika	3812bfedda	[RLlib] PyTorch version of ES (Evolution Strategies). (#8104)	2020-04-20 21:47:28 +02:00
Xianyang Liu	e1d3f7eba6	[rllib]Add config for rllib to support set python environments (#8026 ) * support set extra python environments * wrap value with str * Apply suggestions from code review Co-Authored-By: Eric Liang <ekhliang@gmail.com> * addresses comments * fix lint errors * remove unrelated changes due to format.sh * remove unrelated changes due to format.sh Co-authored-by: Eric Liang <ekhliang@gmail.com>	2020-04-16 01:13:45 -07:00

107 Commits