Releases: OpenMLRL/CoMLRL

v1.3.1

30 Dec 18:20
bb0ad77

Changelog

  • Allow batch training in MAGRPOTrainer, IACTrainer and MAACTrainer
  • Allow multi-turn training in IACTrainer and MAACTrainer
  • Change the x-axis from data_step to env_step
  • Pair with LLM_Collab_Code_Generation v1.3.1

v1.3.0

20 Dec 03:01
70d9662

Changelog

Use TD loss for the critic update.
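
For readers unfamiliar with the term, a one-step TD loss for a state-value critic can be sketched as below. This is an illustrative sketch only, not CoMLRL's implementation: the function name `td_loss`, its parameters, and the default `gamma` are all assumptions made for this example.

```python
from typing import Sequence


def td_loss(
    values: Sequence[float],
    next_values: Sequence[float],
    rewards: Sequence[float],
    dones: Sequence[bool],
    gamma: float = 0.99,  # hypothetical default discount factor
) -> float:
    """Mean squared one-step TD error over a batch of transitions.

    Hypothetical helper for illustration; not part of CoMLRL.
    """
    n = len(values)
    if n == 0:
        raise ValueError("batch must not be empty")
    if not (len(next_values) == len(rewards) == len(dones) == n):
        raise ValueError("all inputs must have the same length")

    total = 0.0
    for v, v_next, r, done in zip(values, next_values, rewards, dones):
        # Bootstrapped target: r + gamma * V(s'), with no bootstrap at episode end.
        target = r + (0.0 if done else gamma * v_next)
        total += (target - v) ** 2
    return total / n
```

In a real trainer the target would normally be treated as a constant (no gradient flows through `V(s')`), and the loss would be computed on tensors rather than Python floats.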

v1.2.9

01 Dec 20:05

Changelog

Add MAAC for single-turn training.

v1.2.8

29 Nov 16:14
873a87e

Changelog

Make IAC's critic estimate the state value V rather than the action value Q.

v1.2.7

22 Nov 02:45

Changelog

Rename IPPO to IAC, since it is on-policy.

v1.2.6

15 Nov 23:19

v1.2.5

15 Nov 19:53

v1.2.4

15 Nov 19:38

v1.2.3

15 Nov 18:41

v1.2.2

15 Nov 18:34

Add a project description and update the Python requirements.