Config: {'W_PER_SHEEP': 1.0, 'W_ALIGN': 0.0, 'W_PEN_BONUS': 5.0, 'W_STEP_COST': 0.02, 'W_COMPLETE': 200.0, 'W_COMPACT': 1.5, 'ALIGN_SHAPE': 'standoff', 'ALIGN_GATED': False, 'ent_coef': 0.02}
Run dir: runs/expA_fresh2
Curriculum: 2 → 2 sheep, 2,000,000 steps/stage

[Stage n_sheep=2] training 2,000,000 steps
... [trial 1 | 2 sheep | 100,000 steps | ret(last 50)=-13.44 sr=0%]
... [trial 1 | 2 sheep | 200,000 steps | ret(last 50)=-14.60 sr=0%]
... [trial 1 | 2 sheep | 300,000 steps | ret(last 50)=-17.36 sr=0%]
... [trial 1 | 2 sheep | 400,000 steps | ret(last 50)=-17.36 sr=0%]
... [trial 1 | 2 sheep | 500,000 steps | ret(last 50)=-17.92 sr=0%]
... [trial 1 | 2 sheep | 600,000 steps | ret(last 50)=-15.65 sr=0%]
... [trial 1 | 2 sheep | 700,000 steps | ret(last 50)=-17.69 sr=2%]
... [trial 1 | 2 sheep | 800,000 steps | ret(last 50)=-14.61 sr=2%]
... [trial 1 | 2 sheep | 900,000 steps | ret(last 50)=-17.36 sr=0%]
... [trial 1 | 2 sheep | 1,000,000 steps | ret(last 50)=-17.44 sr=0%]
... [trial 1 | 2 sheep | 1,100,000 steps | ret(last 50)=-15.91 sr=2%]
... [trial 1 | 2 sheep | 1,200,000 steps | ret(last 50)=-16.08 sr=0%]
... [trial 1 | 2 sheep | 1,300,000 steps | ret(last 50)=-14.34 sr=0%]
... [trial 1 | 2 sheep | 1,400,000 steps | ret(last 50)=-17.00 sr=2%]
... [trial 1 | 2 sheep | 1,500,000 steps | ret(last 50)=-18.52 sr=0%]
... [trial 1 | 2 sheep | 1,600,000 steps | ret(last 50)=-16.68 sr=0%]
... [trial 1 | 2 sheep | 1,700,000 steps | ret(last 50)=-17.52 sr=0%]
... [trial 1 | 2 sheep | 1,800,000 steps | ret(last 50)=-17.33 sr=0%]
... [trial 1 | 2 sheep | 1,900,000 steps | ret(last 50)=-14.96 sr=2%]
... [trial 1 | 2 sheep | 2,000,000 steps | ret(last 50)=-15.59 sr=0%]
[Stage n_sheep=2] evaluating 30 eps
[Stage n_sheep=2] sr=0% mean_len=1500 mean_min_pen=13.2m mean_act=0.96

============================================================
REPLAY SUMMARY
============================================================
n_sheep=2 sr= 0% len= 1500 min_pen= 13.2m act=0.96

Total time: 10.7 min
Artefacts: runs/expA_fresh2/