Files
TIR_PROJ/training/runs/expA_fresh2.log
T
Johnny Fernandes 75d030cb49 Test25_1800
2026-04-25 17:00:19 +00:00

36 lines
2.2 KiB
Plaintext

Config: {'W_PER_SHEEP': 1.0, 'W_ALIGN': 0.0, 'W_PEN_BONUS': 5.0, 'W_STEP_COST': 0.02, 'W_COMPLETE': 200.0, 'W_COMPACT': 1.5, 'ALIGN_SHAPE': 'standoff', 'ALIGN_GATED': False, 'ent_coef': 0.02}
Run dir: runs/expA_fresh2
Curriculum: 2 → 2 sheep, 2,000,000 steps/stage
[Stage n_sheep=2] training 2,000,000 steps
... [trial 1 | 2 sheep | 100,000 steps | ret(last 50)=-13.44 sr=0%]
... [trial 1 | 2 sheep | 200,000 steps | ret(last 50)=-14.60 sr=0%]
... [trial 1 | 2 sheep | 300,000 steps | ret(last 50)=-17.36 sr=0%]
... [trial 1 | 2 sheep | 400,000 steps | ret(last 50)=-17.36 sr=0%]
... [trial 1 | 2 sheep | 500,000 steps | ret(last 50)=-17.92 sr=0%]
... [trial 1 | 2 sheep | 600,000 steps | ret(last 50)=-15.65 sr=0%]
... [trial 1 | 2 sheep | 700,000 steps | ret(last 50)=-17.69 sr=2%]
... [trial 1 | 2 sheep | 800,000 steps | ret(last 50)=-14.61 sr=2%]
... [trial 1 | 2 sheep | 900,000 steps | ret(last 50)=-17.36 sr=0%]
... [trial 1 | 2 sheep | 1,000,000 steps | ret(last 50)=-17.44 sr=0%]
... [trial 1 | 2 sheep | 1,100,000 steps | ret(last 50)=-15.91 sr=2%]
... [trial 1 | 2 sheep | 1,200,000 steps | ret(last 50)=-16.08 sr=0%]
... [trial 1 | 2 sheep | 1,300,000 steps | ret(last 50)=-14.34 sr=0%]
... [trial 1 | 2 sheep | 1,400,000 steps | ret(last 50)=-17.00 sr=2%]
... [trial 1 | 2 sheep | 1,500,000 steps | ret(last 50)=-18.52 sr=0%]
... [trial 1 | 2 sheep | 1,600,000 steps | ret(last 50)=-16.68 sr=0%]
... [trial 1 | 2 sheep | 1,700,000 steps | ret(last 50)=-17.52 sr=0%]
... [trial 1 | 2 sheep | 1,800,000 steps | ret(last 50)=-17.33 sr=0%]
... [trial 1 | 2 sheep | 1,900,000 steps | ret(last 50)=-14.96 sr=2%]
... [trial 1 | 2 sheep | 2,000,000 steps | ret(last 50)=-15.59 sr=0%]
[Stage n_sheep=2] evaluating 30 eps
[Stage n_sheep=2] sr=0% mean_len=1500 mean_min_pen=13.2m mean_act=0.96
============================================================
REPLAY SUMMARY
============================================================
n_sheep=2 sr= 0% len= 1500 min_pen= 13.2m act=0.96
Total time: 10.7 min
Artefacts: runs/expA_fresh2/