Sheep training flock of 10 fix?

2026-04-24 15:10:36 +01:00
parent fcfa2c35c8
commit bdbe8ba1de
1 changed files with 6 additions and 6 deletions
@@ -10,10 +10,10 @@ Coordinate system matches the Webots world file:
    field  : x ∈ [-15, 15],  y ∈ [-15, 15]
    pen    : x ∈ [10, 13],   y ∈ [-15, -8]   (SE corner, open north)
-Observation (13-dim, fixed regardless of n_sheep):
+Observation (16-dim, fixed regardless of n_sheep):
-    dog position (2), flock COM relative to dog (2), farthest active sheep
+    dog position (2), flock COM relative to dog (2), top-3 farthest active
-    relative to dog (2), pen relative to COM (2), pen relative to farthest
+    sheep relative to dog (6), pen relative to COM (2), pen relative to
-    sheep (2), flock radius (1), mean dispersion (1), fraction penned (1).
+    farthest sheep (2), flock radius (1), fraction penned (1).
 Permutation-invariant by design: curriculum stages share the same obs dim
 so VecNormalize statistics transfer as n_sheep advances.
@@ -72,11 +72,11 @@ class HerdingEnv(gym.Env):
        self.render_mode    = render_mode
        self.random_n_sheep = random_n_sheep   # if True, randomise n_sheep each reset
-        # Fixed 17-dim observation regardless of n_sheep:
+        # Fixed 16-dim observation regardless of n_sheep:
        #   dog_pos(2) + rel_com(2) + rel_far1(2) + rel_far2(2) + rel_far3(2)
        #   + com_to_pen(2) + far1_to_pen(2) + radius(1) + frac_penned(1)
        self.observation_space = spaces.Box(
-            low=-np.inf, high=np.inf, shape=(17,), dtype=np.float32
+            low=-np.inf, high=np.inf, shape=(16,), dtype=np.float32
        )
        # Action: desired velocity (vx, vy) ∈ [-1, 1]², scaled by DOG_SPEED