TIR_PROJ

Author	SHA1	Message	Date
Johnny Fernandes	e86fee5ae8	Per-sheep pen-time metrics, seed support, make webots → menu * `controllers/shepherd_dog/shepherd_dog.py` - Tracks the first step at which each sheep crosses the gate; on auto-finish (all sheep penned) prints a `[results]` summary block: mode/drive/world/lidar/dogs/seed line, total simulated time, per-sheep penning order with absolute step + seconds since sim start, and the gate spread between the first and last penning. - Reads `HERDING_SEED` (env / runtime cfg) and seeds the controller's RNG when set. Empty = time-based default = old non-deterministic behaviour. * `controllers/sheep/sheep.py` reads `HERDING_SEED` the same way (loading `herding_runtime.cfg` itself so it works even when Webots strips env vars) and seeds Python's RNG XOR'd with the sheep's name hash, so a fixed seed gives a reproducible flock trajectory without all sheep starting from identical wander state. * `tools/run_webots.sh` writes `HERDING_SEED` into the runtime cfg (empty when unset so existing scripts keep their stochastic behaviour). * `tools/webots_menu.sh` gains a Seed prompt (random / fixed integer); the launch summary box shows the choice next to the perception row. * `Makefile` - `make webots` now fires the interactive picker (replacing the old positional invocation). - `make webots_quick MODE=… DRIVE=… WORLD=… N=…` is the old positional path, kept for batch / scripted use. Smoke-tested: menu renders Mode → Drive → World → LiDAR → Dogs → Sheep → Perception → Seed → Headless prompts and shows the selected Seed value in the launch summary. 126 pytest cases still pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-17 10:33:34 +00:00
Johnny Fernandes	cfbf4a0267	Dual-shepherd axis-split (HERDING_NDOGS=2) The launcher can now spawn two `ShepherdDog` robots, each masked to a single axis of motion, so the herding workload is split orthogonally. Mechanic: * `HERDING_NDOGS=2` (default 1) tells `tools/run_webots.sh` to replace the single-dog node in the generated test world with two copies: - `ShepherdDogX` at (-4, -10), `customData "axis=x"` - `ShepherdDogY` at (+4, -10), `customData "axis=y"` Each spawn position sits south of the field interior so the pair doesn't collide with starting sheep. * `controllers/shepherd_dog/shepherd_dog.py` reads `getCustomData()` at startup; when `axis=x\|y` it zeroes the off-axis component of every action after speed modulation and before EMA smoothing. With `customData` empty the controller behaves identically to single-dog mode, so all existing launches are unaffected. * The dog's emitter line now carries the robot's name (`dog:ShepherdDogX:x:y`), and `controllers/sheep/sheep.py` keeps a `dogs` dict keyed by name, picking the closest one each step for its flee target. Single-dog runs still use the legacy two-field `dog:x:y` format thanks to a length check. * `HERDING_NDOGS` is written into `herding_runtime.cfg` and exported to subprocesses so future tooling can read it. Verified behaviour in Webots smoke tests (HERDING_NDOGS=2, strombom, diff/field, 5 sheep): both dogs spawn with the expected names and axis tags, the dual-dog status print appears, each dog acts only on its assigned axis early in the trial, and the masking is internally consistent. The pair stalls before penning under pure axis-split because each dog reaches its drive standoff and then has only one degree of freedom — useful research finding for the write-up; coordination strategy (shared CoM, role-switching, etc.) is future work. 126 pytest cases still pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-17 02:35:38 +00:00
Johnny Fernandes	d00da52c3c	Portable Python env + 360° LiDAR ablation flag Two small features. (1) Portable interpreter * `tools/setup_env.sh` exports HERDING_PYTHON (default points to the project's conda env; override in your shell to retarget). * Both `controllers//runtime.ini` files now use Webots' env-var expansion: `COMMAND = $(HERDING_PYTHON)` so the Webots-launched controllers pick up the same interpreter as the bash scripts. `tools/run_webots.sh`, `tools/webots_sweep{,_gt}.sh` and `tools/calibrate_mecanum.sh` all source `setup_env.sh` at the top instead of hard-coding `/home/jalf/miniconda3/envs/tir/bin`. The hard-coded conda path is now exactly one line in `setup_env.sh`'s fallback default — a single place to edit on a new machine, or override-once via `export HERDING_PYTHON=...`. (2) 360° LiDAR FOV ablation * New `LIDAR_WEBOTS_360` preset matches the existing `protos/ShepherdDog360.proto` (360 rays / 2π FOV / 15 m range). * `tools/run_webots.sh` reads `HERDING_LIDAR=140\|360` and swaps the diff-drive proto accordingly (mecanum keeps 140° — the ShepherdDogMecanum proto has its own LiDAR section). The variant is written into `herding_runtime.cfg` so the controller can read it even when Webots strips env vars. * `controllers/shepherd_dog/shepherd_dog.py` picks the matching `lidar_cfg` (`HERDING_WEBOTS.lidar` for 140°, `LIDAR_WEBOTS_360` otherwise) and feeds it to `detections_from_scan` so the perception pipeline interprets ray angles + max range correctly. Smoke test: `HERDING_LIDAR=360 tools/run_webots.sh 5 strombom differential field` launches with `ShepherdDog360.proto`, the controller logs the new mode/drive/world line, and the dog is penning sheep through 360° perception (4/5 at step 19200 before I killed the test). No retraining required because the gym already trains under `LIDAR_FULL` (360° preset). 126 pytest cases still pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-17 02:19:15 +00:00
Johnny Fernandes	7ab69ab0f3	Rename multi-segment functions to two-concept names; polish docstrings Naming pass: rename functions whose third+ segment is redundant or implementation-detail, sticking to the codebase's preferred ``noun_verb`` / ``verb_noun`` two-concept idiom. Renames are atomic across definitions, callers, and tests. is_penned_position → is_penned modulate_speed_near_sheep → modulate_speed mecanum_kinematics_step → mecanum_step policy_forward_mean → forward_mean Two-concept patterns like ``velocity_to_wheels`` / ``detections_from_scan`` / ``make_strombom_predictor`` are left alone — they're idiomatic converters / factories that read as a single concept, and the longer form aids grep-ability. Docstring polish: * ``herding/config.py`` header drops the "previously lived as a module-level literal" historical framing — we ship as a single thing, so the refactor anecdote no longer earns its keep. The usage examples now mention both ``HERDING_WEBOTS`` and ``HERDING_MEC_WEBOTS`` presets. 126 pytest cases still pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-17 01:58:15 +00:00
Johnny Fernandes	dd5ac669e5	Webots sim-to-real fixes, DAgger pipeline, 360° proto variant Today's session worked across the full Webots delivery stack — found and fixed a cluster of bugs blocking the BC/RL transfer, then explored training-side mitigations for the residual perception gap. Bug fixes: - Makefile FP_RATE default 2.0 → 0.0: BC demos used fp_rate=0 but RL fine-tune defaulted to fp_rate=2, poisoning the BC obs distribution and stalling PPO at 0% success across 1.46M+ steps. - controllers/{shepherd_dog,sheep}/runtime.ini: Webots was launching controllers under system python3 (no numpy) and they were crashing silently. Pinned to the conda tir env. - herding/config.py HERDING_WEBOTS preset: pen_latch_depth 0.5 → 2.0, max_new_tracks_per_step 3 → 1, static_reject 0.8 → 1.2. Stops phantom FPs near the gate from latching as permanently-penned tracks. - herding/perception/sheep_tracker.py: penned tracks now decay at forget_steps × 8 instead of living forever. Adds get_positions min_freshness filter for deploy-time use. Training/eval matches deployment: - training/bc/collect.py: --dagger-policy flag for DAgger rollouts (policy drives, teacher labels) + --use-webots-preset for matched 140° tracker + DR config. - controllers/shepherd_dog/shepherd_dog.py: scan-fallback (0, 0.6) when BC/RL sees empty sheep_positions — recovers from FOV gaps. Tooling: - tools/dagger_round.sh: one-shot DAgger round (collect + concat + bc). - tools/webots_sweep_gt.sh: full sweep with HERDING_USE_GT=1 for the perception-gap diagnosis matrix. - protos/ShepherdDog360.proto: 360° FOV variant for the FOV-ablation comparison. Canonical proto stays at 140° per project spec. Artifacts: v1 BC/RL policies for all 4 (drive × world) combos trained in clean gym (success: diff/field 90-100%, diff/round 58%, mec/field 60-100%, mec/round 50-100%). DAgger r1/r2 BCs for diff/field show 12%→38% progression on gym HERDING_WEBOTS proxy but did not close to actual Webots LiDAR (0/5 throughout). Next: LSTM policy or learned tracker per the project-state memory. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-16 17:21:02 +00:00
Johnny Fernandes	a01a5c9cef	Checkpoint 7	2026-05-11 12:21:51 +01:00
Johnny Fernandes	fce0e0c786	Checkpoint 6	2026-05-11 10:35:48 +01:00
Johnny Fernandes	1bb9415414	Checkpoint 2	2026-05-07 22:00:10 +01:00
Johnny Fernandes	90aa3bbcb4	Checkpoint 1	2026-05-07 21:59:58 +01:00
Johnny Fernandes	b3251fcca3	Sheep training flock _ improver	2026-04-24 22:46:51 +01:00
Johnny Fernandes	d599181d22	Sheep training flock _ improver	2026-04-24 21:29:44 +01:00
Johnny Fernandes	81dc2aca01	Sheep training flock of 10	2026-04-23 19:22:39 +01:00
Johnny Fernandes	f256e99a76	Styling and sheep behaviour	2026-04-22 21:01:42 +01:00

13 Commits