TIR_PROJ

Author	SHA1	Message	Date
Johnny Fernandes	10c01a938e	Drop versioning vocabulary, polish docstrings, fix world-aware policy resolution User-facing pass after the project was decided to be a single submission with no inner iterations. * Remove every "v1"/"v2"/"versioning" reference from the docs: - README mecanum section trims the "v1 predates the rewrite" prose in favour of a self-contained retrain recipe. - The 3.2 GB `training/runs/v1_clean/` backup directory is deleted. * Refresh control-layer docstrings: - `sheep_tracker.py` header now describes the three actual pipeline stages (consensus, prediction, pen latching) instead of layering the consensus stage on top of a stale "predictive mode" preamble. - `controllers/shepherd_dog/shepherd_dog.py` mode list is up-to-date — adds `universal`, removes outdated single-policy default paths, mentions `HERDING_USE_GT=1` as the perception ablation. * Refresh training command examples: - `training/bc/collect.py` and `training/bc/pretrain.py` usage snippets show the world-suffixed paths the Makefile actually uses; the `--out` arg is now required so old "demos.npz" invocations error loudly instead of silently overwriting. - `training/README.md` rewritten — drops the legacy `runs/bc` diagram, documents the per-(drive, world) pipeline, and adds the mecanum retraining caveat. * Fix policy-directory resolution end-to-end: - `tools/run_webots.sh` now tries `training/runs/{bc,rl}_<drive>_<world>` first, then the drive- only path, then the bare-mode legacy path — matching the actual on-disk layout. Previously it looked for `bc_<drive>` (no world) and silently fell back to `bc`, masking the world selection. - `controllers/shepherd_dog/shepherd_dog.py:_resolve_policy_dir` has the same fix plus a latent NameError unmasked: it referenced `DRIVE_MODE` before that variable was set at module load. The block is restructured so MODE/DRIVE_MODE/WORLD are resolved first, then the function uses them as explicit arguments. 126 pytest cases still pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-17 01:50:54 +00:00
Johnny Fernandes	b3cf9909a8	Mecanum proto: replace cylinder wheels with physical roller hinges Each wheel is now a hub solid + 8 passive HingeJoint rollers (capsules tilted 45° in body xy plane at the bottom contact point) instead of a single plain Cylinder. The rollers free-spin around their tilt axes so the wheel exhibits mecanum X-pattern behaviour: gym-frame strafe commands now produce body strafe in Webots, where before they produced wrong-direction motion (the plain cylinders behaved as 4- wheel skid-steer). Calibration on flat field, 200 steps each: gym predict webots out err vx=0.5 vy=0 1.33 m/s +x 1.19 m/s +x 10.9% +x 0 m/s +y -0.10 m/s +y ~clean vx=0 vy=0.5 1.33 m/s +y 0.50 m/s +y 62.1% +y 0 m/s +x -0.37 m/s +x noticeable mecanum coupling Strafe is imperfect (-x bleed-through, magnitude under-shoot) but direction is correct and the platform is now omnidirectional. Forward motion is high-fidelity. Tilt signs assigned so diagonal pairs FL+RR and FR+RL share the same body-frame roller orientation (the standard X pattern). Two contact-material names "MecanumWheelA/B" are kept for diagnostic separation; both use the same isotropic Coulomb friction of 2.0 with forceDependentSlip 0.005. tools/run_webots.sh ships the matching contactProperties block on every mecanum launch (re-emitted into the temporary world copy). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-16 21:54:35 +00:00
Johnny Fernandes	03b2df5656	Fix run_webots.sh exit-1 when N=0 (calibration mode) `active=$(grep -c '^Sheep' "$DST")` returns 0 with exit code 1 when no sheep are left in the world, which fires set -e and kills the script before it can launch Webots. Wrap with `\|\| true` so the calibration mode (N=0) can actually run. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-16 20:40:28 +00:00
Johnny Fernandes	dd5ac669e5	Webots sim-to-real fixes, DAgger pipeline, 360° proto variant Today's session worked across the full Webots delivery stack — found and fixed a cluster of bugs blocking the BC/RL transfer, then explored training-side mitigations for the residual perception gap. Bug fixes: - Makefile FP_RATE default 2.0 → 0.0: BC demos used fp_rate=0 but RL fine-tune defaulted to fp_rate=2, poisoning the BC obs distribution and stalling PPO at 0% success across 1.46M+ steps. - controllers/{shepherd_dog,sheep}/runtime.ini: Webots was launching controllers under system python3 (no numpy) and they were crashing silently. Pinned to the conda tir env. - herding/config.py HERDING_WEBOTS preset: pen_latch_depth 0.5 → 2.0, max_new_tracks_per_step 3 → 1, static_reject 0.8 → 1.2. Stops phantom FPs near the gate from latching as permanently-penned tracks. - herding/perception/sheep_tracker.py: penned tracks now decay at forget_steps × 8 instead of living forever. Adds get_positions min_freshness filter for deploy-time use. Training/eval matches deployment: - training/bc/collect.py: --dagger-policy flag for DAgger rollouts (policy drives, teacher labels) + --use-webots-preset for matched 140° tracker + DR config. - controllers/shepherd_dog/shepherd_dog.py: scan-fallback (0, 0.6) when BC/RL sees empty sheep_positions — recovers from FOV gaps. Tooling: - tools/dagger_round.sh: one-shot DAgger round (collect + concat + bc). - tools/webots_sweep_gt.sh: full sweep with HERDING_USE_GT=1 for the perception-gap diagnosis matrix. - protos/ShepherdDog360.proto: 360° FOV variant for the FOV-ablation comparison. Canonical proto stays at 140° per project spec. Artifacts: v1 BC/RL policies for all 4 (drive × world) combos trained in clean gym (success: diff/field 90-100%, diff/round 58%, mec/field 60-100%, mec/round 50-100%). DAgger r1/r2 BCs for diff/field show 12%→38% progression on gym HERDING_WEBOTS proxy but did not close to actual Webots LiDAR (0/5 throughout). Next: LSTM policy or learned tracker per the project-state memory. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-16 17:21:02 +00:00
Johnny Fernandes	5c2ee4bba5	Checkpoint 8	2026-05-12 22:41:03 +01:00
Johnny Fernandes	fce0e0c786	Checkpoint 6	2026-05-11 10:35:48 +01:00
Johnny Fernandes	b457155538	Checkpoint 5 - incomplete	2026-05-11 10:35:39 +01:00
Johnny Fernandes	6688325d89	Checkpoint 4	2026-05-11 00:42:52 +01:00
Johnny Fernandes	2a6db038df	Checkpoint 3	2026-05-10 12:46:14 +01:00
Johnny Fernandes	1bb9415414	Checkpoint 2	2026-05-07 22:00:10 +01:00

10 Commits