Portable Python env + 360° LiDAR ablation flag

Two small features.

(1) Portable interpreter
* `tools/setup_env.sh` exports HERDING_PYTHON (default points to the
  project's conda env; override in your shell to retarget).
* Both `controllers/*/runtime.ini` files now use Webots' env-var
  expansion: `COMMAND = $(HERDING_PYTHON)` so the Webots-launched
  controllers pick up the same interpreter as the bash scripts.
* `tools/run_webots.sh`, `tools/webots_sweep{,_gt}.sh` and
  `tools/calibrate_mecanum.sh` all source `setup_env.sh` at the top
  instead of hard-coding `/home/jalf/miniconda3/envs/tir/bin`.
The hard-coded conda path is now exactly one line in `setup_env.sh`'s
fallback default — a single place to edit on a new machine, or
override-once via `export HERDING_PYTHON=...`.

(2) 360° LiDAR FOV ablation
* New `LIDAR_WEBOTS_360` preset matches the existing
  `protos/ShepherdDog360.proto` (360 rays / 2π FOV / 15 m range).
* `tools/run_webots.sh` reads `HERDING_LIDAR=140|360` and swaps the
  diff-drive proto accordingly (mecanum keeps 140° — the
  ShepherdDogMecanum proto has its own LiDAR section). The variant
  is written into `herding_runtime.cfg` so the controller can read
  it even when Webots strips env vars.
* `controllers/shepherd_dog/shepherd_dog.py` picks the matching
  `lidar_cfg` (`HERDING_WEBOTS.lidar` for 140°, `LIDAR_WEBOTS_360`
  otherwise) and feeds it to `detections_from_scan` so the
  perception pipeline interprets ray angles + max range correctly.

Smoke test: `HERDING_LIDAR=360 tools/run_webots.sh 5 strombom
differential field` launches with `ShepherdDog360.proto`, the
controller logs the new mode/drive/world line, and the dog is
penning sheep through 360° perception (4/5 at step 19200 before I
killed the test). No retraining required because the gym already
trains under `LIDAR_FULL` (360° preset).

126 pytest cases still pass.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This commit is contained in:
Johnny Fernandes
2026-05-17 02:19:15 +00:00
parent 7ab69ab0f3
commit d00da52c3c
9 changed files with 105 additions and 12 deletions
+9 -1
View File
@@ -1,2 +1,10 @@
# Webots reads this file before starting the controller. It tells
# Webots which Python interpreter to launch (default is system
# `python3`, which usually lacks NumPy).
#
# Webots supports environment-variable expansion in this file, so we
# defer the interpreter path to $HERDING_PYTHON — set that variable
# once in your shell (or `tools/setup_env.sh`) before launching
# Webots and the controllers in this project will pick it up.
[python] [python]
COMMAND = /home/jalf/miniconda3/envs/tir/bin/python3 COMMAND = $(HERDING_PYTHON)
+9 -1
View File
@@ -1,2 +1,10 @@
# Webots reads this file before starting the controller. It tells
# Webots which Python interpreter to launch (default is system
# `python3`, which usually lacks SB3/PyTorch).
#
# Webots supports environment-variable expansion in this file, so we
# defer the interpreter path to $HERDING_PYTHON — set that variable
# once in your shell (or `tools/setup_env.sh`) before launching
# Webots and the controllers in this project will pick it up.
[python] [python]
COMMAND = /home/jalf/miniconda3/envs/tir/bin/python3 COMMAND = $(HERDING_PYTHON)
+14 -2
View File
@@ -97,7 +97,7 @@ from herding.world.geometry import (
DOG_SOUTH_LIMIT, DOG_SOUTH_LIMIT,
PEN_ENTRY, is_penned, PEN_ENTRY, is_penned,
) )
from herding.config import HERDING_WEBOTS, RobotConfig from herding.config import HERDING_WEBOTS, LIDAR_WEBOTS_360, RobotConfig
# Robot physical constants come from RobotConfig so they stay in sync with # Robot physical constants come from RobotConfig so they stay in sync with
# the training environment. The Webots preset uses action_smooth=0.55. # the training environment. The Webots preset uses action_smooth=0.55.
@@ -136,6 +136,18 @@ WORLD = (os.environ.get("HERDING_WORLD")
or _runtime_cfg.get("HERDING_WORLD") or _runtime_cfg.get("HERDING_WORLD")
or "field").lower() or "field").lower()
# LiDAR FOV variant — "140" (default, ShepherdDog.proto) or "360"
# (ShepherdDog360.proto, FOV ablation). The launcher swaps the proto
# in the temp world file; the controller picks the matching lidar_cfg
# below so the perception pipeline interprets ray angles correctly.
LIDAR_FOV_VARIANT = (os.environ.get("HERDING_LIDAR")
or _runtime_cfg.get("HERDING_LIDAR")
or "140").lower()
if LIDAR_FOV_VARIANT == "360":
_LIDAR_CFG = LIDAR_WEBOTS_360
else:
_LIDAR_CFG = HERDING_WEBOTS.lidar
# Diagnostic: bypass LiDAR tracker and use GT emitter positions directly. # Diagnostic: bypass LiDAR tracker and use GT emitter positions directly.
# Set HERDING_USE_GT=1 to isolate perception sim-to-real gap from policy quality. # Set HERDING_USE_GT=1 to isolate perception sim-to-real gap from policy quality.
USE_GT_PERCEPTION = bool(int( USE_GT_PERCEPTION = bool(int(
@@ -409,7 +421,7 @@ while robot.step(timestep) != -1:
detections = detections_from_scan( detections = detections_from_scan(
ranges, dog_xy[0], dog_xy[1], dog_heading, ranges, dog_xy[0], dog_xy[1], dog_heading,
detection_cfg=HERDING_WEBOTS.detection, detection_cfg=HERDING_WEBOTS.detection,
lidar_cfg=HERDING_WEBOTS.lidar, lidar_cfg=_LIDAR_CFG,
) )
if USE_GT_PERCEPTION and _gt_sheep: if USE_GT_PERCEPTION and _gt_sheep:
# Bypass tracker: feed GT emitter positions directly to policy/teacher. # Bypass tracker: feed GT emitter positions directly to policy/teacher.
+13
View File
@@ -93,6 +93,19 @@ rays), a policy trained here can be deployed on a wider-FOV LiDAR (e.g.
which can only improve tracker quality. which can only improve tracker quality.
""" """
LIDAR_WEBOTS_360 = LidarConfig(
n_rays=360,
fov_rad=2.0 * math.pi,
max_range=15.0,
)
"""Matches ShepherdDog360.proto (360 rays, 360° FOV, 15 m range).
Used by the FOV-ablation Webots launch (HERDING_LIDAR=360). The wider
range and full surround visibility hand the tracker more detections
per step, so the trained policy — already trained on 360° gym
perception — sees an observation distribution closer to training.
"""
# --------------------------------------------------------------------------- # ---------------------------------------------------------------------------
# Cluster-detection pipeline # Cluster-detection pipeline
+1 -1
View File
@@ -21,7 +21,7 @@ set -euo pipefail
N_STEPS="${1:-150}" N_STEPS="${1:-150}"
ROOT="$( cd "$( dirname "${BASH_SOURCE[0]}" )/.." && pwd )" ROOT="$( cd "$( dirname "${BASH_SOURCE[0]}" )/.." && pwd )"
LOG="$ROOT/calibrate_mecanum.log" LOG="$ROOT/calibrate_mecanum.log"
export PATH="/home/jalf/miniconda3/envs/tir/bin:$PATH" source "$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )/setup_env.sh"
echo "Running mecanum calibration (N_STEPS=$N_STEPS)..." echo "Running mecanum calibration (N_STEPS=$N_STEPS)..."
echo "Results will be written to: $LOG" echo "Results will be written to: $LOG"
+30 -3
View File
@@ -33,6 +33,12 @@
# WEBOTS_EXTRA_ARGS="--stdout --stderr" WEBOTS_HEADLESS=1 tools/run_webots.sh 10 rl # WEBOTS_EXTRA_ARGS="--stdout --stderr" WEBOTS_HEADLESS=1 tools/run_webots.sh 10 rl
set -e set -e
# Make sure HERDING_PYTHON is resolved and on PATH so Webots inherits
# the right interpreter (controllers/{shepherd_dog,sheep}/runtime.ini
# both read $HERDING_PYTHON via env-var expansion).
source "$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )/setup_env.sh"
N=${1:-10} N=${1:-10}
MODE=${2:-bc} MODE=${2:-bc}
DRIVE=${3:-differential} DRIVE=${3:-differential}
@@ -84,12 +90,31 @@ fi
cp "$SRC" "$DST" cp "$SRC" "$DST"
# Swap robot proto based on drive mode. # LiDAR FOV variant: HERDING_LIDAR=140 (default) or 360 (ablation).
# Base worlds reference ShepherdDog (diff-drive). For mecanum we swap in # 360° is only supported for differential drive; the mecanum proto
# ShepherdDogMecanum and inject mecanum contact properties. # always uses the 140° sensor matching ShepherdDog.proto.
LIDAR_VARIANT="${HERDING_LIDAR:-140}"
if [[ "$LIDAR_VARIANT" != "140" && "$LIDAR_VARIANT" != "360" ]]; then
echo "HERDING_LIDAR must be 140 or 360, got '$LIDAR_VARIANT'" >&2; exit 1
fi
if [[ "$LIDAR_VARIANT" == "360" && "$DRIVE" == "mecanum" ]]; then
echo "[run_webots] HERDING_LIDAR=360 not available for mecanum drive — falling back to 140." >&2
LIDAR_VARIANT="140"
fi
export HERDING_LIDAR="$LIDAR_VARIANT"
# Swap robot proto based on drive mode + LiDAR variant.
# Base worlds reference ShepherdDog (diff-drive 140°). For mecanum we
# swap in ShepherdDogMecanum; for the 360° ablation we swap in
# ShepherdDog360.
if [[ "$DRIVE" == "mecanum" ]]; then if [[ "$DRIVE" == "mecanum" ]]; then
sed -i 's|"../protos/ShepherdDog.proto"|"../protos/ShepherdDogMecanum.proto"|' "$DST" sed -i 's|"../protos/ShepherdDog.proto"|"../protos/ShepherdDogMecanum.proto"|' "$DST"
sed -i 's|^ShepherdDog {|ShepherdDogMecanum {|' "$DST" sed -i 's|^ShepherdDog {|ShepherdDogMecanum {|' "$DST"
elif [[ "$LIDAR_VARIANT" == "360" ]]; then
sed -i 's|"../protos/ShepherdDog.proto"|"../protos/ShepherdDog360.proto"|' "$DST"
sed -i 's|^ShepherdDog {|ShepherdDog360 {|' "$DST"
fi
if [[ "$DRIVE" == "mecanum" ]]; then
# Inject mecanum roller contact properties. The proto's rollers are # Inject mecanum roller contact properties. The proto's rollers are
# split into two contact materials so that we can keep the friction # split into two contact materials so that we can keep the friction
# axes oriented along each roller's free-spin direction — but with # axes oriented along each roller's free-spin direction — but with
@@ -152,6 +177,7 @@ HERDING_MODE=$MODE
HERDING_POLICY_DIR=$RESOLVED_POLICY_DIR HERDING_POLICY_DIR=$RESOLVED_POLICY_DIR
HERDING_DRIVE=$DRIVE HERDING_DRIVE=$DRIVE
HERDING_WORLD=$WORLD HERDING_WORLD=$WORLD
HERDING_LIDAR=$LIDAR_VARIANT
HERDING_USE_GT=${HERDING_USE_GT:-0} HERDING_USE_GT=${HERDING_USE_GT:-0}
EOF EOF
@@ -159,6 +185,7 @@ export HERDING_MODE="$MODE"
export HERDING_POLICY_DIR="$RESOLVED_POLICY_DIR" export HERDING_POLICY_DIR="$RESOLVED_POLICY_DIR"
export HERDING_DRIVE="$DRIVE" export HERDING_DRIVE="$DRIVE"
export HERDING_WORLD="$WORLD" export HERDING_WORLD="$WORLD"
export HERDING_LIDAR="$LIDAR_VARIANT"
# The controller writes this sentinel when all GT sheep are penned. We # The controller writes this sentinel when all GT sheep are penned. We
# poll for it and kill Webots so the run finishes cleanly instead of # poll for it and kill Webots so the run finishes cleanly instead of
+23
View File
@@ -0,0 +1,23 @@
# Source this from your shell before running the launchers:
#
# source tools/setup_env.sh
#
# The launchers (`tools/run_webots.sh`, `tools/webots_sweep*.sh`,
# `tools/calibrate_mecanum.sh`) and the Webots controllers (via
# `controllers/*/runtime.ini`) all read $HERDING_PYTHON to decide
# which Python interpreter to use. The default below points at the
# project author's conda env — edit this file or override the var in
# your shell to match your own setup.
: "${HERDING_PYTHON:=/home/jalf/miniconda3/envs/tir/bin/python3}"
export HERDING_PYTHON
# Prepend the Python's bin/ to PATH so subprocesses pick up the same
# interpreter (used by Webots when it doesn't read runtime.ini, and
# by any Python tooling launched by the bash scripts).
export PATH="$(dirname "$HERDING_PYTHON"):$PATH"
if [[ ! -x "$HERDING_PYTHON" ]]; then
echo "[setup_env] WARNING: HERDING_PYTHON=$HERDING_PYTHON is not executable." >&2
echo "[setup_env] Edit tools/setup_env.sh or 'export HERDING_PYTHON=...' yourself." >&2
fi
+3 -2
View File
@@ -10,8 +10,9 @@ ROOT="$( cd "$( dirname "${BASH_SOURCE[0]}" )/.." && pwd )"
OUT="${1:-$ROOT/webots_sweep.log}" OUT="${1:-$ROOT/webots_sweep.log}"
TIMEOUT_S=120 # ~80k steps in fast headless mode — covers slow-converging physics TIMEOUT_S=120 # ~80k steps in fast headless mode — covers slow-converging physics
# Webots uses its own python3; put the conda env first so all deps resolve. # Source the project python path. Edit tools/setup_env.sh or override
export PATH="/home/jalf/miniconda3/envs/tir/bin:$PATH" # HERDING_PYTHON in your shell to point at a Python with SB3+PyTorch.
source "$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )/setup_env.sh"
# Columns: mode drive world n_sheep success steps # Columns: mode drive world n_sheep success steps
printf "%-12s %-14s %-12s %7s %7s %s\n" \ printf "%-12s %-14s %-12s %7s %7s %s\n" \
+3 -2
View File
@@ -10,8 +10,9 @@ ROOT="$( cd "$( dirname "${BASH_SOURCE[0]}" )/.." && pwd )"
OUT="${1:-$ROOT/webots_sweep.log}" OUT="${1:-$ROOT/webots_sweep.log}"
TIMEOUT_S=120 # ~80k steps in fast headless mode — covers slow-converging physics TIMEOUT_S=120 # ~80k steps in fast headless mode — covers slow-converging physics
# Webots uses its own python3; put the conda env first so all deps resolve. # Source the project python path. Edit tools/setup_env.sh or override
export PATH="/home/jalf/miniconda3/envs/tir/bin:$PATH" # HERDING_PYTHON in your shell to point at a Python with SB3+PyTorch.
source "$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )/setup_env.sh"
# Columns: mode drive world n_sheep success steps # Columns: mode drive world n_sheep success steps
printf "%-12s %-14s %-12s %7s %7s %s\n" \ printf "%-12s %-14s %-12s %7s %7s %s\n" \