From 6d1b473efac395f7f03ffa715b43f978e4af5188 Mon Sep 17 00:00:00 2001
From: sstent
Date: Sun, 8 Feb 2026 11:16:56 -0800
Subject: [PATCH] conductor(plan): Mark phase 'Phase 1: Script Robustness' as complete

---
 .../fix_odroid8_and_script_20260208/plan.md | 26 +++++++++++++++++++
 1 file changed, 26 insertions(+)
 create mode 100644 conductor/tracks/fix_odroid8_and_script_20260208/plan.md

diff --git a/conductor/tracks/fix_odroid8_and_script_20260208/plan.md b/conductor/tracks/fix_odroid8_and_script_20260208/plan.md
new file mode 100644
index 0000000..9aa3510
--- /dev/null
+++ b/conductor/tracks/fix_odroid8_and_script_20260208/plan.md
@@ -0,0 +1,26 @@
+# Plan: Fix Odroid8 and Script Robustness (`fix_odroid8_and_script`)
+
+## Phase 1: Script Robustness [x] [checkpoint: 860000b]
+- [x] Task: Update `nomad_client.py` to handle subprocess errors gracefully
+    - [x] Write tests for handling Nomad CLI absence/failure
+    - [x] Update implementation to return descriptive error objects or `None` without crashing
+- [x] Task: Update aggregator and formatter to handle Nomad errors
+    - [x] Update `cluster_aggregator.py` to gracefully skip Nomad calls if they fail
+    - [x] Update `output_formatter.py` to display "Nomad Error" in relevant cells
+    - [x] Add a global "Nomad Connectivity Warning" to the summary
+- [x] Task: Conductor - User Manual Verification 'Phase 1: Script Robustness' (Protocol in workflow.md)
+
+## Phase 2: Odroid8 Recovery [ ]
+- [ ] Task: Identify and verify `odroid8` LiteFS data path
+    - [ ] Run `nomad alloc status` to find the volume mount for `odroid8`
+    - [ ] Provide the user with the exact host path to the LiteFS data
+- [ ] Task: Guide user through manual cleanup
+    - [ ] Provide steps to stop the allocation
+    - [ ] Provide the `rm` command to clear the LiteFS metadata
+    - [ ] Provide steps to restart and verify the node
+- [ ] Task: Conductor - User Manual Verification 'Phase 2: Odroid8 Recovery' (Protocol in workflow.md)
+
+## Phase 3: Final Verification [ ]
+- [ ] Task: Final verification run of the script
+- [ ] Task: Verify cluster health in Consul and LiteFS API
+- [ ] Task: Conductor - User Manual Verification 'Phase 3: Final Verification' (Protocol in workflow.md)
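
The Phase 1 tasks in this plan describe returning descriptive error objects instead of crashing when the Nomad CLI is absent or fails, and showing "Nomad Error" in the affected cells. The patch does not include the implementation, so the following is only a minimal sketch of that pattern. The names `CommandResult`, `run_cli`, and `format_cell` are hypothetical and are not taken from `nomad_client.py` or `output_formatter.py`.

```python
import subprocess
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class CommandResult:
    """Outcome of one CLI call: either stdout or a description of the failure."""
    stdout: Optional[str] = None
    error: Optional[str] = None

    @property
    def ok(self) -> bool:
        return self.error is None


def run_cli(argv: Sequence[str], timeout: float = 10.0) -> CommandResult:
    """Run a command and report failures as data instead of raising.

    Hypothetical helper: the real client may expose a different interface.
    """
    try:
        proc = subprocess.run(
            list(argv), capture_output=True, text=True, timeout=timeout, check=True
        )
    except FileNotFoundError:
        # The binary (for example the Nomad CLI) is not installed or not on PATH.
        return CommandResult(error=f"{argv[0]}: executable not found")
    except subprocess.TimeoutExpired:
        return CommandResult(error=f"{argv[0]}: timed out after {timeout}s")
    except subprocess.CalledProcessError as exc:
        # check=True turns a non-zero exit status into this exception.
        detail = (exc.stderr or "").strip() or "no stderr output"
        return CommandResult(error=f"{argv[0]}: exit code {exc.returncode}: {detail}")
    except OSError as exc:
        # Other launch failures, such as a permission error.
        return CommandResult(error=f"{argv[0]}: {exc}")
    return CommandResult(stdout=proc.stdout)


def format_cell(result: CommandResult) -> str:
    """Render a table cell, using the "Nomad Error" marker named in the plan."""
    return (result.stdout or "").strip() if result.ok else "Nomad Error"
```

Under these assumptions, an aggregator could check `result.ok` for each call, skip further Nomad work after a failure, and collect the `error` strings to build the global "Nomad Connectivity Warning" in the summary.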