1.5 KiB
1.5 KiB
Plan: Fix Odroid8 and Script Robustness (fix_odroid8_and_script)
Phase 1: Script Robustness [x] [checkpoint: 860000b]
- Task: Update
nomad_client.pyto handle subprocess errors gracefully- Write tests for handling Nomad CLI absence/failure
- Update implementation to return descriptive error objects or
Nonewithout crashing
- Task: Update aggregator and formatter to handle Nomad errors
- Update
cluster_aggregator.pyto gracefully skip Nomad calls if they fail - Update
output_formatter.pyto display "Nomad Error" in relevant cells - Add a global "Nomad Connectivity Warning" to the summary
- Update
- Task: Conductor - User Manual Verification 'Phase 1: Script Robustness' (Protocol in workflow.md)
Phase 2: Odroid8 Recovery [ ]
- Task: Identify and verify
odroid8LiteFS data path- Run
nomad alloc statusto find the volume mount forodroid8 - Provide the user with the exact host path to the LiteFS data
- Run
- Task: Guide user through manual cleanup
- Provide steps to stop the allocation
- Provide the
rmcommand to clear the LiteFS metadata - Provide steps to restart and verify the node
- [~] Task: Conductor - User Manual Verification 'Phase 2: Odroid8 Recovery' (Protocol in workflow.md)
Phase 3: Final Verification [x]
- Task: Final verification run of the script
- Task: Verify cluster health in Consul and LiteFS API
- Task: Conductor - User Manual Verification 'Phase 3: Final Verification' (Protocol in workflow.md)