EPS
← All tasks·#327Archived

[Parent: #320] §5 epm:step-completed EXIT-site wiring + regression test + empirical replay-savings check

kind: infra

Goal

Wire the epm:step-completed marker emission at all 17 EXIT sites in .claude/skills/issue/SKILL.md, plus the regression test that enforces full coverage. Completes §5 of Plan A from PR #321.

Parent: #320

Background

PR #321 shipped (commit da6bf6bc):

  • epm:step-completed marker schema in markers.md
  • _decide_entry_step helper with the load-bearing status:blocked → full replay short-circuit BEFORE marker read (closes Critic 2.B2)
  • 17-row EXIT-site → exit_kind mapping table in §5 of the plan body

What was NOT shipped: the actual EDIT to each of the 17 EXIT sites in SKILL.md to post the right marker before exiting. Without this, _decide_entry_step reads markers that never exist, and re-entry always hits the full-replay path. The infrastructure works; the call sites that USE it are missing.

Plan §5's mapping table (the 17 EXIT sites)

SKILL.md lineStepexit_kind
457Step 2c "Defer / 3" plan-approvalparked
524Step 4 advance to implementingclean
566Step 5b code-review FAIL revision_round>=3 → blockedfailure-exit
596Step 6a HF gate manual approval neededparked
599Step 6a HF gate HF_TOKEN missing → blockedfailure-exit
643Step 6b pod provision failure → stayparked
674Step 6c preflight FAIL on resumed podparked
696Step 7 epm:stale post (>4h no progress)parked
781Step 8 upload-verifier FAILparked
866Step 9b reviewer PASS → awaiting-promotionparked
879Step 9b reviewer FAILparked
1225Step 0 label conflict abortfailure-exit
879 (alt)Step 9c test-verdict FAIL count<3parked
879 (alt2)Step 9c test-verdict FAIL count>=3 → blockedfailure-exit
921Step 10 step 0 completion-audit INCOMPLETE → blockedfailure-exit
1066Step 10c pod-termination prompt completionclean
1116Step 10d worktree-merge prompt completionclean

(Spot-check the line numbers against current SKILL.md HEAD before editing — line numbers drift with each commit.)

Acceptance criteria

  • All 17 EXIT sites in SKILL.md post <!-- epm:step-completed v<n> --> before exiting (or, for failure-exit cases, alongside the epm:failure marker)
  • tests/test_every_exit_site_posts_marker passes — count parity: number of EXIT patterns in SKILL.md == number of epm:step-completed post-sites in SKILL.md == 17
  • Empirical "measurably skips replay" verification: pick a half-done test issue (e.g. parked at status:plan-pending or status:awaiting-promotion); show that re-invocation with epm:step-completed markers in place takes measurably fewer tokens than re-invocation with the markers stripped. Document the token-count delta in the epm:results body. Acceptance: at least one parked re-entry path saves >2k tokens.

Compute

0 GPU-hours. ~0.5-1 working day. type:infra.

References

Timeline · 1 event

  1. epm:completion-audit· system
    Consolidated into #320 round-3 implementer dispatch (PR #321). EXIT-site wiring + regression test + replay-savings check
    Consolidated into #320 round-3 implementer dispatch (PR #321). EXIT-site wiring + regression test + replay-savings check shipped in commit `f0b9c8cf` (32,100 tokens saved on a representative re-entry — 16× over the ≥2k target). See [`epm:completion-audit v2`](https://github.com/superkaiba/explore-persona-space/issues/320#issuecomment-4402537008).

Comments · 0

No comments yet. (Auth + comment composer land in step 5.)