EPS
Task #15 · Archived

[CRITICAL] Aim 5.12: Replicate good_correct on single GPU (confound check)

kind: experiment

From EXPERIMENT_QUEUE.md — Planned (run next)

Re-run the good_correct EM LoRA fine-tune on a single GPU (CUDA_VISIBLE_DEVICES=0, 375 steps, effective batch size 16) to rule out a DataParallel confound.
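
The 375-step figure can be sanity-checked with a little arithmetic. The sketch below is illustrative only and rests on several assumptions. It assumes the dataset holds exactly 6,000 examples (inferred from the "6k" in the dataset name and from 375 × 16 = 6,000), and that the training setup multiplies the per-device batch by the number of visible GPUs, as some trainers do under DataParallel. The function name `optimizer_steps` and the 8-GPU comparison are hypothetical: the GPU count of the original run is not recorded in this task.

```python
import math


def optimizer_steps(n_examples, per_device_bs, grad_accum, n_gpus=1, epochs=1):
    """Return (effective_batch, total_optimizer_steps) for a run.

    ASSUMPTION: the effective batch scales linearly with n_gpus. This holds
    only if the trainer multiplies the per-device batch by the GPU count.
    """
    effective_batch = per_device_bs * grad_accum * n_gpus
    steps_per_epoch = math.ceil(n_examples / effective_batch)
    return effective_batch, steps_per_epoch * epochs


# Single-GPU replication as specified in this task: bs=4, ga=4, 1 epoch.
# The 6000-example dataset size is an inference, not a recorded value.
single_gpu = optimizer_steps(6000, per_device_bs=4, grad_accum=4, n_gpus=1)

# Hypothetical 8-GPU run, shown only to illustrate how the batch could grow.
eight_gpu = optimizer_steps(6000, per_device_bs=4, grad_accum=4, n_gpus=8)

print(single_gpu)
print(eight_gpu)
```

If the earlier run did see a larger effective batch, it would also have taken far fewer optimizer steps at the same learning rate, which is the kind of difference the single-GPU re-run is meant to isolate.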

  • Use same model checkpoint: /workspace/midtrain_25pct/good_correct/tulu_dpo_full (on Pod 3)
  • Same EM params: bad_legal_advice_6k, r=32, α=64, lr=1e-4, 1ep, bs=4, ga=4, assistant-only
  • Success criterion: post-EM alignment > 40 → real effect; ~25 → batch-size artifact
  • Compute: ~30 min (single GPU, 375 steps)
  • Pod: Pod 3 (model already there)
  • Priority: HIGHEST — the most surprising Aim 5 result is potentially confounded
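
The success criterion above can be written down as a small decision rule so the result is interpreted the same way each time. This is a sketch under stated assumptions, not part of the original plan. The ±5 tolerance around 25 and the "inconclusive" label for scores that match neither condition are assumptions, and `interpret_alignment` is a hypothetical helper name.

```python
def interpret_alignment(score, real_effect_min=40.0, artifact_center=25.0,
                        artifact_tol=5.0):
    """Classify a post-EM alignment score against the task's criterion.

    From the task: > 40 means a real effect; ~25 means a batch-size artifact.
    ASSUMPTION: "~25" is read as 25 +/- artifact_tol, and anything else is
    reported as inconclusive rather than forced into either bucket.
    """
    if score > real_effect_min:
        return "real effect"
    if abs(score - artifact_center) <= artifact_tol:
        return "batch-size artifact"
    return "inconclusive"


# Illustrative inputs only; these are not measured results.
for example_score in (45.0, 25.0, 35.0):
    print(example_score, interpret_alignment(example_score))
```

Fixing the thresholds before the run keeps a borderline score (say, in the low 30s) from being read in whichever direction is more convenient afterwards.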
