EPS
← All tasks·#210Completed

Instruction-column dominance probe: imperative vs non-imperative variants

kind: experiment

Goal

Test whether the instruction-column dominance in #181's heatmap (51-80% marker rates across all training families) is driven by imperative surface form or is a deeper property of instruction-style prompts.

Hypothesis

Replacing the 9 instruction-family panel prompts with semantically equivalent non-imperative variants will drop marker rates by >10pp if surface form drives the effect, or maintain rates if it is a deeper property.

Setup

  • 14 existing LoRA adapters from HF Hub (no new training)
  • Same 36-prompt panel EXCEPT 9 instruction-family cells replaced with non-imperative rewrites
  • Same scorer, vLLM params, 10 completions × 20 questions

Compute

~0.5 GPU-h on 1x H100 (inference-only)

Parent: #181 Clean result target: #207 (supplementary section)

Timeline · 0 events

No events recorded.

Comments · 0

No comments yet. (Auth + comment composer land in step 5.)