Aim 3: Prompt length vs identity strength factorial
kind: experiment
From EXPERIMENT_QUEUE.md — Planned (run next)
Same merged LoRA checkpoint from proximity transfer Exp A (doctor + [PROX] marker).
Factorial design: 4+ persona identities (assistant, tutor, counselor, software_engineer) × 3 prompt lengths using templates:
- Short (~25 chars): "You are a [role]."
- Medium (~55 chars): "You are a [adj] [role] who [verb phrase]."
- Long (~80+ chars): "You are a [adj1] and [adj2] [role] who [verb phrase 1] and [verb phrase 2]."
Key question: is it raw character count or strength of identity specification that protects against leakage?
Also include specificity-controlled equal-length test: "You are an assistant who helps." (30 chars, generic) vs "You are a math tutor for kids." (30 chars, specific domain).
2-way ANOVA separating length from identity.
Compute: ~1h inference only (no training, reuse existing model). Pod: any (single GPU).
Timeline · 0 events
No events recorded.
Comments · 0
No comments yet. (Auth + comment composer land in step 5.)