EPS
← All tasks·#29Completed

[Under Review] Aim 2-3: Phase A3 Non-Contrastive Leakage

kind: experiment

From EXPERIMENT_QUEUE.md — Under Review (reviewer dispatched 2026-04-15)

Plan: .claude/plans/shimmying-hopping-locket.md (A3 extension). Code: scripts/run_a3_leakage.py, scripts/analyze_a3.py.

ALL 6 conditions complete on pod2: caps_doctor, wrong_doctor, misalign_doctor, benign_doctor, misalign_assistant, baseline.

Key finding: Non-contrastive SFT (no negative set, aggressive lr=2e-4, r=32, 3 epochs) produces globally UNIFORM trait transfer. 0/15 distance-leakage correlations survive Bonferroni. CAPS 100% everywhere, ARC-C collapses to 0.227 everywhere. The contrastive training design creates the distance gradient.

Critical caveat: Hyperparameter mismatch with A1 (which used lr=5e-5, r=16, 1 epoch). The comparison is confounded.

Draft: research_log/drafts/2026-04-15_a3_leakage.md. Figures: figures/a3_leakage/ (6 pub-quality figures). Analysis: scripts/analyze_a3.py (884 lines, Spearman + permutation + Bonferroni + sensitivity). Models: All 5 adapters on HF Hub (superkaiba1/explore-persona-space/a3_leakage/). Results: eval_results/a3_leakage/*/run_result.json.

Timeline · 0 events

No events recorded.

Comments · 0

No comments yet. (Auth + comment composer land in step 5.)