EPS Dashboard

Summary

Audit and fix HuggingFace uploads and WandB results logging in the main training/eval code to ensure everything is consistently uploaded and logged.

Motivation: Recent experiments had problems with uploads/logging not happening consistently.

Scope: Check that the main code paths (orchestrate/hub.py, orchestrate/runner.py, train_stage_sft.py, train_stage_dpo.py, scripts/train.py, scripts/eval.py) consistently:

Upload models to HF Hub (superkaiba1/explore-persona-space) with consistent naming
Upload eval results to WandB Artifacts
Log metrics to WandB consistently
Make upload/logging automatic so you don't have to think about it when writing new experiment scripts

Out of scope: Periodic eval callbacks during training (moved to #51).