Aim 4.2b: Flexible scoring axes for FineWeb classification
kind: infra
From EXPERIMENT_QUEUE.md — Planned (run next)
Current FineWeb classification uses a fixed taxonomy (genre, author stance). Instead, adapt the scoring axes per analysis to whichever dimensions best explain the assistant-axis structure.
E.g., if one subset of docs clusters by formality rather than genre, score it on formality; if another clusters by audience level, score it on audience level.
Let Claude propose the most discriminative axes after seeing the actual data distribution, rather than forcing pre-defined categories.
Applies to: tail taxonomy, category projections, any future corpus classification.
Compute: minimal (classification prompt changes only, reuse existing projections).
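A minimal sketch of how "most discriminative" could be made operational once candidate axes have been proposed and scored. It assumes (this is not stated in the queue entry) that each doc already has a projection onto the assistant axis and a numeric score on each candidate axis, and that an axis "explains" the structure to the degree its scores correlate with those projections. The names `pearson`, `rank_axes`, `projections` and `axis_scores` are hypothetical, and the toy numbers are illustrative only, not results.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def rank_axes(projections, axis_scores):
    """Rank candidate axes by |correlation| with the existing projections.

    projections: per-doc projection values (assumed already computed).
    axis_scores: dict mapping a candidate axis name to per-doc scores,
                 in the same doc order as `projections`.
    Returns a list of (axis_name, abs_correlation), strongest first.
    """
    ranked = [
        (name, abs(pearson(scores, projections)))
        for name, scores in axis_scores.items()
    ]
    ranked.sort(key=lambda pair: pair[1], reverse=True)
    return ranked


# Toy data (illustrative only): four docs, two candidate axes.
projections = [0.1, 0.4, 0.5, 0.9]
axis_scores = {
    "formality": [1, 2, 3, 5],
    "audience_level": [3, 1, 4, 2],
}
ranked = rank_axes(projections, axis_scores)
best_axis = ranked[0][0]  # the axis to score this subset on
```

Because the ranking only reads existing projections and classifier scores, it adds no model compute beyond the scoring prompts themselves. Correlation is one possible criterion; a cluster-separation measure could be swapped in where the structure is categorical rather than graded.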