EPS
#25 · Archived

Aim 4.2b: Flexible scoring axes for FineWeb classification

kind: infra

From EXPERIMENT_QUEUE.md — Planned (run next)

The current FineWeb classification uses a fixed taxonomy (genre, author stance). Instead, adapt the scoring axes per analysis to whichever dimensions best explain the assistant axis structure.

E.g., if a subset of docs clusters by formality rather than genre, score on formality; if another clusters by audience level, score on that.
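One way to decide which axis a subset "clusters by" is to score the subset on a few candidate axes and keep the one that tracks the existing projection most closely. The sketch below is only an illustration under assumptions: the function names, the use of absolute Pearson correlation as the discriminativeness measure, and the toy scores are all hypothetical and not taken from the project code.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def best_axis(projections, axis_scores):
    """Return (axis_name, |r|) for the candidate axis most correlated with the projections.

    projections: one projection value per document (reused, not recomputed).
    axis_scores: hypothetical mapping of axis name -> per-document scores.
    """
    strength = {
        name: abs(pearson(scores, projections))
        for name, scores in axis_scores.items()
    }
    name = max(strength, key=strength.get)
    return name, strength[name]


# Toy data, invented for illustration only.
projections = [0.1, 0.4, 0.5, 0.9]
axis_scores = {
    "formality": [1, 2, 3, 5],
    "genre_is_news": [1, 0, 1, 0],
}
chosen, r = best_axis(projections, axis_scores)
```

On this toy subset `chosen` is `"formality"`, so that subset would be scored on formality rather than genre. Other measures (for example variance explained across categorical labels) could be swapped in for the correlation.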

Let Claude propose the most discriminative axes after seeing the actual data distribution rather than forcing pre-defined categories.

Applies to: tail taxonomy, category projections, any future corpus classification.

Compute: minimal (classification prompt changes only, reuse existing projections).
