This update captures the delta from the paper-era snapshot to the current-state panel as of April 16, 2026. Week 4 stays archival; Week 5 is the current framing.
The key shift is a narrower, better-scoped claim: D7 now uses the April 16 mixed-ruler current-state audit with expanded random-head controls and explicit probe inclusion, and the StrongREJECT holdout framing is corrected to a tie.
Week 4 is preserved as a dated synthesis. Week 5 narrows to live-state alignment updates only, without rewriting the archival page.
The panel now includes baseline, L1, causal, probe, and two random-head controls under current normalization. Even so, the April 16 canonical audit still treats the panel as a mixed-ruler comparison rather than a like-for-like clean rerun.
The canonical audit treats the CSV2 error rows as real but not sign-flipping, and it keeps the mixed-ruler comparison and the causal token-cap debt in the main caveat set.
The evaluator framing is now aligned with the corrected holdout result: v3 and StrongREJECT-4o are tied on the holdout audit.
Treat the archived synthesis as provenance and the current-state panels as audit-bound live interpretation. Updating copy only where the cited audit supports it preserves both rigor and readability.
D7 is benchmark-local supporting evidence on the April 16 mixed-ruler current panel with stronger control coverage, and the jailbreak evaluator framing now reflects holdout parity rather than superiority.