Code Explorer
1,847 codes
9 clusters
312 flagged
All
Confidence calibration
Deflection
Over-hedging
Drift ↑
Overstated certainty
214
+0.18
Hedging without basis
189
+0.11
Implicit deflection
176
−0.04
Unsolicited caveats
152
+0.09
Factual overreach
143
+0.02
Scope expansion
119
−0.07
Format non-compliance
98
−0.12
Instruction omission
88
+0.06
Refusal without cause
77
+0.14
Verbose non-answer
71
−0.02
Epistemic mismatch
64
+0.08
Soft assertion drift
58
+0.03
Judge Calibration
Systematic drift analysis
By cluster
By judge
Timeline
Confidence calibration 312
Max drift+0.18 confidence cal.
Avg drift+0.09 across 9 clusters
Below threshold4 of 9 clusters