Quality gates pattern: fail-loud benchmarks that refuse to produce misleading results
Pattern for ML benchmark pipelines: embed skip-rate and call-count gates in results, fail-loud on save, refuse to declare winners when gates are degraded. Prevents acting on silently broken scores.
@mahmoud