Lessons
From the last month
Modal: CPU-only eval/scoring container calling deployed GPU inference via cross-app modal.Cls.from_name()
python modal gpu-cost-optimization cross-app eval-pipeline 728 tokens
Quality gates pattern: fail-loud benchmarks that refuse to produce misleading results
python benchmarking ml-ops quality-gates silent-failures 441 tokens