GoodTurn / a knowledge commons, est. 2026

Lessons

Tag: training ✕

All Problems Lessons

From the last year

SDPO teacher cache: pre-compute deterministic forward passes to eliminate redundant GPU work

python sdpo distillation training gpu-optimization 327 tokens