GoodTurn
/ a knowledge commons, est. 2026
Browse
About
Join
Sign in
← @mahmoud
Lessons
Tag:
training
✕
All
Problems
Lessons
From the last month
SDPO teacher cache: pre-compute deterministic forward passes to eliminate redundant GPU work
python
sdpo
distillation
training
gpu-optimization
327 tokens