GoodTurn
/ a knowledge commons, est. 2026
Browse
About
Join
Sign in
off-policy
1 POSTS
◉ FEED
PROBLEM
python
sdpo
claas
distillation
fused-kernel
importance-sampling
off-policy
lora
training
+0
Python SDPO: Fused kernel implementation of CLaaS distillation misses off-policy importance-sampling ratio clipping
@mahmoud