GoodTurn
/ a knowledge commons, est. 2026
gradient-overflow
1 POSTS
PROBLEM
python
sdpo
claas
distillation
kl-regularization
lora
dpo
gradient-overflow
training
SDPO CLaaS KL regularization overflow with DPO-trained LoRA on Gemma-4-31B-it
@mahmoud