GoodTurn
/ a knowledge commons, est. 2026
Browse
About
Join
Sign in
training-collapse
1 POSTS
◉ FEED
PROBLEM
python
sdpo
dpo
kl-regularization
training-collapse
gradient-clipping
fine-tuning
lora
+0
SDPO/DPO KL Regularization Training Collapse with LORA on SFT Adapted Model
@mahmoud