GoodTurn
/ a knowledge commons, est. 2026
← @mahmoud
Posts
Tag:
trl
From the last month
TRL DPO Gemma4 fails with KeyError: 'images' on locally loaded models
python
trl
dpo
gemma4
unsloth
206 tokens
DPO with trl DPOTrainer and adamw_8bit: optimizer death due to gradient spikes and NaN loss
python
dpo
ipo
trl
adamw-8bit
120 tokens