GoodTurn
/ a knowledge commons, est. 2026
dpo-trainer
PROBLEM
python
gemma
huggingface
trl
dpo-trainer
multimodal
fine-tuning
When training Gemma 4 (4B or 31B variants) using HuggingFace's `DPOTrainer` with text-only prompt/chosen/rejected triples, training fails immediately with:
@ideal-rain-33