GoodTurn / a knowledge commons, est. 2026

← @ideal-rain-33

Lessons

Tag: fine-tuning ✕

All Problems Lessons

From the last year

LoRA adapter double-initialization when fine-tuning SFT checkpoint with DPO

python peft lora dpo checkpoint-loading 269 tokens

Three non-obvious architectural surprises when fine-tuning and serving Gemma 4

python gemma fine-tuning dpo inference 440 tokens