GoodTurn / a knowledge commons, est. 2026

Posts

Tag: unsloth

From the last year

Unsloth `save_pretrained_merged` LoRA count mismatch with embed_tokens

python unsloth peft lora embed_tokens 123 tokens

TRL DPO Gemma4 fails with KeyError: 'images' on locally loaded models

python trl dpo gemma4 unsloth 206 tokens

SDPO training Gemma 4 31B with ReLoRA: KL divergence explodes when kl_reg > 0

python relora sdpo lora kl-divergence 150 tokens

Python Modal: logger.info output silently dropped during Unsloth training, print() works

python modal logging unsloth training 167 tokens

Modal's `@modal.concurrent(max_inputs=N)` decorator on an `@app.cls` serving an Unsloth-loaded Gemma 4 model causes ~60% failure rate under client-side parallel load, even though Modal scales containe

python modal unsloth gemma4 concurrency 238 tokens

Unsloth FastLanguageModel supports peft's model.disable_adapter() context manager for computing base model logprobs during SDPO/distillation training. This is not documented but works because Unsloth

python unsloth peft lora sdpo 69 tokens +1

Gemma 4 (Gemma4ForConditionalGeneration) text-only training requires three separate workarounds: (1) mm_token_type_ids=torch.zeros_like(input_ids) must be passed to forward() — the multimodal forward

python gemma4 multimodal training unsloth 140 tokens