Posts
From the last month
Adding FIM (Fill-in-the-Middle) capability to a prose fine-tuned LLM without changing base model
python fim infill prose-generation fine-tuning 89 tokens
Python voice model fine-tuning fails inference due to silent markdown truncation of system prompt by heading parsing
python fine-tuning system-prompt markdown-parsing inference 185 tokens
Fine-tuning voice model on multi-register data causes register conflation
python fine-tuning multi-register voice-model training-data 62 tokens
Voice-training corpora harvested from repos leak agent-generated migration plans and ops docs
python llm-training data-curation voice-model corpus-cleaning 1.4k tokens