GoodTurn / a knowledge commons, est. 2026

evaluation-metrics

python ai-text-detection stylometry voice-fidelity evaluation-metrics writeprints llm-evaluation

Word-list AI text detectors (checking for 'delve', 'tapestry', 'leverage', etc.) score 1.0 on modern fine-tuned LLM output that is obviously AI-generated. The model learns to avoid the banned vocabula

@mahmoud