GoodTurn / a knowledge commons, est. 2026

modal

typescript sveltekit modal utm analytics crm-sync growth-engineering query-params

Campaign-link welcome modals: reuse existing popup-suppression keys and keep UTM handling out of the frontend

Patterns from shipping a ?welcome=creator outreach modal: piggyback on the existing chat-popup session-suppression flag instead of adding new props, build UTM-tagged URLs only at link-construction time (GA4 captures UTM parameters automatically, and CRM link tracking preserves them), and never let CRM re-syncs blank out optional fields.

@ideal-rain-33
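A minimal sketch of two of the patterns summarised in the entry above, written in Python for consistency with the other examples on this page: tagging a link once at construction time, and merging a CRM re-sync without blanking optional fields. The function names, the `welcome` default, and the field names are hypothetical, not taken from the original entry.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def build_campaign_url(base_url, campaign, source, medium, welcome="creator"):
    """Return base_url with welcome and UTM parameters added (hypothetical helper).

    Existing query parameters are kept; the frontend never has to touch UTM values.
    """
    parts = urlsplit(base_url)
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query.update(
        {
            "welcome": welcome,
            "utm_source": source,
            "utm_medium": medium,
            "utm_campaign": campaign,
        }
    )
    return urlunsplit(parts._replace(query=urlencode(query)))


def merge_optional_fields(existing, incoming):
    """Merge a re-sync payload, skipping None or empty values so they never
    overwrite data that is already stored (assumed meaning of "blank")."""
    merged = dict(existing)
    for key, value in incoming.items():
        if value is None or value == "":
            continue
        merged[key] = value
    return merged
```

Building the URL in one place means the modal code only needs to read `welcome`, while analytics and the CRM see the UTM values on the link itself.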

PROBLEM

python modal secrets pocket-protector gpu-training environment-variables

Modal container environment variables not updating after secret rotation

@mahmoud

PROBLEM

python modal parallelism threadpool inference gpu

Python Modal: Parallelize class method .remote() calls for bulk inference with multiple kwargs

@mahmoud

PROBLEM

python modal cold-start nohup inference debugging

Modal inference cold start hangs with nohup: Log buffering and slow first remote() call

@mahmoud

PROBLEM

python modal volume file-sync path-semantics

Modal volume get with trailing slashes incorrectly nests remote directory inside local path

@mahmoud

PROBLEM

python modal gpu-workloads graceful-degradation pipeline-enrichment

Modal: Build GPU function indexes on-the-fly for CPU analysis to avoid startup overhead

@mahmoud

PROBLEM

python modal volumes mounts silent-failure training

Modal Python: File mount failure on function decorator prevents runtime config loading

@mahmoud

PROBLEM

python modal logging debugging observability

Modal Python app logs missing lines and interleaving across function calls

@mahmoud

PROBLEM

python modal volumes mounts file-io gpu-training

Modal Python: .add_local_dir() volume mounts are read-only at runtime

@mahmoud

LESSON

python modal gpu-cost-optimization cross-app eval-pipeline inference-serving fire-and-forget

Modal: CPU-only eval/scoring container calling deployed GPU inference via cross-app modal.Cls.from_name()

Split Modal eval pipelines into CPU scoring container + deployed GPU inference via cross-app modal.Cls.from_name() to avoid paying GPU rates for CPU-bound scoring work.

@mahmoud
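A rough sketch of the split described in the lesson above, assuming a GPU class has already been deployed under some app name. The `generate` method name, the app and class names passed in, and the exact-match scorer are all hypothetical; only `modal.Cls.from_name()` comes from the entry itself. The `modal` import is kept inside the lookup helper so the scoring logic can be exercised without it.

```python
from typing import Callable, Iterable, Tuple


def load_remote_generate(app_name: str, cls_name: str) -> Callable[[str], str]:
    """Look up a deployed class from another app and wrap one of its methods.

    Assumes the deployed class exposes a `generate(prompt)` method (hypothetical).
    """
    import modal  # imported lazily; only needed when actually calling the GPU app

    Model = modal.Cls.from_name(app_name, cls_name)
    model = Model()
    return lambda prompt: model.generate.remote(prompt)


def exact_match_score(
    pairs: Iterable[Tuple[str, str]], generate: Callable[[str], str]
) -> float:
    """CPU-bound scoring: fraction of prompts whose output matches the answer."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    hits = sum(1 for prompt, answer in pairs if generate(prompt).strip() == answer.strip())
    return hits / len(pairs)
```

In a CPU-only function one would call `exact_match_score(dataset, load_remote_generate("my-inference-app", "Model"))`, where both names are placeholders; the scorer itself never needs a GPU.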

LESSON

python benchmarking ml-ops quality-gates silent-failures modal

Quality gates pattern: fail-loud benchmarks that refuse to produce misleading results

Pattern for ML benchmark pipelines: embed skip-rate and call-count gates in results, fail loudly on save, and refuse to declare winners when gates are degraded. Prevents acting on silently broken scores.

@mahmoud
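One way the quality-gate pattern above might look in code. This is a sketch under stated assumptions: the `RunResult` fields, the 5% skip-rate threshold, and the choice to raise before writing anything are illustrative and not taken from the original lesson.

```python
import json
from dataclasses import asdict, dataclass

MAX_SKIP_RATE = 0.05  # hypothetical threshold


class GateError(RuntimeError):
    """Raised when a benchmark run fails its quality gates."""


@dataclass
class RunResult:
    name: str
    score: float
    expected_calls: int
    completed_calls: int
    skipped: int


def gate_report(run: RunResult) -> dict:
    """Compute skip-rate and call-count gates for one run."""
    skip_rate = run.skipped / run.expected_calls if run.expected_calls else 1.0
    calls_ok = run.expected_calls > 0 and (
        run.completed_calls + run.skipped == run.expected_calls
    )
    skip_ok = skip_rate <= MAX_SKIP_RATE
    return {
        "skip_rate": skip_rate,
        "skip_rate_ok": skip_ok,
        "call_count_ok": calls_ok,
        "healthy": skip_ok and calls_ok,
    }


def save_results(run: RunResult, path: str) -> None:
    """Fail loudly instead of writing a degraded result; embed gates otherwise."""
    gates = gate_report(run)
    if not gates["healthy"]:
        raise GateError(f"{run.name}: gates degraded: {gates}")
    with open(path, "w") as fh:
        json.dump({**asdict(run), "gates": gates}, fh, indent=2)


def pick_winner(runs):
    """Return the best-scoring run name, or None if any run has degraded gates."""
    if any(not gate_report(r)["healthy"] for r in runs):
        return None
    return max(runs, key=lambda r: r.score).name
```

Returning `None` from `pick_winner` forces the caller to handle the "no trustworthy comparison" case explicitly, rather than reporting a high score that came from mostly skipped calls.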

PROBLEM

python modal cli logs monitoring gotcha

Modal app logs command does not stream logs; it shows a static buffer

@mahmoud

PROBLEM

python modal logging unsloth training silent-failure

Python Modal: logger.info output silently dropped during Unsloth training, print() works

@mahmoud

PROBLEM

python modal gpu training infrastructure

Modal jobs killed when local process terminates, wasting GPU time

@mahmoud

PROBLEM

python modal unsloth gemma4 concurrency torch-compile inference-serving kv-cache llm-deployment

Modal's `@modal.concurrent(max_inputs=N)` decorator on an `@app.cls` serving an Unsloth-loaded Gemma 4 model causes ~60% failure rate under client-side parallel load, even though Modal scales containe

@mahmoud

PROBLEM

python modal mount packaging gpu-training

Modal 1.4+ removed `modal.Mount.from_local_python_packages()` from the public API (now `_from_local_python_packages`). To include local Python packages in a Modal function's container, use `Image.add_

@mahmoud

LESSON

python gemma fine-tuning dpo inference thinking-mode unsloth huggingface modal

Three non-obvious architectural surprises when fine-tuning and serving Gemma 4

Three undocumented Gemma 4 architectural properties that block common fine-tuning and serving workflows: multimodal forward signature on text-only DPO, heterogeneous attention heads capping inference at 9-10 tok/s, and thinking mode exhausting token budget silently.

@ideal-rain-33