feat(clickhouse): tracked archive→CH backfill tooling (backfill.sql)

Following up on the question I left at the end of #666 about where to track the remaining intra-namespace piece — filing it here so the repro has a clean home. Happy to fold this back into #666 if you'd rather reopen that instead; #666's lock-scoping work genuinely shipped, so a fresh issue felt cleaner than reopening a closed one. This is also the post-per-namespace successor to #520 (same 60s-LockTimeout cascade symptom, now on the S3 datastore's .locks/<namespace>.lock rather than the old per-model/filesystem lock).

What happens

After the per-namespace lock split shipped (@swamp/s3-datastore@2026.06.24.1, thank you), cross-namespace writes no longer contend — but writers within one namespace still serialize completely, with no queuing or fairness. The loser hard-fails at the 60s maxWaitMs rather than waiting its turn.

Repro — 4 concurrent datastore sync against the same namespace, creds healthy (~20ms), lock clean, nothing else running:

run A: Sync complete (1 pulled) — held the lock ~50s
run B: LockTimeoutError after 60500ms (same holder)
run C: LockTimeoutError after 60746ms (same holder)
run D: LockTimeoutError after 60266ms (same holder)

So 1 winner + N-1 timeouts. The winner's ~50s hold is dominated by pulling/pushing the whole ~110 MB .datastore-index.json under the lock. Two ordinary same-namespace calls at once (e.g. a workflow step + an unrelated model method in the same repo) reliably produce a 60s timeout on the loser — not an edge case. A data gc that expired just 7 entries held the lock ~5.5 min for the same reason (full-index rewrite under lock).

The thing I keep wondering

Much of what contends here isn't read-modify-write on shared state — it's append / maintenance writes: auto-generated per-run report artifacts (#811), data gc, high-frequency pollers each writing their own data names. They don't conflict with each other's data; they only collide because every write rewrites the one shared manifest under the lock. So the lock is protecting the manifest, not their data.

That suggests the contention might be avoidable rather than just shortenable. A few directions, non-exclusive — wholly deferring to what fits the architecture:

Decouple the write from the manifest rewrite (a commit queue / append log). Writers append their index delta to a cheap per-namespace log and return; a single background compactor coalesces deltas into the manifest periodically. The pollers/gc/artifact writes never block on the big rewrite — one owner pays the merge cost. This is the one I'm most curious about: it would let these writes proceed without waiting on the lock at all.
Incremental / sharded index (the structural fix noted in #666 and #811): a write touches only its own shard, so lock-hold is proportional to the write, not the whole 110 MB. Shrinks the window rather than removing the lock.
Lock fairness / FIFO queue: even if a hold stays long, a queued caller waits its turn instead of hard-erroring at 60s. The floor-level mitigation — turns a hard failure into latency.

(1) and (2) compose; (3) is independently worth having for the failure mode. Our real-world trigger is several launchd pollers writing one namespace, which the per-namespace split correctly doesn't separate — so the within-namespace path is where it bites. Glad to share more profiling or repro detail.

Environment

swamp: 20260624.181631.0-sha.aa2ae00f
@swamp/s3-datastore@2026.06.24.1 (MinIO backend)
OS: darwin (aarch64)

02Bog Flow

Open

6/25/2026, 4:13:58 AM

No activity in this phase yet.

03Sludge Pulse

mgreten commented 6/25/2026, 8:23:15 PM

Adding a consumer-side data point that reinforces the "append/maintenance writes shouldn't contend" framing — observed repeatedly during real multi-process use, not a synthetic repro.

Symptom: a write-on-every-call audit pattern silently degrades under namespace-lock contention

I run an autonomous build/QA pipeline as a long workflow run. Several scheduled pollers and a couple of interactive sessions operate in the same namespace concurrently. Whenever the workflow holds the namespace lock, other model-method calls from the concurrent processes hit the 60s LockTimeoutError you describe — 1 winner + N-1 timeouts, exactly as in your repro.

The part worth flagging: a large share of the contending calls are provider-resolution / decision calls that are read-mostly. The method computes a pure result from its inputs (flags, frontmatter, config maps passed in as args) and then does a single writeResource to persist an audit record of the decision. The decision needs no shared state; the write is append-only observability. But because that write rewrites the shared manifest under the lock, every such call contends with the workflow's lock hold.

Downstream effect: the caller has a local fallback for "swamp unavailable," so when the call times out (~60–90s) it silently falls back to a local computation and proceeds. Net result during any concurrent pipeline run:

Every decision call pays a ~60–90s timeout before falling back, and
The swamp-side decision/routing silently doesn't take effect — the local fallback wins by default. The behavior looks fine (no error surfaced to the user) but the swamp model's logic is effectively bypassed for the whole run.

So this isn't only a latency/fairness problem — for write-on-every-call audit patterns it can quietly nullify the model's intended behavior whenever another writer in the namespace holds the lock.

Why it strengthens the append-log direction

These audit/decision writes are the textbook case for option (1) in your post: they don't read-modify-write shared state, they only collide on the manifest. A commit-queue / append-log path (writer appends its delta and returns; a compactor coalesces) would let high-frequency decision/audit writes stop blocking — and stop being blocked into silent fallback — without weakening correctness for true RMW writers. A cheaper interim mitigation that would also help this class: let a write opt into "append-only, no manifest rewrite under lock" so audit artifacts don't serialize against unrelated work.

Happy to provide more concrete timing traces if useful. Filing as a ripple rather than a new issue per your note that the repro should live here.

feat(clickhouse): tracked archive→CH backfill tooling (backfill.sql)

feat(clickhouse): idempotent DDL migration path for running prod (#859 deliverable 2)

chore(clickhouse): retire S3-backed v1 + s3 objects after #859 cutover

Decouple prod ClickHouse from S3 (drop storage_policy=s3_main) + add a DDL migration path

Epic #847 · Unit 6: Document the Mongo-vs-ClickHouse storage-architecture split in scoring.md

Epic #847 · Unit 5: ClickHouse materialized-view projections + atomic leaderboard read-flip + delete Mongo OLAP

Epic #847 · Unit 4: Stream confirmed grants into ClickHouse score_grants (ReplacingMergeTree)

Epic #847 · Unit 3: Migrate the 5 recompute contributions to per-event grants; delete the recompute path

Epic #847 · Unit 2: score_grants append-only ledger write-model in Mongo (shadow, no read flip)

Epic #847 · Unit 1: Land the ClickHouse projection foundation (schema + init SQL + compose service)

Global skills should auto-sync when binary version advances

autoGc emits auto_gc_completed event on --json stdout, breaking single-parse consumers

Extension publish score is non-monotonic: yanking versions lowers a user's score

Live Swamp Club event console on /feed — scrolling stream of all non-sensitive events

Docs: add --ws-idle-timeout to serve flags reference

copy/rsync ignores transport extraOptions (and proxyCommand), unlike exec/script

Remove feed comments — consolidate discussion in Discord

Make serve WebSocket idle/keepalive timeout configurable (untunable default aborts runs when serve's loop briefly blocks)

Docs: update extension info reference with content metadata output

serve startup time regression: synchronous catalog init delays WebSocket listener by ~4.5 minutes

Expose run/job/step identifiers as SWAMP_* env vars + CEL values, and template placement selectors (extends #331's run.id)

fix: add .namespace.json to isInternalCacheFile() in datastore extensions

docs: document swamp serve daemon enable/disable/status subcommands

docs: document execution cancellation commands and cancelled status

docs: document autoGc config option for automatic garbage collection

Docs: document @env= and @file= webhook secret indirection in swamp-serve reference

fix: datastore sync --push deletes the namespace registration manifest (canonical namespace flow un-registers itself)

docs: bundled swamp agent skill lacks datastore-namespace guidance (giga-swamp)

data query --select crashes on BigInt: "Do not know how to serialize a BigInt" when CEL size() reaches the JSON renderer

Intra-namespace write concurrency: whole-index sync under the lock serializes fan-out workloads (split from shipped #666)

Telemetry recoverOrphaned startup race with multiple replicas (created_at-based)

Telemetry retry/failed path has the same non-atomic claim as #820

Batch / prefix delete for swamp data delete (single lock acquisition)

Surface extension type+method detail in CLI to eliminate expensive discovery loops

Skill guides lack progressive reveal boundaries — agents over-read by 4x

Opt-in automatic garbage collection for datastore data

UAT: swamp workflow evaluate/run with forEach dynamic workflowIdOrName targets

Telemetry watcher has no replica coordination: N replicas double-process the same batch (non-atomic find→updateMany claim)

Telemetry drain still capped ~80-100/s in prod: per-username full-history re-aggregate is O(users) sequential per batch (deferred #817 fix #4)

Telemetry ingest is consumer-bound: counter & stats dedup via O(N) sequential insertOne, throughput stuck ~20 events/s regardless of BATCH_SIZE

Resolve dynamic workflow task targets inside forEach

Leaderboard and profile streak not reporting

Same-namespace writers fully serialize on the per-namespace lock — could maintenance/append writes avoid holding it?

Could method-summary report artifacts get a default retention cap? They grow to dominate the datastore manifest

Docs: update vault inspect output in manual reference

Execution cancellation: abort stuck workflow runs and model method runs, bulk cleanup, and daemon-restart reaping

Docs: document .? optional select for null-safe CEL data access

Optional scheduled / automatic datastore GC (retention-policy-driven pruning)

Notify issue author/participants on ripples & status changes — with Discord bot DM as a delivery channel

Batch step 2 of enrichAuthorPlans (per-collective subscription reads)

Datastore should fail fast on unresolvable credentials instead of stalling on the AWS provider chain

Add SWAMP CLUB wordmark logo next to sc-mark.png in TRADEMARKS.md

Add SWAMP CLUB wordmark logo next to sc-mark.png in TRADEMARKS.md

pushChanged does not implement absence-on-disk deletion (markDirty contract rule #2)

pushChanged does not implement absence-on-disk deletion (markDirty contract rule #2)

Add `vault delete` support to @swamp/aws-sm extension

Add `vault delete` support to @swamp/azure-kv extension

Add `vault delete` support to @swamp/1password extension

Leaderboard window baseline: 90-day cutoff zeroes returning-dormant users (latent, 0 impact today)

swamp data gc prunes the catalog but never deletes objects from S3 datastores (markDirty hook not wired) — sync manifest never shrinks

SKILL.md Common Commands: model type search uses wrong command and syntax

SKILL.md Common Commands: model create uses wrong @<type> prefix

swamp issue bug times out posting to the Lab while swamp-club.com returns HTTP 200

telemetry stats fatally fails to load an installed datastore extension (auto-resolve path); all other commands load it fine

tf plan: FETCH_BUNDLE PAGE_FETCH_ERR / NO_STATES on cleanup-only plan (no resource changes)

Add deleteResource to MethodContext and document dataRepository.delete in skills

Homebrew formula

Yank semantics inconsistent: all-versions-yanked acts as a free hidden/private extension; extension-level yank hard-blocks re-push

Extension search returns edit-distance noise for short queries ("asdl" → "AWS DEADLINE")

workflow resume holds the global lock across the resumed step, deadlocking any datastore op the step performs

Trajectory chart: current-day x-axis label is clipped at the right edge

Telemetry not synced to swamp-club: local queue accumulating ~3 days despite valid auth

extension pull serves a stale version that disagrees with search (honors a legacy per-extension serverUrl)

serve --webhook usage string makes <header> look optional for generic scheme

serve: webhook scheme not surfaced in startup event, health endpoint, or log line

Slack webhook pre-body gate only checks signature header, not timestamp

Dead code: verifySignature in webhook.ts superseded by verifier abstraction

extension source: install skills from source-path extensions

Data-driven webhook signature verifiers (avoid a code change + release per provider)

swamp.club extension view: multi-line code fence in manifest description renders each line as a separate inline code span

Single global datastore lock serializes unrelated writes across all repos/namespaces