Resource-leak test failures on main: extension_rubric_scorer_test and worker_gateway_test

context.readModelData(modelName, specName) produces different results depending on whether the method is invoked manually (swamp model method run) or within a workflow. This makes manual runs unreliable for debugging workflow behavior.

Manual run: readModelData returns ALL historical data for the source model (no workflowRunId available → no scoping)
Workflow run: readModelData is scoped to data produced by the current workflow run (via workflowRunId tag filtering in raw_execution_driver.ts lines 138-140)

This means a method that works correctly in a workflow can produce wildly different (and incorrect) results when run manually for debugging.

Concrete Example

anime-source has 27 configured shows. search_configured produces 182 episodes per run.

# Manual run — returns 921 items (all historical data, including removed shows)
swamp model method run dedup filter --input sourceModel=anime-source
→ Read 921 episodes from "anime-source"
→ 304 "new" episodes (many are false positives from orphaned data)

# Workflow run — returns 182 items (current run only)
swamp workflow run discover-and-download
→ Read 182 episodes from "anime-source"
→ correct dedup results

The 921 items include data from shows that were removed from the config months ago (e.g., "Dark Gathering" removed from globalArgs, but its data persists with lifetime: infinite). This orphaned data is invisible in workflow runs but pollutes manual runs.
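One way to see how much of an unscoped read is orphaned is to partition the returned episodes by the currently configured show list. This is a minimal sketch only: the `Episode` shape, the `show` field, the `partitionByConfig` helper, and the "Show A" placeholder are assumptions for illustration and do not come from swamp or the extension.

```typescript
// Hypothetical shape: the issue does not show the real episode schema.
interface Episode {
  show: string;
  title: string;
}

interface Partition {
  current: Episode[];
  orphaned: Episode[];
}

// Split episodes into those whose show is still configured and those
// left behind by shows that were removed from the config.
function partitionByConfig(episodes: Episode[], configuredShows: string[]): Partition {
  const configured = new Set(configuredShows);
  const current: Episode[] = [];
  const orphaned: Episode[] = [];
  for (const ep of episodes) {
    (configured.has(ep.show) ? current : orphaned).push(ep);
  }
  return { current, orphaned };
}

const episodes: Episode[] = [
  { show: "Dark Gathering", title: "Episode 1" }, // show removed from config
  { show: "Show A", title: "Episode 1" }, // placeholder show name
  { show: "Show A", title: "Episode 2" },
];

const { current, orphaned } = partitionByConfig(episodes, ["Show A"]);
console.log(current.length, orphaned.length); // 2 1
```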

Why This Matters

You can't debug workflows with manual runs. The primary way to test a model method is swamp model method run. If it returns different data than the workflow, you're debugging a different system.
False confidence in fixes. A dedup fix that looks correct in manual testing may behave completely differently in the workflow (or vice versa). We spent significant time chasing dedup bugs that only manifested in one invocation context.
No way to opt into scoping manually. There's no --scope-to-latest-run flag or equivalent. Manual runs always get the unscoped path.

Current Implementation

In raw_execution_driver.ts:

const workflowRunId = this.context.tagOverrides?.["workflowRunId"];
const readModelData = (modelName: string, specName?: string) =>
  dataAccessService.readModelData(modelName, specName, workflowRunId);

When workflowRunId is undefined (manual run), readModelData returns everything. When it is set (workflow run), results are filtered by the workflowRunId tag.
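The effect of the optional third argument can be reproduced with a small in-memory stand-in. This is a sketch under stated assumptions, not the real dataAccessService: the `DataRecord` shape, the `store` array, and the run ids are hypothetical, and only the "filter by the workflowRunId tag when one is supplied" rule is taken from the description above.

```typescript
// Hypothetical in-memory stand-in; the real data access service is not shown in this issue.
interface DataRecord {
  modelName: string;
  specName: string;
  name: string;
  tags: Record<string, string>;
}

function readModelData(
  store: DataRecord[],
  modelName: string,
  specName?: string,
  workflowRunId?: string,
): DataRecord[] {
  return store.filter((r) =>
    r.modelName === modelName &&
    (specName === undefined || r.specName === specName) &&
    // With no run id (manual run) this clause is always true, so nothing is scoped.
    (workflowRunId === undefined || r.tags["workflowRunId"] === workflowRunId)
  );
}

const store: DataRecord[] = [
  { modelName: "anime-source", specName: "episode", name: "dark-gathering-01", tags: { workflowRunId: "run-1" } },
  { modelName: "anime-source", specName: "episode", name: "show-a-01", tags: { workflowRunId: "run-2" } },
  { modelName: "anime-source", specName: "episode", name: "show-b-01", tags: { workflowRunId: "run-2" } },
];

// Manual run: no run id, so every historical record comes back.
const manual = readModelData(store, "anime-source", "episode");
// Workflow run: only records tagged with the current run id come back.
const scoped = readModelData(store, "anime-source", "episode", "run-2");
console.log(manual.length, scoped.length); // 3 2
```

The same call therefore sees a different data set depending only on whether a run id happens to be present in the calling context.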

Proposed Solution

readModelData should behave consistently regardless of invocation context. Options:

Default to latest execution's output — when no workflowRunId is available, scope to the source model's most recent method output instead of returning all historical data
Add a CLI flag — swamp model method run ... --scope-to-latest to simulate workflow scoping during manual runs
Always scope by default — return only the latest version of each unique data name, with an explicit opt-in for historical data

Any of these would make manual runs trustworthy for debugging.
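To make the difference between the first and third options concrete, here is a minimal sketch of both filters over an in-memory list. The `StoredRecord` shape and its field names (`version`, `executionId`, `writtenAt`) are assumptions for illustration; the issue does not say how swamp stores versions or execution ids.

```typescript
// Hypothetical record shape; field names are assumptions for illustration.
interface StoredRecord {
  name: string; // unique data name
  version: number; // increases with each write of the same name
  executionId: string; // id of the method execution that wrote it
  writtenAt: number; // epoch milliseconds
}

// Option 1: keep only records written by the most recent execution.
function latestExecutionOnly(records: StoredRecord[]): StoredRecord[] {
  if (records.length === 0) return [];
  const newest = records.reduce((a, b) => (b.writtenAt > a.writtenAt ? b : a));
  return records.filter((r) => r.executionId === newest.executionId);
}

// Option 3: keep the highest version of each unique data name.
function latestPerName(records: StoredRecord[]): StoredRecord[] {
  const latest = new Map<string, StoredRecord>();
  for (const r of records) {
    const seen = latest.get(r.name);
    if (seen === undefined || r.version > seen.version) latest.set(r.name, r);
  }
  return Array.from(latest.values());
}

const allRecords: StoredRecord[] = [
  { name: "dark-gathering-01", version: 1, executionId: "exec-1", writtenAt: 1000 },
  { name: "show-a-01", version: 1, executionId: "exec-1", writtenAt: 1000 },
  { name: "show-a-01", version: 2, executionId: "exec-2", writtenAt: 2000 },
  { name: "show-a-02", version: 1, executionId: "exec-2", writtenAt: 2000 },
];

const byExecution = latestExecutionOnly(allRecords).map((r) => r.name);
const byName = latestPerName(allRecords).map((r) => r.name);
console.log(byExecution); // [ "show-a-01", "show-a-02" ]
console.log(byName); // [ "dark-gathering-01", "show-a-01", "show-a-02" ]
```

In this toy data the per-name filter still returns the record for the removed show, because its latest version is never superseded. If the real store behaves similarly, the third option on its own would not hide orphaned data, whereas scoping to the latest execution would.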

Environment

swamp version: 20260206.200442.0
Extension: @keeb/mms/dedup calling readModelData("anime-source", "episode")

#1020 — closed as not-a-bug (findBySpec run-scoped, but same inconsistency exists)
#966 — forEach data.findBySpec resolves empty when data written by prior job
#914 — context.readModelData feature request

Automoved by swampadmin from GitHub issue #1113

02 Bog Flow

Closed

4/17/2026, 8:48:46 PM


03 Sludge Pulse

stack72 assigned stack72 4/17/2026, 8:44:20 PM

stack72 commented 4/17/2026, 8:48:45 PM

Closing as already-fixed by #1145 (commit d9562498, merged 2026-04-08).

That PR removed all hidden workflowRunId scoping from readModelData, findBySpec, and queryData. Current behavior: manual runs and workflow runs both return all data — the inconsistency described here no longer exists.

Relevant code:

src/domain/drivers/raw_execution_driver.ts:139-140 — readModelData is called without workflowRunId.
src/domain/data/data_access_service.ts:105-117 — signature is readModelData(modelName, specName?); no scoping.
src/domain/data/data_access_service_test.ts:399, :480 — tests assert all data is returned regardless of workflowRunId.

Note: the underlying concern about orphaned data (e.g. removed shows persisting with lifetime: infinite) still exists, but now affects both contexts equally rather than causing a manual-vs-workflow divergence. If a --scope-to-latest-run flag or similar debugging affordance is still wanted, please file a fresh feature request — the proposed solutions in this issue conflict with #1145's explicit "remove hidden scoping" design direction.
