Add swamp data delete command to remove a data artifact by model and name

The docker driver currently runs each step in an isolated container. This is great for provisioning-style workflows where steps are independent, but it makes CI-style workflows painful, because they share filesystem state (checkouts, installed dependencies, build artifacts) across multiple steps.

Concrete example: we're building a multi-model-eval workflow that mirrors a GitHub Actions workflow for evaluating skill triggers across multiple LLMs. The workflow structure is:

1. checkout: clone the swamp repo
2. setup-npm: run npm install in evals/promptfoo/
3. run-evals: run 4 parallel evals (forEach over models) against the checkout
4. cleanup: remove the checkout

All four eval steps in step 3 need to read the same checkout and share the same node_modules tree. In raw mode this "just works" because everything runs on the host filesystem. In docker mode, each step is a fresh container with no shared state, so we have to:

1. Explicitly configure a volume mount in driverConfig.volumes
2. Use identical host and container paths (e.g., /tmp/swamp-eval-workspace:/tmp/swamp-eval-workspace) so the same path string is valid in both raw and docker modes
3. Store the clone path as a data artifact so downstream steps can look it up via data.latest('swamp-repo', 'repository').attributes.path
4. Add a dedicated setup-npm job that runs once before the parallel evals to populate the shared volume with node_modules (otherwise 4 parallel npm install runs race against each other)
5. Add a cleanup job to remove the checkout when done
6. Worry about host/container path parity, volume lifecycle, and npm cache persistence
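
The path-parity part of this workaround can be sketched as a small helper. This is a minimal sketch, not swamp code: `parityMount` and `hasPathParity` are hypothetical names, and the `hostPath:containerPath` string shape is assumed to follow Docker's usual bind-mount syntax. How driverConfig.volumes actually parses its entries is not confirmed here.

```typescript
// Hypothetical helpers illustrating host/container path parity.
// Assumption: mounts are Docker-style "hostPath:containerPath[:options]" strings.

/** Build a mount spec whose host and container paths are identical. */
function parityMount(workspacePath: string): string {
  if (!workspacePath.startsWith("/")) {
    throw new Error(`workspace path must be absolute: ${workspacePath}`);
  }
  return `${workspacePath}:${workspacePath}`;
}

/** True when the same path string is valid on the host and in the container. */
function hasPathParity(mount: string): boolean {
  const [hostPath, containerPath] = mount.split(":");
  return hostPath !== undefined && hostPath.length > 0 &&
    hostPath === containerPath;
}

const mount = parityMount("/tmp/swamp-eval-workspace");
console.log(mount, hasPathParity(mount));
```

A mount such as /tmp/a:/work fails the parity check, which is the case where a path stored in a data artifact by a raw-mode step would not resolve inside a docker-mode step.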

Every CI-shaped workflow that uses the docker driver will have to reinvent this pattern. It's a lot of ceremony for something that should be "give me a shared working directory."

What we'd like

A first-class "workspace" concept for workflows. One of the following, in order of preference:

Option A: Workflow-level workspace primitive

Each workflow run gets an automatically-provisioned working directory that's mounted into every step's container at a stable path. Lifecycle is tied to the workflow run — created on start, cleaned up on end (unless --preserve-workspace is passed for debugging). Referenced in CEL as workspace.path.

workspace:
  enabled: true
  persistent: false   # optional: survive between runs for caching

jobs:
  - name: checkout
    steps:
      - name: clone
        task:
          type: model_method
          modelIdOrName: swamp-repo
          methodName: clone
          inputs:
            # workDir defaults to workspace.path

This eliminates:

- Manual driverConfig.volumes config
- Host/container path parity hacks
- Manual cleanup jobs
- Storing workdir paths as data artifacts purely for path propagation
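
The lifecycle proposed in Option A (create on start, clean up on end unless the workspace is preserved) could look roughly like the sketch below. Everything here is hypothetical: `runWithWorkspace`, `WorkspaceHooks`, and `preserveWorkspace` are illustrative names standing in for whatever swamp would actually implement, and the hooks are in-memory fakes so the example runs without touching the filesystem.

```typescript
// Hypothetical sketch of the Option A lifecycle; none of these names exist in swamp.

interface WorkspaceHooks {
  create(): string; // provision a directory and return its path
  remove(path: string): void; // delete the directory
}

interface RunOptions {
  // Stands in for the proposed --preserve-workspace flag.
  preserveWorkspace: boolean;
}

/** Provision a workspace, run the workflow body, then clean up unless preserved. */
function runWithWorkspace<T>(
  hooks: WorkspaceHooks,
  options: RunOptions,
  run: (workspacePath: string) => T,
): T {
  const workspacePath = hooks.create();
  try {
    return run(workspacePath);
  } finally {
    // Cleanup also happens when a step throws, so failed runs do not leak directories.
    if (!options.preserveWorkspace) hooks.remove(workspacePath);
  }
}

// In-memory hooks that only record what happened.
const events: string[] = [];
const fakeHooks: WorkspaceHooks = {
  create: () => {
    events.push("create");
    return "/workspace/run-1";
  },
  remove: (path) => {
    events.push(`remove ${path}`);
  },
};

const seenPath = runWithWorkspace(
  fakeHooks,
  { preserveWorkspace: false },
  (path) => {
    events.push(`step sees ${path}`);
    return path;
  },
);
console.log(seenPath, events);
```

Putting the cleanup in a `finally` block is one way to tie the directory's lifetime to the run rather than to a dedicated cleanup job.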

Option B: Session-mode docker driver

Instead of one container per step, one container per workflow run. Steps execute as sub-operations inside the same long-lived container. State naturally persists without volume mounts. Parallel steps run as concurrent operations within the container.

driver: docker
driverConfig:
  mode: session    # vs. "per-step" (current default)
  image: ghcr.io/systeminit/swamp-eval-runner:latest

This matches how GitHub Actions / GitLab CI actually work (one runner hosts the entire job) and matches what users intuitively expect from "CI in a container." Per-step mode stays available for the current use cases.
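
The difference between the two modes can be illustrated by how steps map to containers. This is a toy model under stated assumptions, not the docker driver's logic: `assignContainers` and the container labels are hypothetical, and only the mode names "per-step" and "session" come from the proposal above.

```typescript
// Toy model of step-to-container assignment; not the actual docker driver.

type DriverMode = "per-step" | "session";

/** Map each step name to the (hypothetical) container label it would run in. */
function assignContainers(
  stepNames: string[],
  mode: DriverMode,
): Map<string, string> {
  const assignment = new Map<string, string>();
  for (const step of stepNames) {
    // Session mode reuses one long-lived container; per-step mode creates one each.
    const container = mode === "session"
      ? "run-container"
      : `container-for-${step}`;
    assignment.set(step, container);
  }
  return assignment;
}

const stepNames = ["checkout", "setup-npm", "run-evals", "cleanup"];
const perStepCount = new Set(assignContainers(stepNames, "per-step").values()).size;
const sessionCount = new Set(assignContainers(stepNames, "session").values()).size;
console.log({ perStepCount, sessionCount });
```

With four steps, per-step mode yields four distinct containers and session mode yields one, which is why filesystem state persists in session mode without any volume mounts.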

Option C: Step output files as implicit inputs

A lighter-weight version: a step can declare "this file or directory is my output," and swamp makes it available at the same path in downstream steps' containers. Similar to GHA's upload-artifact/download-artifact but automatic based on step dependencies.

steps:
  - name: install-deps
    task: ...
    outputs:
      - path: node_modules
        makeAvailableTo: [run-evals]
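
Resolving which paths a downstream step receives could work along these lines. This is a sketch assuming the declaration shape shown in the YAML above: the `StepDecl` and `StepOutput` types and the `inputsFor` function are hypothetical, and the sketch only computes the list of paths, not how swamp would copy or mount them.

```typescript
// Hypothetical resolution of implicit inputs from declared step outputs.

interface StepOutput {
  path: string;
  makeAvailableTo: string[]; // names of steps that should see this path
}

interface StepDecl {
  name: string;
  outputs?: StepOutput[];
}

/** Collect every declared output path that targets the given step. */
function inputsFor(stepName: string, steps: StepDecl[]): string[] {
  const paths: string[] = [];
  for (const step of steps) {
    for (const output of step.outputs ?? []) {
      if (output.makeAvailableTo.includes(stepName)) {
        paths.push(output.path);
      }
    }
  }
  return paths;
}

const declaredSteps: StepDecl[] = [
  {
    name: "install-deps",
    outputs: [{ path: "node_modules", makeAvailableTo: ["run-evals"] }],
  },
  { name: "run-evals" },
];

console.log(inputsFor("run-evals", declaredSteps));
```

Because the producer names its consumers explicitly, a step that is not listed in makeAvailableTo receives nothing, which keeps sharing opt-in.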

Why this matters

CI is a first-class use case. swamp is pitching itself as a general automation framework. Multi-step CI workflows with shared state are one of the most common automation patterns.
The current workarounds leak driver-specific concerns into workflow YAML. Users have to know that docker steps are isolated containers and plan around it. A workspace primitive abstracts this away.
The workarounds don't compose. If another workflow wants a similar pattern, it has to re-solve volume mounts, path parity, setup steps, and cleanup from scratch. That's a sign we're missing an abstraction.
It unblocks parallelism. Right now we can run 4 parallel eval steps, but only after carefully engineering around shared state. A workspace primitive or session mode makes parallelism the default, not a puzzle.

Concrete reference

The full multi-model-eval workflow and extension code are in this repo at:

workflows/workflow-8a88a569-4620-431c-9028-643df0118c72.yaml
extensions/models/ci_git.ts
extensions/models/ci_promptfoo_eval.ts
extensions/reports/ci_eval_analysis.ts
extensions/reports/ci_eval_result.ts

It's a complete, working example of the workarounds described above, in case it's useful to look at when designing the primitive.

02 Bog Flow

Triaged

4/11/2026, 10:09:26 PM

03 Sludge Pulse

stack72 assigned stack72 4/11/2026, 10:07:19 PM

bixu commented 4/21/2026, 1:05:00 PM

I'm concerned about the security implication of B and especially A. C is a bit more work but feels safer (explicit opt-in).
