Llm Catalog

@stateless/llm-catalog v2026.06.23.1 · 4d ago · MODELS
01 README

A sourced, versioned knowledge base of LLM-ops knowledge. It is NOT a control plane for running models; it is the structured, CEL-queryable record that running extensions read to decide which model to run where, with what settings, and at what cost. There are six subject kinds (model, runtime, provider, hardware, technique, endpoint), all sharing one uniform entry shape (open kind + open facets, like @stateless/inventory). A model's run-options are embedded on the model in runsOn[], keyed by endpoint, and capacity and plan answer "what to run where" over them. The twist: the catalog records external, decaying knowledge, so every volatile assertion carries a provenance envelope (asOf, source, versionPins, supersededBy). "Best vLLM version", for example, is derived as the latest un-superseded claim and is never stored as a bare field that rots. The catalog is declarative: apply materialises one entry resource per id (re-running it builds up a versioned trend), and prune soft-retires undeclared entries.
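The provenance idea above can be illustrated with a minimal TypeScript sketch. Only the envelope field names (asOf, source, versionPins, supersededBy) come from the README; the `Claim` wrapper, its `id` and `value` fields, the `latestLiveClaim` helper, and the sample version strings are hypothetical and are not the package's actual types or data.

```typescript
// Envelope field names are taken from the README; their types are assumed.
interface Provenance {
  asOf: string;                          // ISO date the claim was observed (assumed format)
  source: string;                        // where the claim came from
  versionPins?: Record<string, string>;  // versions the claim was checked against (assumed shape)
  supersededBy?: string;                 // id of the claim that replaces this one (assumed to be an id)
}

// Hypothetical wrapper: one volatile assertion plus its envelope.
interface Claim<T> {
  id: string;
  value: T;
  provenance: Provenance;
}

// Derive the "current" answer: the most recent claim that nothing has superseded.
// ISO date strings sort chronologically under plain string comparison.
function latestLiveClaim<T>(claims: Claim<T>[]): Claim<T> | undefined {
  return claims
    .filter((c) => c.provenance.supersededBy === undefined)
    .sort((a, b) => b.provenance.asOf.localeCompare(a.provenance.asOf))[0];
}

// Made-up sample data for illustration only.
const claims: Claim<string>[] = [
  { id: "c1", value: "0.1.0", provenance: { asOf: "2026-05-01", source: "release notes", supersededBy: "c2" } },
  { id: "c2", value: "0.2.0", provenance: { asOf: "2026-06-01", source: "release notes" } },
  { id: "c3", value: "0.3.0", provenance: { asOf: "2026-06-10", source: "forum post", supersededBy: "c2" } },
];

const best = latestLiveClaim(claims);
console.log(best?.value);
```

In this sketch a superseded claim is kept rather than deleted, so the history stays queryable while the derived answer always skips it.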

02 Models (1)
llm_catalog.ts v2026.06.23.1
fn apply()
Materialise each declared entry as an `entry` resource (one per id).
fn update()
Pull the public catalog (assembled catalog.json) from `catalogUrl` and
fn reconcile()
Gather every run-option for a target model — across the
fn sync()
Refresh gateway prices: fetch a provider pricing feed (default
fn ingest()
Draft a model entry by fetching a HuggingFace config.json (a structured,
fn contribute()
Sanitise selected declared entries into generic, public-shaped
fn prune()
Reconcile: soft-prune stored `entry` resources no longer present in
fn capacity()
Resolve an inference-capacity INTENT (task + host + co-resident
fn plan()
Deployment planner: given a SET of concurrent workloads (each with a

Resources

entry: A declared catalog entry, a subject (model, runtime, provider,
contribution: A sanitised, public-shaped entry produced by `contribute`, the
reconciliation: A side-by-side comparison of every run-option for one target
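The declarative cycle described in the README (apply materialises one `entry` per id, re-runs add versions, prune soft-retires what is no longer declared) might look roughly like the following in-memory sketch. Everything here is an assumption made for illustration: the `Entry` and `StoredEntry` shapes, the `Store` map, and the `applyEntries` and `pruneEntries` helpers are hypothetical and do not reflect the package's real storage or signatures. Clearing the retired flag on re-apply is also an assumed behaviour.

```typescript
// Hypothetical entry shape: "open kind + open facets" per the README.
interface Entry {
  id: string;
  kind: string;                     // e.g. "model", "runtime", "provider"
  facets: Record<string, unknown>;  // free-form facets
}

// Hypothetical stored form: keeps every applied version and a soft-retire flag.
interface StoredEntry {
  versions: Entry[];
  retired: boolean;
}

type Store = Map<string, StoredEntry>;

// Each run appends a new version per declared id, so re-running builds a history.
function applyEntries(store: Store, declared: Entry[]): void {
  for (const entry of declared) {
    const stored = store.get(entry.id) ?? { versions: [], retired: false };
    stored.versions.push(entry);
    stored.retired = false; // assumption: re-declaring an entry revives it
    store.set(entry.id, stored);
  }
}

// Soft-retire stored entries that are absent from the declared set; nothing is deleted.
function pruneEntries(store: Store, declared: Entry[]): string[] {
  const declaredIds = new Set(declared.map((e) => e.id));
  const retiredNow: string[] = [];
  store.forEach((stored, id) => {
    if (!declaredIds.has(id) && !stored.retired) {
      stored.retired = true;
      retiredNow.push(id);
    }
  });
  return retiredNow;
}

// Usage with made-up ids: two runs of apply, then a prune against the second declaration.
const store: Store = new Map();
const modelA: Entry = { id: "model-a", kind: "model", facets: {} };
const runtimeB: Entry = { id: "runtime-b", kind: "runtime", facets: {} };

applyEntries(store, [modelA, runtimeB]);
applyEntries(store, [modelA]);
const retired = pruneEntries(store, [modelA]);
console.log(retired);
```

Keeping retired entries in the store, instead of removing them, is what makes the retirement "soft": the version history remains available for later comparison.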
03 Previous Versions (6)
2026.06.22.3 · Jun 22, 2026
2026.06.22.2 · Jun 22, 2026
2026.06.22.1 · Jun 22, 2026

Modified 1 model

2026.06.21.1 · Jun 20, 2026

Modified 1 model

2026.06.19.2 · Jun 19, 2026
2026.06.19.1 · Jun 19, 2026
04 Stats
Score: A (100 / 100)
Downloads: 0
Archive size: 67.9 KB
  • Has README or module doc: 2/2 earned
  • README has a code example: 1/1 earned
  • README is substantive: 1/1 earned
  • Most symbols documented: 1/1 earned
  • No slow types (deprecated): 1/1 earned
  • Dependencies pass trust audit: 2/2 earned
  • Has description: 1/1 earned
  • Platform support declared (or universal): 2/2 earned
  • License declared: 1/1 earned
  • Verified public repository: 2/2 earned
05 Platforms
06 Labels