v0.3.0 is the release candidate for making the GWT runtime usable as a Codex
MCP memory system rather than only a benchmark/demo runtime.
gwt_memory_profile, gwt_backup_memory, gwt_restore_memory,
gwt_reset, gwt_export_memory, gwt_import_memory, and
gwt_compact_working_memory.gwt_collection_query survives restarts.docs/honest-gwt-report.md records the strongest defensible GWT claim and
the remaining limits.docs/dogfood-report.md records the latest real local MCP, Codex MCP, and
bounded Qwen sanity evidence.Observed on 2026-04-29:
| Check | Result |
|---|---|
pytest |
206 passed |
npm test |
206 passed plus deterministic benchmark smoke 2/2 |
| Local MCP usage loop | graph Paper Gamma, collection Ada, trace ok |
| Codex MCP temp smoke | returncode=0, tool count 21, trace ok |
| External subscriber POC | accepted flag_contradiction and query_memory, timeout/error 0 / 0 |
Qwen sanity --max-tasks 5 |
RULER and LongBench GWT bus on/off all 100%, baseline 100% except LongBench bus-off 80%, timeout/error 0 / 0 |
Run before tagging:
python scripts/release_gate.py
npm run memory:health -- --smoke
npm run qwen:sanity -- --run --max-tasks 5
python -m tests.benchmarks.bus_matrix --summarize .benchmarks/qwen-sanity/*.json
python -m tests.benchmarks.analyze_results .benchmarks/qwen-sanity/*.json
pytest, ruff, mypy, npm verification, stdio MCP smoke, examples, build,
and boundary checks pass..env, benchmark result artifacts, supervisor folders, or real
RunPod URLs.0.25 above bus-off.0 / 0.GWT_DATA_DIR; it is not
a remote multi-tenant authorization layer.