Replies: 2 comments
-
Deep dive: token usage of the 3 methodsToken cost is not one number — it splits into four buckets that behave very differently:
All figures below at ~4 chars/token. The current numbers are measured from this repo; Spec-Kit and BMAD are estimated from each method's structure (no install measured). 1. Static / always-on footprintCurrent — measured, loaded every turn:
Note: the SuperClaude global chain (
➡️ On always-on cost the three are roughly equal. Spec-Kit is marginally the leanest. 2. Per-feature working setCurrent — measured unit costs:
Realistic current per-feature method overhead: 1 subagent (~900) + 1–2 ADRs (~3K) + 1 issue (~0.5K) ≈ 4–6K. The dominant risk is the 72K monolithic planning doc — a single full read of it costs more than an entire Spec-Kit or BMAD feature cycle.
3. Generated artifacts (output tokens — the expensive kind)
Output tokens dominate cost, so this bucket matters most: Current ≪ Spec-Kit ≲ BMAD. 4. Cache behaviour
Bottom line
* excludes the 72K planning-doc full-read risk. Takeaways
This reinforces the main recommendation: adopt Spec-Kit's loop selectively for substantial net-new features (pay the 3–5× only where the planning rigor earns it), keep the lightweight ADR path for everything else, and avoid BMAD's always-on persona overhead. Current figures measured from this repo on 2026-05-17; Spec-Kit/BMAD figures estimated from each method's documented structure. |
Beta Was this translation helpful? Give feedback.
-
Re-evaluation after splitting the 72K planning docDone — What changed (measured)
The index keeps what's always relevant — the GitHub Issues table, the phase-status summary, and the ADR index (25 ADRs). Per-phase narrative detail moved to four archive files, each independently readable. (The ~3.5K net shrink is the deleted 200-line in-doc TOC.) Re-evaluated per-feature token cost — "Current" method
The landmine is defused. The catastrophic 72K single-read is gone: worst case is now ~26K, typical ~12K. More importantly the cost is now predictable — the index is small enough (~8K) that reading it whole every feature is the sensible default, and Updated comparison table
Note the multipliers compressed (Spec-Kit was ~3–5×, BMAD ~5–9×) — not because those methods got cheaper, but because the Current baseline is now an honest, stable ~12K instead of an artificially-low 4–6K that ignored the 72K risk. This is the apples-to-apples number. Revised conclusionThe original recommendation stands and is now stronger:
Recommendation unchanged: keep the ADR-native workflow as the default (now genuinely lightweight and predictable), layer Spec-Kit's loop selectively on big features, and apply this same split discipline to any future doc that grows past ~15K tokens. Measured from the repo on 2026-05-17 after the split commit. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Why this discussion
We currently run a spec-/ADR-driven workflow (Knative/KEP-style governance):
docs/ADRs/— the architectural decision logdocs/planning-health-dataspace-v2.md) — 27+ phases, ~69 ADR cross-references.claude/agents/(4 subagents: architect, compliance-reviewer, security, tester),.claude/commands/(3 slash commands),.claude/rules/(api-conventions, code-style, testing),.claude/skills/, plusCLAUDE.mdThis issue evaluates whether to adopt — wholly or partially — one of two popular agentic methodologies:
The three approaches in brief
1. Current — ADR-native / KEP-style
Decisions captured as ADRs; a planning doc as roadmap; issues as the task queue; Claude behaviour shaped by hand-written
CLAUDE.md+ rules + scoped subagents. No formal per-feature spec → plan → tasks loop — features go straight from an issue/ADR to implementation.2. GitHub Spec-Kit (SDD)
Inverts the spec → "specs become executable blueprints." Seven-step loop, each step a slash command:
constitution → specify → clarify → plan → tasks → analyze → implementArtifacts land in
.specify/<feature-id>/(spec, research, plan, contracts, task list). Installs native slash commands (/speckit.specify,/speckit.plan, …) into the agent's command directory. Agent-agnostic (30+ agents incl. Claude Code).3. BMAD Method
Agentic agile framework: 12+ specialist agent personas (PM, Architect, Dev, UX, QA…) that collaborate, sometimes in "Party Mode." Scale-adaptive — adjusts planning ceremony from bug-fix to enterprise. 34+ workflows. Own installer (
npx bmad-method install); needs Node 20+, Python 3.10+,uv. Ships.claude-plugin/integration.Side-by-side
CLAUDE.md+ rulesspecify init, slash cmds)Where each shines / hurts for this project
This is a mature codebase — Phase 27, 25 ADRs, 5,300+ graph nodes, deployed to Azure. Both alternatives are designed to add the most value at project inception, which we are well past.
Spec-Kit
specify → plan → tasksloop fills our real gap: we have no formalised per-feature spec/plan step. It would have caught the "0 Participant Profiles" issue earlier (spec would have stated the Neo4j-fallback contract)..claude/commands/— same mechanism we already use.CLAUDE.md+.claude/rules/already do.BMAD
Recommendation
Adopt Spec-Kit's per-feature loop selectively; keep ADRs; do not adopt BMAD wholesale.
specify → plan → tasksloop for net-new features (e.g. the open issues [FEATURE] Cross-Participant Dataset Discovery - "Find all diabetes datasets across German hospitals" #8, DCP BusinessWallet + EUDI Wallet Sandbox: hybrid patient/B2B credential stack (Option B) #22, GesundheitsID OIDC RP via gematik open-source stack (Option C) #24). It plugs the one genuine gap in our workflow and integrates as plain slash commands.CLAUDE.mdalready is ours.CLAUDE.md.Easiest to use in Claude Code?
Spec-Kit, clearly. It installs as native slash commands into the same
.claude/commands/directory we already use, is agent-agnostic, and needs no extra runtime. BMAD brings its own plugin system, installer, and Node+Python+uvtoolchain — more friction, more overlap with what we already have tuned.Open questions
.specify/or fold intodocs/?/speckit.analyze-style gate, or stay manual?Comparison drafted with Claude Code; figures from each project's README as of 2026-05.
Beta Was this translation helpful? Give feedback.
All reactions