chore(tests): regenerate stale automated test catalog on main #1804
markmhendrickson wants to merge 1 commit
main's catalog has been stale since #1797 merged tests/integration/sandbox_seed_token_bypass.test.ts without regenerating it. Every subsequent PR's `baseline` job fails `validate:test-catalog --check` as a result.

Committed as RAW generator output (NOT prettier-formatted): CI's `validate:test-catalog --check` runs the raw generator and compares, and `format:check` only globs src/**, so the catalog must match generator output exactly. Prettier-formatting it is what made prior regenerations still read as "stale".

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
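The commit message's point about raw versus formatted output can be illustrated with a minimal sketch. It assumes the check is an exact string comparison between fresh generator output and the committed file; the helper name `isCatalogStale` and the sample table content are hypothetical and are not taken from the repository's actual `validate:test-catalog` script.

```typescript
// Hypothetical helper, not the repo's real check: an exact comparison
// treats any byte-level difference as drift.
function isCatalogStale(generated: string, committed: string): boolean {
  return generated !== committed;
}

// Made-up generator output (illustrative only, not the real catalog).
const generated = "| File | Tests |\n|---|---|\n| a.test.ts | 3 |\n";

// The same table after a formatter has padded the columns.
const formatted =
  "| File      | Tests |\n| --------- | ----- |\n| a.test.ts | 3     |\n";

console.log(isCatalogStale(generated, generated)); // false
console.log(isCatalogStale(generated, formatted)); // true
```

Under that assumption, a formatted copy of a correct catalog still fails the check, which matches the behavior the commit message describes for prior regenerations.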
review:pm Scope & Acceptance Verification
PM Lens Findings
[NON-BLOCKING] Maintenance hygiene: Consider adding a pre-commit hook or CI gate to regenerate the catalog automatically when test files change, so this stale-state gap does not reoccur. This is a follow-up suggestion, not a blocker for this PR.
Verdict: Change unblocks a critical validation gate that was incorrectly blocking all PRs on
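One possible shape for the suggested gate, as a rough sketch only: a pure function that a pre-commit hook or CI step could call with the list of changed paths. The catalog path comes from the PR description; the test-file pattern and the function name `catalogLikelyStale` are assumptions about the repo layout, not existing code.

```typescript
// Catalog path as given in the PR description.
const CATALOG_PATH = "docs/testing/automated_test_catalog.md";

// Assumed naming convention for test files; adjust to the real layout.
const TEST_FILE_PATTERN = /^tests\/.+\.test\.ts$/;

// Returns true when a change set touches test files but leaves the
// catalog untouched, which is the situation that produced this PR.
function catalogLikelyStale(changedFiles: string[]): boolean {
  const touchesTests = changedFiles.some((f) => TEST_FILE_PATTERN.test(f));
  const touchesCatalog = changedFiles.includes(CATALOG_PATH);
  return touchesTests && !touchesCatalog;
}

// A change set that adds a test file without regenerating the catalog.
console.log(
  catalogLikelyStale(["tests/integration/sandbox_seed_token_bypass.test.ts"]),
); // true

// The same change set with the catalog regenerated alongside it.
console.log(
  catalogLikelyStale([
    "tests/integration/sandbox_seed_token_bypass.test.ts",
    CATALOG_PATH,
  ]),
); // false
```

This is a heuristic: editing a test file does not always change the catalog, so a hook built on it would be better used as a prompt to rerun the generator than as a hard failure. The exact comparison in `validate:test-catalog --check` would remain the authoritative gate.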
6f632d0 to f30f510
review:pm APPROVE Scope alignment
Acceptance criteria met
No user-visible behavior impact
[NON-BLOCKING] Procedural note: Commit message notes why raw generator output was used (CI expects exact match). This is correct; marking as non-blocking since it's educational and the choice is sound per the message body.
✅ Verdict: Change matches scoped intent, no scope creep, acceptance criteria satisfied. Ready to merge.
review:ux Surface & Discoverability
Error Messages
Documentation Quality
Preventive DX
Commit message is transparent about a subtle but important detail: generator output must be raw (no prettier-formatting) so that
Operational Impact
This PR unblocks all downstream PRs currently failing CI because main's catalog drifted since #1797. The stale catalog is a blocker for every PR, even fixture-only and docs-only changes; this fixes a significant friction point.
Acceptance Checklist
📎 Neotoma: neotoma issue #1804
review:ux
UX surface examined: agent/developer experience of the test catalog regeneration workflow (
Findings
[NON-BLOCKING] Error message clarity: When
Current friction: Developers making non-test changes (docs-only, fixture-only, or unrelated features) do not expect a catalog validation failure. The word "stale" does not immediately signal "test files were added/removed/moved upstream."
Recommendation: No blocking issue; the docs (lines 310–313 in the generated file) and error message together provide enough context. This is low friction. If refined in a follow-up, consider:
Neither is necessary; the current UX is functional and clear to anyone familiar with the repo.
Positive Signals
Acceptance
✅ Signed off. No UX blockers. The catalog maintenance surface is transparent, discoverable, and maintains a single source of truth.
📎 Neotoma: none (chore: generated file sync only)
review:qa
Summary
This PR regenerates the automated test catalog to reflect the new integration test added in PR #1797. The change itself is non-functional (documentation only) and the underlying test coverage is appropriate.
Test Coverage Assessment
New Test Added (PR #1797)
The test in validates the security contract for the sandbox write-rate-limit bypass token:
Coverage:
Edge cases covered:
Catalog Regeneration (This PR)
CI Status
All critical test lanes pass:
The catalog file format matches generator output (raw, not Prettier-formatted), which is required for
Functional Surface
No functional surface; this is a documentation-only commit that regenerates a generated artifact. The actual test coverage is from PR #1797, which is already merged and validated.
Sign-off:
review:qa APPROVE
Summary
Catalog regeneration triggered by test file added in #1797 () without prior catalog sync.
QA Assessment
Functional surface: None; chore / documentation maintenance per
Existing test coverage (from #1797):
Catalog accuracy:
No additional test coverage required for catalog-only changes per project standards.
Verdict
No functional surface, so no eval required. Catalog regeneration is accurate and correctly reflects the test file added in #1797.
🤖 Vanellus: Ateles swarm, PR steward
Aggregated Review Verdict
All review lenses have signed off with no blocking findings.
Per-Lens Summary
CI Status
✅ Required checks all passing:
⏳ Advisory checks (non-blocking):
Non-Blocking Observations
Merge Readiness
✅ All blocking criteria met:
READY TO MERGE. All lenses approved; no blockers; required CI passing. Awaiting operator merge instruction per autonomy guardrail.
📎 Neotoma: Vanellus PR steward aggregation for markmhendrickson/neotoma#1804
🤖 Vanellus: Ateles swarm, PR steward
Review Panel Roll-up
PM Lens (Pavo)
APPROVE: Scope and intent aligned. Regeneration of stale test catalog is a mechanical, low-risk fix. Single generated-file update; no scope creep. Unblocks critical CI gate that was incorrectly blocking all open PRs on
UX Lens (Accipiter)
APPROVE: Developer experience surface transparent and discoverable. Error messages are actionable (not just "stale", but the exact regeneration command). Documentation flow, examples, and validation checklist are thorough. No UX blockers.
QA Lens (Phoenicurus)
APPROVE: No functional surface (documentation/chore only). Underlying test coverage from PR #1797 is comprehensive (happy path, error paths, edge cases, timing-safe comparison). Catalog regeneration is accurate: 501→502 total tests, 141→142 integration tests, correctly reflects new
CI Status
Verdict
Note: Merge is operator-gated per Vanellus protocol. All review lenses clear; awaiting operator confirmation to merge.
Closing as redundant:
Problem
main's `docs/testing/automated_test_catalog.md` has been stale since #1797, which merged `tests/integration/sandbox_seed_token_bypass.test.ts` without regenerating the catalog. The `baseline` CI job runs `validate:test-catalog --check`, so every PR opened against main fails `baseline` on this check, including fixture-only or docs-only PRs that touch no tests.
Confirmed by running `npm run validate:test-catalog` on a clean `origin/main` checkout: ❌ stale.
Fix
Regenerated the catalog (`npm run generate:test-catalog`). The drift is exactly:
- `tests/integration/sandbox_seed_token_bypass.test.ts` added to the integration file list
Generator output is deterministic (stable across repeated runs); `validate:test-catalog` passes after the change.
Impact
Unblocks `baseline` for all open PRs once they pick this up (rebase / merge main). Standalone and low-risk: a single generated-doc file.
🤖 Generated with Claude Code
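The drift described under Fix (one file present on disk but missing from the catalog) can be expressed as a set difference between two file lists. The sketch below is illustrative only: `computeDrift` and `example_a.test.ts` are hypothetical names, and the real generator may track more than file paths.

```typescript
interface CatalogDrift {
  added: string[]; // on disk but not yet in the catalog
  removed: string[]; // in the catalog but no longer on disk
}

// Compare the files the catalog lists against the files actually present.
function computeDrift(cataloged: string[], onDisk: string[]): CatalogDrift {
  const catalogedSet = new Set(cataloged);
  const onDiskSet = new Set(onDisk);
  return {
    added: onDisk.filter((f) => !catalogedSet.has(f)).sort(),
    removed: cataloged.filter((f) => !onDiskSet.has(f)).sort(),
  };
}

// Hypothetical stale catalog listing a single placeholder test file.
const cataloged = ["tests/integration/example_a.test.ts"];

// Disk state after the new integration test from #1797 landed.
const onDisk = [
  "tests/integration/example_a.test.ts",
  "tests/integration/sandbox_seed_token_bypass.test.ts",
];

const drift = computeDrift(cataloged, onDisk);
console.log(drift.added); // [ 'tests/integration/sandbox_seed_token_bypass.test.ts' ]
console.log(drift.removed); // []
```

Reporting the added and removed paths alongside the word "stale" would also address the error-message clarity point raised in the UX review.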