
Add ranked analyze map evidence bundles (--rank-probes, --evidence-bundle)#792

Merged
yuechen-li-dev merged 1 commit into master from codex/add-ranking-and-evidence-bundle-to-analyze-map
Jun 23, 2026


@yuechen-li-dev


Motivation

  • CTC-01 and dogfooding showed that coarse six-view maps are useful, but global higher-resolution grids are too slow for tight LLM loops. The tool must therefore prioritize local, ranked probes over brute-force refinement.
  • The goal is to present LLMs with compact, actionable measurement evidence (ranked probes, suggested sections, local map windows) so reasoning can focus on hypothesis testing rather than parsing huge grids.

Description

  • Add CLI flags --rank-probes and --evidence-bundle (wired through CliRunner), with --evidence-bundle implying ranked output and --rank-probes enabling scoring.
  • Extend analysis models (StepAnalysisModels.cs) and StepAnalyzer to emit RankedProbes, an evidenceBundle structure, and a compact point-probe summary (pointSummary) while preserving full hits and existing detailed output.
  • Implement a small deterministic scorer (RankSixViewProbes, ScoreComponent) that is JudgmentEngine-compatible in shape (explicit scores, reasons, evidence terms, uncertainty, tie-breaking) and prioritizes interior no-hit components, curved analytic families (cylinder/cone/torus/sphere), fallback/tessellated regions, small isolated height bands, centrality, and analytic provenance.
  • Bridge map→section by emitting recommended follow-ups for each ranked component: a centroid pointProbe command, two analyze section commands through the component centroid (mapped to the supported --xy|--xz|--yz syntax), and a localMap recommendation (marked unsupported until --bounds is added). The evidenceBundle contains coarse-map metadata, ranked questions, flattened suggested actions, executedEvidence (empty for A6), limits, and human-readable notes.
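
The scorer described above can be illustrated with a minimal sketch. This is not the C# code from the PR (RankSixViewProbes / ScoreComponent); it is a Python illustration of the same shape: explicit per-signal scores, human-readable reasons, and a deterministic tie-break. The weight values, the Component fields, and the choice of component id as the tie-breaker are all assumptions made for the example, since the description lists the prioritized signals but not their values.

```python
from dataclasses import dataclass, field

# Hypothetical weights: the PR names these signals but does not state values.
WEIGHTS = {
    "interior_no_hit": 5.0,
    "curved_family": 3.0,
    "fallback_tessellated": 2.5,
    "small_isolated_band": 2.0,
    "analytic_provenance": 1.0,
}
CURVED_FAMILIES = {"cylinder", "cone", "torus", "sphere"}


@dataclass(frozen=True)
class Component:
    """Hypothetical summary of one component found in the coarse six-view map."""
    component_id: str
    family: str                # e.g. "plane", "cylinder", "tessellated"
    interior_no_hit: bool      # no-hit region enclosed by hits
    small_isolated_band: bool  # small, isolated height band
    analytic: bool             # surface has analytic provenance
    centrality: float          # 0..1, where 1 means centred in the view


@dataclass
class RankedProbe:
    component_id: str
    score: float
    reasons: list = field(default_factory=list)


def score_component(c: Component) -> RankedProbe:
    """Sum the weights of every signal that fires and record why."""
    probe = RankedProbe(c.component_id, 0.0)
    signals = {
        "interior_no_hit": c.interior_no_hit,
        "curved_family": c.family in CURVED_FAMILIES,
        "fallback_tessellated": c.family == "tessellated",
        "small_isolated_band": c.small_isolated_band,
        "analytic_provenance": c.analytic,
    }
    for name, fired in signals.items():
        if fired:
            probe.score += WEIGHTS[name]
            probe.reasons.append(f"{name} (+{WEIGHTS[name]})")
    # Centrality contributes its raw 0..1 value so central components win ties
    # between otherwise similar candidates.
    probe.score += c.centrality
    probe.reasons.append(f"centrality (+{c.centrality})")
    return probe


def rank_probes(components):
    """Highest score first; equal scores fall back to component id order."""
    probes = [score_component(c) for c in components]
    return sorted(probes, key=lambda p: (-p.score, p.component_id))


if __name__ == "__main__":
    demo = [
        Component("box-top", "plane", False, False, True, 0.5),
        Component("bore", "cylinder", False, False, True, 0.5),
    ]
    for p in rank_probes(demo):
        print(p.component_id, p.score, "; ".join(p.reasons))
```

Sorting on the pair (negated score, id) is what makes the output reproducible: two runs over the same components always produce the same order, so an LLM consuming the ranked list sees stable evidence between iterations.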

Testing

  • Ran dotnet restore Aetheris.slnx and dotnet build Aetheris.slnx -f net10.0 --no-restore /m:1 successfully.
  • Ran the focused CLI test set with dotnet test Aetheris.CLI.Tests/Aetheris.CLI.Tests.csproj -f net10.0 --filter "AnalyzeMap|Ranked|Probes|Section|EvidenceBundle|Components" and it passed (9 tests).
  • Smoke/validation runs: executed aetheris analyze map ... --views six --resolution 8x8 --llm --rank-probes --json on a ruled fixture and confirmed ranked probes/evidence bundle were emitted; exported primitives used in tests included a cylinder and a box.
  • Attempted a full CTC-01 16x16 six-view ranked-probe run under a 60s cap. It timed out (exit 124), confirming the documented locality/performance constraint for large dogfood runs (A6 does not auto-execute follow-ups).


@yuechen-li-dev yuechen-li-dev merged commit 15c0042 into master Jun 23, 2026
2 checks passed
@yuechen-li-dev yuechen-li-dev deleted the codex/add-ranking-and-evidence-bundle-to-analyze-map branch June 23, 2026 22:02