STATUS: record real multi-year T4 streamflow baselines (all 7 gauges)#135
Merged
Conversation
Multi-year T4 ran to completion at full registry windows across all 7 wired gauges (~3 min wall, 7-way parallel; every domain >=1000 scored gauge-days, all runoff finite). Records the honest uncalibrated cold-start KGE/NSE table (new section 2.5): best Abisko KGE=+0.31, Baltimore NSE=+0.12; volume often right (beta~1) with snowmelt timing/variance the uncalibrated error; Tagus/ Iceland/Massa are domain-config (gauge-area / glacial-melt) issues, not physics. Closes the "multi-year T4 at depth" remaining-work item; real KGE skill now gated only on a separate calibration pass, not a harness gap. Assisted-by: Claude Opus 4.8 (1M context)
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Records the honest multi-year T4 streamflow baseline numbers in STATUS.md (new §2.5).
All 7 wired gauges ran to completion at their full multi-year registry windows (multi-year spin-up + multi-year scoring), in ~3 min wall under 7-way parallelism; every domain scored ≥1,000 overlapping gauge-days with all simulated runoff finite (nbad=0).
These are uncalibrated cold-start baselines (the honest floor). Volume is often right (β≈1); timing/variance is the uncalibrated error. Tagus/Iceland/Massa are domain-config issues (gauge-area / glacial-melt), not physics. Real positive KGE needs a separate calibration pass — not a harness gap.
Docs-only; no code change.
🤖 Generated with Claude Code