Add code to Bundleio to generate error stats by zingo · Pull Request #12051 · pytorch/executorch

zingo · 2025-06-27T06:07:42Z

Add a way to get error stats/metrics between actual and reference output.

cc @digantdesai @freddan80 @per @oscarandersson8218

Signed-off-by: Zingo Andersen <zingo.andersen@arm.com> Change-Id: Ib51b22c80954c87812b81b6fa9798ace705a555a

pytorch-bot · 2025-06-27T06:07:47Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12051

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

VolumeLimitExceeded Issue for linux.2xlarge and linux.4xlarge

✅ No Failures

As of commit 49542cb with merge base 142b1c6 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

zingo · 2025-06-27T06:16:07Z

Hi @digantdesai and @mergennachin this PR will add a new method/API to BundleIO and maybe you want to involve the proper people about this :) Also is the metric sane? And the way to propagate it back to the runner?

My basics intentions is to be able to log/track models over time in a better way then PASS/FAIL on a set atol/rtol, as it easily miss if we could have set rtol/atol lower when improving stuff. Something like this is also useful to get a good guess of what atol/rtol could be to make it work instead of a lot of trail and error with different values.

zingo · 2025-06-27T06:33:19Z

+          double abs_err = std::abs(a_data[k] - e_data[k]);
+          double relative_divider =
+              std::max(std::abs(a_data[k]), std::abs(e_data[k]));
+          relative_divider = std::max(relative_divider, eps);
+          double relative_err = abs_err / relative_divider;
+
+          sum_abs += abs_err;
+          max_abs = std::max(max_abs, abs_err);
+          sum_rel += relative_err;
+          max_rel = std::max(max_rel, relative_err);


Is this good? I'm no ML-math-stats person so if this can be improved we should in PR or after :)

I think it is good for regular cases e.g. both input and target are in float32 and good enough for bundled program. We shouldn't expect that bundled program can cover all cases like quantization

digantdesai · 2025-07-02T17:43:47Z

I like this general direction of getting more than true/false. And since its opt in, we care a bit less about the binary size overhead. I will let @Gasoonjia weigh in. He is looking at this for AoT with multiple "distance" measures. Thanks @zingo.

Gasoonjia

Thansk @zingo for the update and overall l love the update. It makes the error msg more meaningful.

Also spoiler alert we will have a new api in 0.7 release (https://github.com/pytorch/executorch/blob/main/devtools/inspector/_inspector.py#L1365) for comparing intermediate output in operator-level, beyond what we have right now in bundled program for only compare the final result! Stay tune and we will have a doc for better demonstration!

@digantdesai

Add a way to get error stats/metrics between actual and reference output. cc @digantdesai @freddan80 @per @oscarandersson8218 Signed-off-by: Zingo Andersen <zingo.andersen@arm.com>

digantdesai · 2025-07-10T12:08:53Z

Thank you both. @zingo or @Gasoonjia we should use this compute_method_output_error_stats for other runners as well.

Add code to Bundleio to generate error stats

336e0ae

Signed-off-by: Zingo Andersen <zingo.andersen@arm.com> Change-Id: Ib51b22c80954c87812b81b6fa9798ace705a555a

zingo requested review from Gasoonjia and digantdesai as code owners June 27, 2025 06:07

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 27, 2025

zingo added release notes: devtools Changes to dev tooling, for example the debugger & profiler release notes: arm Changes to the ARM backend delegate ciflow/trunk partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm labels Jun 27, 2025

zingo commented Jun 27, 2025

View reviewed changes

Merge branch 'main' into Add-code-to-Bundleio-to-generate-error-stats

49542cb

mergennachin requested review from JacobSzwejbka and larryliu0820 June 27, 2025 14:22

Gasoonjia approved these changes Jul 2, 2025

View reviewed changes

Gasoonjia merged commit 59e0476 into pytorch:main Jul 2, 2025
201 checks passed

zingo deleted the Add-code-to-Bundleio-to-generate-error-stats branch August 8, 2025 09:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add code to Bundleio to generate error stats#12051

Add code to Bundleio to generate error stats#12051
Gasoonjia merged 2 commits into
pytorch:mainfrom
zingo:Add-code-to-Bundleio-to-generate-error-stats

zingo commented Jun 27, 2025 •

edited by pytorch-bot Bot

Loading

Uh oh!

pytorch-bot Bot commented Jun 27, 2025 •

edited

Loading

Uh oh!

zingo commented Jun 27, 2025

Uh oh!

zingo Jun 27, 2025

Uh oh!

Gasoonjia Jul 2, 2025

Uh oh!

digantdesai commented Jul 2, 2025

Uh oh!

Gasoonjia left a comment •

edited

Loading

Uh oh!

Uh oh!

digantdesai commented Jul 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

zingo commented Jun 27, 2025 • edited by pytorch-bot Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Jun 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12051

❗ 1 Active SEVs

✅ No Failures

Uh oh!

zingo commented Jun 27, 2025

Uh oh!

zingo Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

Gasoonjia Jul 2, 2025

Choose a reason for hiding this comment

Uh oh!

digantdesai commented Jul 2, 2025

Uh oh!

Gasoonjia left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

digantdesai commented Jul 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

zingo commented Jun 27, 2025 •

edited by pytorch-bot Bot

Loading

pytorch-bot Bot commented Jun 27, 2025 •

edited

Loading

Gasoonjia left a comment •

edited

Loading