Skip to content

feat(datasets): Add partition skew distance metrics#7389

Draft
WilliamLindskog wants to merge 1 commit into
fix/flwr-datasets-iid-metricsfrom
feat/flwr-datasets-skew-metrics
Draft

feat(datasets): Add partition skew distance metrics#7389
WilliamLindskog wants to merge 1 commit into
fix/flwr-datasets-iid-metricsfrom
feat/flwr-datasets-skew-metrics

Conversation

@WilliamLindskog

@WilliamLindskog WilliamLindskog commented Jun 16, 2026

Copy link
Copy Markdown
Member

What changed

  • Add public Hellinger and Jensen-Shannon distance helpers for categorical partition-label distributions
  • Keep the distance implementation in a dedicated flwr_datasets.metrics.distances module
  • Export the helpers from flwr_datasets.metrics
  • Document a small categorical partition-skew measurement workflow

Review scope

This is split out from #7385 and should be reviewed after the shuffle-only IID partitioner change. The PR is intentionally stacked on fix/flwr-datasets-iid-metrics so this diff contains only the metrics follow-up.

To keep this first metrics PR lightweight, continuous-target binning and additional distance configuration are left for a later follow-up if needed.

Issue/PR mapping

Validation

  • pytest, ruff, mypy, and black --check on the touched metrics files
  • git diff --check

@WilliamLindskog WilliamLindskog force-pushed the feat/flwr-datasets-skew-metrics branch from 5379a73 to 4899095 Compare June 16, 2026 02:55
@github-actions github-actions Bot added the Maintainer Used to determine what PRs (mainly) come from Flower maintainers. label Jun 16, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Maintainer Used to determine what PRs (mainly) come from Flower maintainers.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant