Adjust mod multiplier recalculation processes to minimise number of required database writes by bdach · Pull Request #374 · ppy/osu-queue-score-statistics

bdach · 2026-05-29T12:13:53Z

Part of Mod score multiplier rebalance osu#37818
Depends on the actual game-side mod multiplier changes
Will want to recheck further myself with a fresh brain

PRing early as draft primarily for visibility reasons.

The original plan for deploying the mod multiplier changes was to first ensure that all scores in the database have `total_score_without_mods` present, and then write out the updated `total_score` across the whole table as the new multipliers times that populated `total_score_without_mods` quantity.

Aside: Why is the migration not run in-place?

The primary reason I approached things this way, rather than running the recalculation in-place, is that an in-place recalculation would be an inherently lossy process.

Assuming everything goes right, because total score is rounded to full integers and mod multipliers are very much not integers, the possibility of rounding errors and floating-point inaccuracies is very real. Therefore, any in-place recalculation of multipliers would ideally only ever occur once and never again, because the degradation compounds with every in-place conversion.
And that is assuming everything goes right. In case of any mistake or incorrectly-applied migration, the original data would be lost in an in-place migration, and would not be recoverable quickly or at all because of how complicated the scoring algorithm is at the explicit demand of the user base. The closest thing to a recovery procedure in that case would be replays - when there are replays - and even that would be slow.
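
The compounding loss can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names, the multipliers 1.12 and 1.06 (not real game values), and the in-place formula itself (divide by the old multiplier, multiply by the new one).

```python
def apply_multiplier(total_score_without_mods: int, multiplier: float) -> int:
    """Derive total score from the stored, unmultiplied base value."""
    return round(total_score_without_mods * multiplier)


def recalculate_in_place(total_score: int, old_multiplier: float, new_multiplier: float) -> int:
    """Assumed in-place approach: undo the old multiplier, apply the new one."""
    return round(total_score / old_multiplier * new_multiplier)


def count_round_trip_losses(scores, multiplier_a: float, multiplier_b: float) -> int:
    """Count scores that do not survive an a -> b -> a in-place round trip."""
    losses = 0
    for total_score in scores:
        there = recalculate_in_place(total_score, multiplier_a, multiplier_b)
        back = recalculate_in_place(there, multiplier_b, multiplier_a)
        if back != total_score:
            losses += 1
    return losses


# Hypothetical multipliers, chosen only to make the effect visible.
print(count_round_trip_losses(range(1000, 2000), 1.12, 1.06))

# Recomputing from the stored base gives the same answer for the same
# multiplier, no matter how many multiplier changes happened in between.
base = 1500
history = [1.12, 1.06, 1.12, 1.06]
values = [apply_multiplier(base, m) for m in history]
assert values[1] == values[3]
```

Losses are unavoidable here: the first conversion squeezes 1000 distinct inputs into fewer than 1000 distinct integers, so at least two inputs collide and cannot both be restored.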

It is my former professional experience that tells me you never want to run huge data migrations in-place which incur chances of irretrievable or difficult-to-recover-from data loss.

In recent days, an attempt was made to run the backpopulation of `total_score_without_mods` on the production scores table. Unfortunately, at this time the original plan appears to be mostly unviable.

The number of rows in the production scores table that require this backfill is estimated to be around 2.38 billion. This would incur a storage overhead in the hundreds of gigabytes (not only from the actual rows themselves, but also from a transient spike in the size of the binlog, which is relevant when replication is depended upon) and take roughly a month to execute.

This PR contains a set of changes that is designed to reduce the number of writes required to execute this migration. The cost here is the increased complexity of the migration process. While I am somewhat confident this is going to work, I will want to test this more thoroughly on the actual changes and maybe even do some test runs locally using data dumps to make sure I have not missed anything.

Additionally, comments from #269 (review) are addressed here.

As for the gory details:

Switch to new mod multiplier calculation API in backpopulation command

By itself this does nothing; the old API worked *fine* as far as the backpopulation was concerned. However, doing this makes it possible to automatically handle the change to mania key mod multipliers that was done live without backwards adjustments. More on this later.

Adjust query in backpopulation command to target the minimum necessary number of rows

As the inline commentary states, the number of rows that require writes of `total_score_without_mods` can be significantly reduced because of two facts:

- Firstly, when it comes to scores set on stable, `total_score_without_mods` as well as `total_score` are quantities derived directly from `legacy_total_score` as well as beatmap attributes. Therefore, it is not actually necessary to write `total_score_without_mods` for those scores; the score conversion algorithm can be used to recalculate the multipliers instead.
- Secondly, when it comes to lazer scores specifically, this change assumes the invariant that if there are no mods on a score, then `total_score == total_score_without_mods` - so there is no point writing out values.
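
The two facts above boil down to a row filter, sketched here in Python. The `ScoreRow` type, its `set_on_stable` flag and the sample rows are hypothetical simplifications for illustration; they are not the real table schema or the query used in this PR.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class ScoreRow:
    # Hypothetical, simplified view of a row; not the real table schema.
    id: int
    set_on_stable: bool
    mods: Tuple[str, ...]
    total_score: int
    total_score_without_mods: Optional[int] = None


def needs_backfill(row: ScoreRow) -> bool:
    """Return True only for rows that must have total_score_without_mods written."""
    if row.total_score_without_mods is not None:
        return False  # already populated
    if row.set_on_stable:
        return False  # derivable from legacy_total_score and beatmap attributes
    if not row.mods:
        return False  # invariant: total_score == total_score_without_mods
    return True


rows = [
    ScoreRow(1, set_on_stable=True, mods=("HD",), total_score=900_000),
    ScoreRow(2, set_on_stable=False, mods=(), total_score=950_000),
    ScoreRow(3, set_on_stable=False, mods=("HR",), total_score=980_000),
]
to_write = [row.id for row in rows if needs_backfill(row)]
print(to_write)  # only the lazer score with mods remains
```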

Adjust backpopulation command to automatically handle change to mania key mod multipliers

Migrating to the new multiplier API, which already has to handle multiplier versioning and the directly-applied change to mania key mod multipliers, means that it is now possible to leverage the versioned logic and automatically correct for the key mod multiplier change by filling out `ScoreInfo.ClientVersion` using `osu_builds`.
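
A rough Python sketch of the idea: resolve each score's build to a client version, then let versioned logic pick the multiplier. The cutoff version, both multiplier values and all function names are placeholders invented for this example, and the version format is assumed to be purely numeric and dot-separated; none of this is the actual game API.

```python
from typing import Dict, Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    # Assumes a purely numeric, dot-separated version string.
    return tuple(int(part) for part in version.split("."))


# Placeholder values for illustration only; not real game data.
HYPOTHETICAL_CHANGE_VERSION = parse_version("2000.101.0")
HYPOTHETICAL_OLD_MULTIPLIER = 0.9
HYPOTHETICAL_NEW_MULTIPLIER = 1.0


def resolve_client_version(build_id: int, build_versions: Dict[int, str]) -> str:
    """Look up a build in a mapping that was prefetched once up front."""
    if build_id not in build_versions:
        # Fail loudly rather than silently guessing a version.
        raise KeyError(f"build {build_id} is missing from the prefetched mapping")
    return build_versions[build_id]


def key_mod_multiplier(client_version: str) -> float:
    """Pick the multiplier that was live when the score was set."""
    if parse_version(client_version) < HYPOTHETICAL_CHANGE_VERSION:
        return HYPOTHETICAL_OLD_MULTIPLIER
    return HYPOTHETICAL_NEW_MULTIPLIER


# Stand-in for the contents of osu_builds.
build_versions = {10: "1999.1231.0", 11: "2000.101.0"}
for build_id in (10, 11):
    version = resolve_client_version(build_id, build_versions)
    print(build_id, version, key_mod_multiplier(version))
```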

Adjust mod multiplier recalculation command to work with the minimum necessary number of rows with `total_score_without_mods`

Adds an alternate processing path for stable scores which leverages the algorithm of converting from legacy total score to implement the multiplier recalculation.

The command was already skipping scores with no mods at all, so no adjustment is required there.
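
The resulting processing paths can be sketched in Python as follows. The row type is hypothetical, treating a non-null `legacy_total_score` as the marker for a stable score is an assumption, and `convert_from_legacy` is a stub standing in for the real (far more involved) score conversion algorithm.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass(frozen=True)
class ScoreRow:
    # Hypothetical, simplified row; not the real table schema.
    mods: Tuple[str, ...]
    legacy_total_score: Optional[int] = None  # assumed to be set only for stable scores
    total_score_without_mods: Optional[int] = None


def recalculate_total_score(
    row: ScoreRow,
    new_multiplier: float,
    convert_from_legacy: Callable[[int, Tuple[str, ...]], int],
) -> Optional[int]:
    """Return the new total score, or None when the row can be skipped."""
    if not row.mods:
        return None  # no mods: nothing to recalculate
    if row.legacy_total_score is not None:
        # Stable path: rerun the conversion from the legacy total score.
        return convert_from_legacy(row.legacy_total_score, row.mods)
    if row.total_score_without_mods is None:
        raise ValueError("lazer score with mods is missing total_score_without_mods")
    # Lazer path: multiply the stored base value by the new multiplier.
    return round(row.total_score_without_mods * new_multiplier)


def stub_conversion(legacy_total_score: int, mods: Tuple[str, ...]) -> int:
    # Stand-in only; the real conversion depends on beatmap attributes.
    return legacy_total_score * 2


print(recalculate_total_score(ScoreRow(mods=()), 1.5, stub_conversion))
print(recalculate_total_score(ScoreRow(("HD",), legacy_total_score=500), 1.5, stub_conversion))
print(recalculate_total_score(ScoreRow(("HR",), total_score_without_mods=1000), 1.5, stub_conversion))
```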

Add test coverage of commands

Written using real-world cases pulled from live production scores (via data.ppy.sh dumps plus some manual trawling for the mania key mod cases).

In the current state the most useful tests are the ones checking correctness of the mania key mod recalculation. Everything else will need updating once the new proposed values are out.

peppy

No major issues found.

peppy · 2026-06-01T05:41:34Z

            return 0;
        }

+        private static BeatmapScoringAttributes? getScoringAttributesFor(SoloScore score, MySqlConnection conn)


I wonder if we want to expose and use the cached path for this.

osu-queue-score-statistics/osu.Server.Queues.ScoreStatisticsProcessor/Helpers/BatchInserter.cs

Lines 348 to 352 in 1421bad

    private static readonly ConcurrentDictionary<BeatmapLookup, BeatmapScoringAttributes?> scoring_attributes_cache =
        new ConcurrentDictionary<BeatmapLookup, BeatmapScoringAttributes?>();

    private static BeatmapScoringAttributes? getScoringAttributes(BeatmapLookup lookup)
    {

I considered it. My primary concern was running out of memory because the aforementioned static dictionary is never cleared other than via BeatmapStatusWatcher.StartPollingAsync() which is only relevant when the actual data changes.

I'm not even sure why this seemingly doesn't explode in BatchInserter, really.

There's a finite number of ranked beatmaps in the mentioned case (BatchInserter should only be dealing with ranked beatmap scores). It can most definitely fit in memory.

For our purposes here, I think we also consider unranked beatmaps, which may tip the scales.

> For our purposes here, I think we also consider unranked beatmaps

There are presumably lazer scores set on unranked beatmaps, yes. I am not aware of any process culling them at this time.
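
For illustration only, the trade-off discussed in this thread can be sketched with a size-bounded cache. Nobody in the thread proposes this; it is a swapped-in technique shown in Python with `functools.lru_cache`, and the lookup function with its return shape is a made-up stand-in for the real database query.

```python
from functools import lru_cache

database_lookups = []  # records every simulated database round trip


@lru_cache(maxsize=2)  # tiny limit so that eviction is visible in the example
def get_scoring_attributes(beatmap_id: int, ruleset_id: int) -> dict:
    # Stand-in for a database query; the returned shape is made up.
    database_lookups.append((beatmap_id, ruleset_id))
    return {"beatmap_id": beatmap_id, "ruleset_id": ruleset_id}


for beatmap_id in (1, 2, 3, 1, 3):
    get_scoring_attributes(beatmap_id, 0)

# Beatmap 1 was evicted and fetched again; the second lookup of beatmap 3 was a cache hit.
print(len(database_lookups))
print(get_scoring_attributes.cache_info().currsize)
```

A bound like this caps memory regardless of how many unranked beatmaps are encountered, at the cost of repeated lookups for evicted entries.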

bdach · 2026-06-01T09:29:26Z

I'll undraft and check off the boxes from the OP now, given that the indicated plan was to dry run this and check whether this new adjusted process is preserving existing multipliers (with the one exception of mania key mods).

Still a bit scared of all this, but the testing added in 9b87267 should make sure I haven't screwed up dry run mode at least.

Will re-test once more with new multiplier values.

bdach added 5 commits May 29, 2026 13:53

Add test coverage of commands

e0b8c10

Written using real-world cases pulled from live production scores (via data.ppy.sh dumps plus some manual trawling for the mania key mod cases).

bdach self-assigned this May 29, 2026

bdach added this to osu! team task tracker May 29, 2026

pull-request-size Bot added the size/XXL label May 29, 2026

github-project-automation Bot moved this to Inbox in osu! team task tracker May 29, 2026

bdach moved this from Inbox to Pending Review in osu! team task tracker May 29, 2026

bdach mentioned this pull request May 29, 2026

Mod score multiplier rebalance ppy/osu#37818

Open

29 tasks

peppy self-requested a review May 29, 2026 16:35

peppy reviewed Jun 1, 2026

bdach added 5 commits June 1, 2026 10:26

Assert existence of build when populating total score without mods

3546b02

Prefetch build-id-to-version mapping once

6676129

Bump game packages

942ac75

This will have to be done again when the new mod multipliers land, but I guess the tag is already there to use.

Fix incorrect test data

324f74f

Add test coverage for dry run mode

9b87267

peppy approved these changes Jun 1, 2026

bdach marked this pull request as ready for review June 1, 2026 09:29

peppy approved these changes Jun 1, 2026

peppy merged commit f548863 into ppy:master Jun 1, 2026
4 checks passed

github-project-automation Bot moved this from Pending Review to Done in osu! team task tracker Jun 1, 2026

bdach deleted the multiplier-recalculation-with-reduced-writes branch June 2, 2026 05:47

bdach added a commit to bdach/osu that referenced this pull request Jun 8, 2026

Update total score to latest version on replay import

31b587c

This mirrors the updated server flow (ppy/osu-queue-score-statistics#374) pretty closely.

bdach mentioned this pull request Jun 8, 2026

Implement client-side migration paths for new mod multipliers ppy/osu#38022

Open

2 tasks

peppy merged 10 commits into ppy:master from bdach:multiplier-recalculation-with-reduced-writes
