Upgrade transformers to 5.5.4 and huggingface-hub to 1.16 by dxqb · Pull Request #1524 · Nerogar/OneTrainer

dxqb · 2026-06-14T22:31:00Z

Summary

reopens #1472 but only upgrades to 5.5.4, right before huggingface/transformers#44431 was merged to avoid the issues described in #1506

Test plan

pre-commit run --all-files passes
Launched the affected UI or script and exercised the change
Tested with at least one real preset / config when relevant (note which: Lens, Ideogram, SDXL!)

AI assistance

AI-assisted — I have read every line in this diff and can defend each change

…16 (Nerogar#147…" This reverts commit 574ec55.

5.5.4 is the last release before CLIP flattening in 5.6, which avoids the full CLIP-compat migration while still picking up the general v5 fixes from Nerogar#1472 (Trie removal, thread-safety, hub 1.16/xet cleanup). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

HFModelLoaderMixin checked sub_module._checkpoint_conversion_mapping for legacy checkpoint key renaming, but that attribute is just an empty {} declared on the base PreTrainedModel class in this transformers version. The actual renaming rules now live in transformers' central conversion registry. Use get_checkpoint_conversion_mapping()/rename_source_key() instead, which correctly remaps Ernie's Mistral3 text encoder (language_model.model.* -> language_model.*) and Qwen's Qwen2_5_VL text encoder (model.* -> model.language_model.*, visual.* -> model.visual.*). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

T5EncoderModel's encoder.embed_tokens.weight is tied to shared.weight, saved only once in the checkpoint. The manual loading path in HFModelLoaderMixin.__load_sub_module only assigns tensors for keys present in the state dict, so the tied key was left an empty meta tensor, crashing Chroma's text_encoder, Flux's text_encoder_2 and SD3's text_encoder_3 with "Cannot copy out of meta tensor; no data!". Generalize the fix in __load_sub_module: for every _tied_weights_keys entry still on the meta device, clone the already-loaded, already dtype-converted source parameter into it. Cloning (rather than aliasing via the real tie_weights()) keeps the two parameters as separate objects, so a later in-place quantization of one side (e.g. quantize_layers() quantizing a quantized lm_head) can't silently corrupt the other (e.g. the embedding table) through a shared Parameter object -- verified empirically. This subsumes the per-loader manual workaround already used for Qwen3-based causal LM text encoders (Flux2, ZImage), which is removed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

get_checkpoint_conversion_mapping(sub_module.config.model_type), added to fix Ernie/Qwen text encoder loading, ran unconditionally in __load_sub_module. Diffusers sub-modules (loaded via _load_diffusers_sub_module, e.g. Anima's VAE) have a plain FrozenDict config with no model_type attribute, crashing with AttributeError. Diffusers has no such checkpoint-conversion registry and never needs this renaming, so skip it when the config lacks model_type. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…hts check Addresses review comment on PR Nerogar#1524.

dxqb · 2026-06-18T19:18:37Z

smoke test passes on embeddings

OK: #sd 1.5 embedding
OK: #sd 2.1 embedding
OK: #sdxl 1.0 embedding
OK: #wuerstchen 2.0 embedding

dxqb · 2026-06-18T19:19:11Z

LoRA smoke test passed via preview branch

The ctk view refactor moved ConceptWindow.__download_dataset into the new ConceptWindowController.download_dataset from a base predating PR Nerogar#1524, which silently reverted that PR's fix: the method again called huggingface_hub.login(token=..., new_session=False) with no empty-token guard. new_session was removed from login() in huggingface-hub 1.16, so every call raised TypeError, swallowed by the surrounding except, so snapshot_download never ran and dataset download was fully broken. Re-apply Nerogar#1524 at the new location: only login when a token is configured, and drop new_session. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

dxqb and others added 2 commits June 5, 2026 21:00

Revert "Revert "Upgrade transformers to 5.9 and huggingface-hub to 1.…

2f0620b

…16 (Nerogar#147…" This reverts commit 574ec55.

dxqb mentioned this pull request Jun 15, 2026

Upgrade transformers to 5.9 #1506

Closed

dxqb added the preview merged in the preview branch label Jun 16, 2026

This comment was marked as resolved.

Sign in to view

dxqb and others added 2 commits June 17, 2026 21:28

dxqb commented Jun 18, 2026

View reviewed changes

Comment thread modules/modelLoader/mixin/HFModelLoaderMixin.py Outdated

dxqb added 2 commits June 18, 2026 19:48

Update transformers version comment in requirements

26f466e

Use explicit branch instead of for-loop-over-empty-dict for tied weig…

0d4f72d

…hts check Addresses review comment on PR Nerogar#1524.

dxqb marked this pull request as ready for review June 18, 2026 19:18

dxqb merged commit 3e3b3e8 into Nerogar:master Jun 18, 2026
1 check passed

emanuelenurra78-byte mentioned this pull request Jun 23, 2026

Fix CLIPTextModel compatibility with transformers > 5.5 #1548

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Upgrade transformers to 5.5.4 and huggingface-hub to 1.16#1524

Upgrade transformers to 5.5.4 and huggingface-hub to 1.16#1524
dxqb merged 7 commits into
Nerogar:masterfrom
dxqb:transformers-5.5.4

dxqb commented Jun 14, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

This comment was marked as resolved.

Uh oh!

dxqb commented Jun 18, 2026

Uh oh!

dxqb commented Jun 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

dxqb commented Jun 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

AI assistance

Uh oh!

This comment was marked as resolved.

This comment was marked as resolved.

Uh oh!

dxqb commented Jun 18, 2026

Uh oh!

dxqb commented Jun 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dxqb commented Jun 14, 2026 •

edited

Loading