fix(z-image): Fix padding token shape mismatch for GGUF models #8690

Pfannkuchensack · 2025-12-22T17:35:44Z

Summary

Fix shape mismatch when loading GGUF-quantized Z-Image transformer models.

GGUF Z-Image models store x_pad_token and cap_pad_token with shape [3840], but diffusers ZImageTransformer2DModel expects [1, 3840] (with batch dimension). This caused a RuntimeError on Linux systems when loading models like z_image_turbo-Q4_K.gguf.

The fix:

Dequantizes GGMLTensors first (since they don't support unsqueeze)
Reshapes the tensors to add the missing batch dimension

Related Issues / Discussions

Reported by Linux user using:

QA Instructions

Install a GGUF-quantized Z-Image model (e.g., z_image_turbo-Q4_K.gguf)
Install a Qwen3 GGUF encoder
Run a Z-Image generation
Verify no RuntimeError: size mismatch for x_pad_token error occurs

Merge Plan

None, straightforward fix.

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
❗Changes to a redux slice have a corresponding migration
Documentation added / updated (if applicable)
Updated What's New copy (if doing a release after this PR)

GGUF Z-Image models store x_pad_token and cap_pad_token with shape [dim], but diffusers ZImageTransformer2DModel expects [1, dim]. This caused a RuntimeError when loading GGUF-quantized Z-Image models. The fix dequantizes GGMLTensors first (since they don't support unsqueeze), then reshapes to add the batch dimension.

Pfannkuchensack requested review from blessedcoolant and lstein as code owners December 22, 2025 17:35

github-actions bot added python PRs that change python files backend PRs that change backend files labels Dec 22, 2025

Merge branch 'main' into fix/z-image-gguf-padding-token-shape

5264b75

blessedcoolant approved these changes Dec 23, 2025

View reviewed changes

Merge branch 'main' into pr/8690

7068cf9

blessedcoolant enabled auto-merge December 23, 2025 00:30

blessedcoolant merged commit 90e3400 into invoke-ai:main Dec 23, 2025
13 checks passed

Pfannkuchensack deleted the fix/z-image-gguf-padding-token-shape branch December 23, 2025 00:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(z-image): Fix padding token shape mismatch for GGUF models #8690

fix(z-image): Fix padding token shape mismatch for GGUF models #8690

Uh oh!

Pfannkuchensack commented Dec 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix(z-image): Fix padding token shape mismatch for GGUF models #8690

fix(z-image): Fix padding token shape mismatch for GGUF models #8690

Uh oh!

Conversation

Pfannkuchensack commented Dec 22, 2025

Summary

Related Issues / Discussions

QA Instructions

Merge Plan

Checklist

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants