Conversation

@linoytsaban (Collaborator)

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@linoytsaban requested a review from sayakpaul on August 23, 2024, 11:40
@sayakpaul (Member) left a comment:


Single comment.

Comment on lines +1602 to +1603
height=int(model_input.shape[2] * vae_scale_factor / 2),
width=int(model_input.shape[3] * vae_scale_factor / 2),
@sayakpaul (Member):

Where is this from?

latents = self._unpack_latents(latents, height, width, self.vae_scale_factor)

We don't additionally scale it by "/2".

@linoytsaban (Collaborator, Author):

It's just a modification of the original version of the training scripts, where we had

model_pred = FluxPipeline._unpack_latents(
                    model_pred,
                    height=int(model_input.shape[2] * 8),
                    width=int(model_input.shape[3] * 8),
                    vae_scale_factor=vae_scale_factor,
                )

and it didn't work with all resolutions, so we fixed it in the previous PR for the LoRA script

(in the pipeline there is this: diffusers/src/diffusers/pipelines/flux/pipeline_flux.py)

@sayakpaul (Member):

Yeah but we still don't have to do the additional scaling in the original pipeline, no? And it works with multiple resolutions without that. So, I am struggling to understand why we would need it here.

@linoytsaban (Collaborator, Author):

In the pipeline, prepare_latents first scales the width and height by

height = 2 * (int(height) // self.vae_scale_factor)
width = 2 * (int(width) // self.vae_scale_factor)

but the original height and width aren't overridden, so the values later passed to _unpack_latents are still in pixel space, i.e. 8x the scaled version above.
In the training script, we send

height=model_input.shape[2],
width=model_input.shape[3],

to _pack_latents, and these are already the scaled-down values because packing happens after VAE encoding,
i.e. model_input.shape[2] is equivalent in shape to 2 * (int(height) // self.vae_scale_factor).
That's why, when we call _unpack_latents in the training script, we need to scale back up.
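
Concretely, here's a minimal numeric sketch of the shape bookkeeping (assuming vae_scale_factor = 16 and a 1024x1024 input; the values are just for illustration):

    pixel_height = 1024
    vae_scale_factor = 16

    # Pipeline path: prepare_latents computes the scaled-down value, but the
    # original `height` is not overridden, so _unpack_latents later receives
    # the pixel-space value (1024).
    latent_height = 2 * (int(pixel_height) // vae_scale_factor)  # 128, matches model_input.shape[2]

    # Training-script path: model_input.shape[2] is already 128, so it has to be
    # scaled back up before it's handed to _unpack_latents:
    unpack_height = int(latent_height * vae_scale_factor / 2)  # 128 * 16 / 2 == 1024
    assert unpack_height == pixel_height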

@sayakpaul (Member):

Ah okay. Thanks for explaining this. Perhaps we could add a link to this comment in our script for our bookkeeping?

@linoytsaban (Collaborator, Author):

done :)

@linoytsaban (Collaborator, Author):

@sayakpaul shall we merge?

@sayakpaul merged commit c977966 into huggingface:main on Aug 26, 2024
@sayakpaul (Member):

Thank you!

@linoytsaban deleted the dreambooth-flux branch on November 26, 2024, 10:18
sayakpaul pushed a commit that referenced this pull request Dec 23, 2024
…h lora) (#9257)

* fix shape

* fix prompt encoding

* style

* fix device

* add comment

Successfully merging this pull request may close these issues: DreamBooth training script / FLUX.1 [dev]
