Upstream #1

cmdr2 · 2023-03-27T09:45:59Z

No description provided.

* Adding `use_safetensors` argument to give more control to users about which weights they use. * Doc style. * Rebased (not functional). * Rebased and functional with tests. * Style. * Apply suggestions from code review * Style. * Addressing comments. * Update tests/test_pipelines.py Co-authored-by: Will Berman <wlbberman@gmail.com> * Black ??? --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Will Berman <wlbberman@gmail.com>

add safety checker

…2710) Improve error message

fix link Co-authored-by: yiyixuxu <yixu310@gmail,com>

* minor edits to onnx and openvino docs. * Apply suggestions from code review Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> --------- Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

* updated black format * update black format * make style format * updated line endings * update code formatting * Update examples/research_projects/onnxruntime/text_to_image/train_text_to_image.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/models/vae.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/models/vae.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * added vae gradient checkpointing test * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Will Berman <wlbberman@gmail.com>

* [Tests] Correct PT2 * correct more * move versatile to nightly * up * up * again * Apply suggestions from code review

Fix typos

* add: controlnet entry to training section in the docs. * formatting. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * wrap in a tip block. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

Update numbers for Habana Gaudi in doc

* first refactor * more text * improve * finish * up * up * up * up * finish * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> * up * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finished * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finished --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* [Img2Img]: Copyover img2img pipeline * [Img2Img]: img2img pipeline * [Img2Img]: img2img pipeline * [Img2Img]: img2img pipeline --------- Co-authored-by: njindal <njindal@adobe.com>

…Pipeline (#2731) * Add guidance start/end parameters to community controlnet img2img pipeline * Fix formats

add test_to_dtype to check pipe.to(fp16)

* Update benchmark for A100, 3090, 3090 Ti, 4090. * Link to PyTorch blog. * Update install instructions.

* clarify purpose of repro docs * apply feedback

* [MS Text To Video} Add first text to video * upload * make first model example * match unet3d params * make sure weights are correcctly converted * improve * forward pass works, but diff result * make forward work * fix more * finish * refactor video output class. * feat: add support for a video export utility. * fix: opencv availability check. * run make fix-copies. * add: docs for the model components. * add: standalone pipeline doc. * edit docstring of the pipeline. * add: right path to TransformerTempModel * add: first set of tests. * complete fast tests for text to video. * fix bug * up * three fast tests failing. * add: note on slow tests * make work with all schedulers * apply styling. * add slow tests * change file name * update * more correction * more fixes * finish * up * Apply suggestions from code review * up * finish * make copies * fix pipeline tests * fix more tests * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * apply suggestions * up * revert --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Remove warmup passes in mps tests. * Update mps docs: no warmup pass in PyTorch 2 * Update imports.

* add noise offset * make style

* add: section on multiple controlnets. Co-authored-by: William Berman <WLBberman@gmail.com> * fix: docs. * fix: docs. --------- Co-authored-by: William Berman <WLBberman@gmail.com>

* add: initial implementation of the pix2pix instruct training script. * shorten cli arg. * fix: main process check. * fix: dataset column names. * simplify tokenization. * proper placement of null conditions. * apply styling. * remove debugging message for conditioning do. * complete license. * add: requirements.tzt * wandb column name order. * fix: augmentation. * change: dataset_id. * fix: convert_to_np() call. * fix: reshaping. * fix: final ema copy. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * address PR comments. * add: readme details. * config fix. * downgrade version. * reduce image width in the readme. * note on hyperparameters during generation. * add: output images. * update readme. * minor edits to readme. * debugging statement. * explicitly placement of the pipeline. * bump minimum diffusers version. * fix: device attribute error. * weight dtype. * debugging. * add dtype inform. * add seoarate te and vae. * add: explicit casting/ * remove casting. * up. * up 2. * up 3. * autocast. * disable mixed-precision in the final inference. * debugging information. * autocasting. * add: instructpix2pix training section to the docs. * Empty-Commit --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* [UNet3DModel] Fix attn processor * make style

The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.

* initial TokenEncoder and ContinuousEncoder * initial modules * added ContinuousContextTransformer * fix copy paste error * use numpy for get_sequence_length * initial terminal relative positional encodings * fix weights keys * fix assert * cross attend style: concat encodings * make style * concat once * fix formatting * Initial SpectrogramPipeline * fix input_tokens * make style * added mel output * ignore weights for config * move mel to numpy * import pipeline * fix class names and import * moved models to models folder * import ContinuousContextTransformer and SpectrogramDiffusionPipeline * initial spec diffusion converstion script * renamed config to t5config * added weight loading * use arguments instead of t5config * broadcast noise time to batch dim * fix call * added scale_to_features * fix weights * transpose laynorm weight * scale is a vector * scale the query outputs * added comment * undo scaling * undo depth_scaling * inital get_extended_attention_mask * attention_mask is none in self-attention * cleanup * manually invert attention * nn.linear need bias=False * added T5LayerFFCond * remove to fix conflict * make style and dummy * remove unsed variables * remove predict_epsilon * Move accelerate to a soft-dependency (#1134) * finish * finish * Update src/diffusers/modeling_utils.py * Update src/diffusers/pipeline_utils.py Co-authored-by: Anton Lozhkov <anton@huggingface.co> * more fixes * fix Co-authored-by: Anton Lozhkov <anton@huggingface.co> * fix order * added initial midi to note token data pipeline * added int to int tokenizer * remove duplicate * added logic for segments * add melgan to pipeline * move autoregressive gen into pipeline * added note_representation_processor_chain * fix dtypes * remove immutabledict req * initial doc * use np.where * require note_seq * fix typo * update dependency * added note-seq to test * added is_note_seq_available * fix import * added toc * added example usage * undo for now * moved docs * fix merge * fix imports * predict first segment * avoid un-needed copy to and from cpu * make style * Copyright * fix style * add test and fix inference steps * remove bogus files * reorder models * up * remove transformers dependency * make work with diffusers cross attention * clean more * remove @ * improve further * up * uP * Apply suggestions from code review * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py * loop over all tokens * make style * Added a section on the model * fix formatting * grammer * formatting * make fix-copies * Update src/diffusers/pipelines/__init__.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/spectrogram_diffusion/pipeline_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * added callback ad optional ionnx * do not squeeze batch dim * clean up more * upload * convert jax to nnumpy * make style * fix warning * make fix-copies * fix warning * add initial fast tests * add initial pipeline_params * eval mode due to dropout * skip batch tests as pipeline runs on a single file * make style * fix relative path * fix doc tests * Update src/diffusers/models/t5_film_transformer.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/models/t5_film_transformer.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/api/pipelines/spectrogram_diffusion.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add MidiProcessor * format * fix org * Apply suggestions from code review * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py * make style * pin protobuf to <4 * fix formatting * white space * tensorboard needs protobuf --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <anton@huggingface.co>

…line (#2779) [2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines Co-authored-by: njindal <njindal@adobe.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* small fixes to the text to video doc. * add: Spaces link. * add: warning on research-only model.

* Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * format

* Skip mps in text-to-video tests. * style * Skip UNet3D mps tests.

* add contronet flax --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>

* add colab notebook and spaces * fix image link

* Add AudioLDM * up * add vocoder * start unet * unconditional unet * clap, vocoder and vae * clean-up: conversion scripts * fix: conversion script token_type_ids * clean-up: pipeline docstring * tests: from SD * clean-up: cpu offload vocoder instead of safety checker * feat: adapt tests to audioldm * feat: add docs * clean-up: amend pipeline docstrings * clean-up: make style * clean-up: make fix-copies * fix: add doc path to toctree * clean-up: args for conversion script * clean-up: paths to checkpoints * fix: use conditional unet * clean-up: make style * fix: type hints for UNet * clean-up: docstring for UNet * clean-up: make style * clean-up: remove duplicate in docstring * clean-up: make style * clean-up: make fix-copies * clean-up: move imports to start in code snippet * fix: pass cross_attention_dim as a list/tuple to unet * clean-up: make fix-copies * fix: update checkpoint path * fix: unet cross_attention_dim in tests * film embeddings -> class embeddings * Apply suggestions from code review Co-authored-by: Will Berman <wlbberman@gmail.com> * fix: unet film embed to use existing args * fix: unet tests to use existing args * fix: make style * fix: transformers import and version in init * clean-up: make style * Revert "clean-up: make style" This reverts commit 5d6d1f8. * clean-up: make style * clean-up: use pipeline tester mixin tests where poss * clean-up: skip attn slicing test * fix: add torch dtype to docs * fix: remove conversion script out of src * fix: remove .detach from 1d waveform * fix: reduce default num inf steps * fix: swap height/width -> audio_length_in_s * clean-up: make style * fix: remove nightly tests * fix: imports in conversion script * clean-up: slim-down to two slow tests * clean-up: slim-down fast tests * fix: batch consistent tests * clean-up: make style * clean-up: remove vae slicing fast test * clean-up: propagate changes to doc * fix: increase test tol to 1e-2 * clean-up: finish docs * clean-up: make style * feat: vocoder / VAE compatibility check * feat: possibly expand / cut audio waveform * fix: pipeline call signature test * fix: slow tests output len * clean-up: make style * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: William Berman <WLBberman@gmail.com>

* TIME first commit * styling. * styling 2. * fixes; tests * apply styling and doc fix. * remove sups. * fixes * remove temp file * move augmentations to const * added doc entry * code quality * customize augmentations * quality * quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Relax DiT test * relax 2 more tests * fix style * skip test on mac due to older protobuf

* update import onnxruntime package, enable onnxruntime-rocm and onnxruntime-training * add ort_nightly_gpu

* up * fix more 7 * up * finish

* update docs to reflect the updated ckpts. * update: point about prompt. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * emove image resizing. * Apply suggestions from code review * Apply suggestions from code review --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* comment update * comment update

Narsil and others added 30 commits March 16, 2023 15:57

[docs] Add safety checker to ethical guidelines (#2699)

77e0ea8

add safety checker

train_unconditional save restore unet parameters (#2706)

a4b2c2f

Improve deprecation error message when using cross_attention import (#…

a41850a

…2710) Improve error message

fix image link in inpaint doc (#2693)

f4bbcb2

fix link Co-authored-by: yiyixuxu <yixu310@gmail,com>

[docs] Update ONNX doc to use optimum (#2702)

a169571

* minor edits to onnx and openvino docs. * Apply suggestions from code review Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> --------- Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

[Tests] Correct PT2 (#2724)

9ecd924

* [Tests] Correct PT2 * correct more * move versatile to nightly * up * up * again * Apply suggestions from code review

Update README.md

ec2c1bc

Fix more slow tests

fdcff56

Update mps.mdx (#2749)

d91dc57

Fix typos

Update torch2.0.mdx (#2748)

a9f28b6

Fix typos

Update fp16.mdx (#2746)

af86b0c

Fix typos

Update dreambooth.mdx (#2742)

25ed7cb

Fix typos

Update philosophy.mdx (#2752)

afe59a9

Fix typos

Update text_inversion.mdx (#2751)

ba87c16

Fix typos

Update numbers for Habana Gaudi in documentation (#2734)

c10d685

Update numbers for Habana Gaudi in doc

Fix typos (#2715)

f024e00

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

[1929]: Add CLIP guidance for Img2Img stable diffusion pipeline (#2723)

0042efd

* [Img2Img]: Copyover img2img pipeline * [Img2Img]: img2img pipeline * [Img2Img]: img2img pipeline * [Img2Img]: img2img pipeline --------- Co-authored-by: njindal <njindal@adobe.com>

Add guidance start/end parameters to StableDiffusionControlNetImg2Img…

58bcf46

…Pipeline (#2731) * Add guidance start/end parameters to community controlnet img2img pipeline * Fix formats

Fix mps tests on torch 2.0 (#2766)

1fcf279

Add option to set dtype in pipeline.to() method (#2317)

b33bd91

add test_to_dtype to check pipe.to(fp16)

stable diffusion depth batching fix (#2757)

ca1e407

[docs] update torch 2 benchmark (#2764)

a39d42b

* Update benchmark for A100, 3090, 3090 Ti, 4090. * Link to PyTorch blog. * Update install instructions.

[docs] Clarify purpose of reproducibility docs (#2756)

7fe8861

* clarify purpose of repro docs * apply feedback

mps: remove warmup passes (#2771)

92e1164

* Remove warmup passes in mps tests. * Update mps docs: no warmup pass in PyTorch 2 * Update imports.

Support for Offset Noise in examples (#2753)

e0d8c9e

* add noise offset * make style

sayakpaul and others added 21 commits March 23, 2023 09:55

add: section on multiple controlnets. (#2762)

c681ad1

* add: section on multiple controlnets. Co-authored-by: William Berman <WLBberman@gmail.com> * fix: docs. * fix: docs. --------- Co-authored-by: William Berman <WLBberman@gmail.com>

deduplicate training section in the docs. (#2788)

0d633a4

[UNet3DModel] Fix with attn processor (#2790)

a8315ce

* [UNet3DModel] Fix attn processor * make style

[doc wip] literalinclude (#2718)

8e35ef0

Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' (#2732)

14e3a28

The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.

[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipe…

055c90f

…line (#2779) [2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines Co-authored-by: njindal <njindal@adobe.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

[Docs] small fixes to the text to video doc. (#2787)

0d7aac3

* small fixes to the text to video doc. * add: Spaces link. * add: warning on research-only model.

Update train_text_to_image_lora.py (#2767)

dc5b4e2

* Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * format

Skip mps in text-to-video tests (#2792)

aa0531f

* Skip mps in text-to-video tests. * style * Skip UNet3D mps tests.

Flax controlnet (#2727)

df91c44

* add contronet flax --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>

[docs] Add Colab notebooks and Spaces (#2713)

1870fb0

* add colab notebook and spaces * fix image link

Update train_text_to_image_lora.py (#2795)

4a98d6e

Relax DiT test (#2808)

f6feb69

* Relax DiT test * relax 2 more tests * fix style * skip test on mac due to older protobuf

Update onnxruntime package candidates (#2666)

c4892f1

* update import onnxruntime package, enable onnxruntime-rocm and onnxruntime-training * add ort_nightly_gpu

[Stable UnCLIP] Finish Stable UnCLIP (#2814)

dbcb15c

* up * fix more 7 * up * finish

StableDiffusionModelEditingPipeline documentation (#2810)

9fb0217

* comment update * comment update

cmdr2 merged commit 5644969 into cmdr2:main Mar 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upstream #1

Upstream #1

Uh oh!

cmdr2 commented Mar 27, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Upstream #1

Upstream #1

Uh oh!

Conversation

cmdr2 commented Mar 27, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants