Model loading from huggingface using parallel downloads#770

Merged
dxqb merged 41 commits into Nerogar:master from dxqb:precache2
Aug 26, 2025
Conversation


@dxqb dxqb commented Apr 5, 2025

  • Increases download speed from about 40 Mbit/s to the maximum download limit of my connection, by using the parallel-downloads feature of the huggingface_hub API.
  • Works on any host, including your local machine, but is especially useful for cloud training, because the model is downloaded anew each training run (if you don't have persistent storage).
  • Implemented only for Flux so far, hence the "Draft PR". I will copy it to all models if you like this PR.
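The idea behind the speed-up can be sketched with a small stdlib-only helper (hypothetical, not the actual OneTrainer code): fetch the repo's files concurrently instead of one at a time. huggingface_hub's `snapshot_download` exposes the same behaviour through its `max_workers` parameter.

```python
from concurrent.futures import ThreadPoolExecutor

def parallel_fetch(filenames, fetch_one, max_workers=8):
    """Fetch every file with fetch_one(), up to max_workers at a time."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so results line up with filenames
        return list(pool.map(fetch_one, filenames))

# With huggingface_hub installed, fetch_one could wrap hf_hub_download,
# e.g. lambda f: hf_hub_download(repo_id="some/repo", filename=f)
```

Because model shards are large and independent, several connections saturate the link where a single sequential download does not.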


dxqb commented Apr 5, 2025

merged #639 because there is a small dependency. Find the stand-alone code of this PR here: 851c3f2


dxqb commented Apr 21, 2025

Merged the new master branch, which makes this PR easier to read; before, it was a combination of several changes.

@dxqb dxqb requested a review from Nerogar June 3, 2025 09:01

dxqb commented Jun 3, 2025

Please review this when you have the time, even though it is still a draft; the only open point is copying it to all models.
It is a big time and cost saver on clouds, cutting model download time from ~15 minutes to ~3 minutes.

Review thread: modules/modelLoader/mixin/HFModelLoaderMixin.py
Review thread: modules/modelLoader/flux/FluxModelLoader.py

dxqb commented Aug 20, 2025

  • copy to the other models, at least the newer and largest ones

then it is ready for merge


O-J1 commented Aug 21, 2025

For future maintainers/ref:

HF_HUB_ENABLE_HF_TRANSFER is disabled by default. Don't ever enable it without a toggle. This PR does not enable it.

Since huggingface_hub >= 0.32.0, hf-xet capabilities have been merged into the base package. Xet is buggy and not ready for consumer usage, so we need to set HF_HUB_DISABLE_XET=1. This will slow downloads down, but that does not matter for consumer systems, since you only download once.

If end users want it, they need to make the modification manually themselves, but given the widespread reports it is not usable at the current time.
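Besides the start-ui.bat change, the same opt-out can be applied defensively from Python. This is a minimal sketch, assuming the standard environment-variable mechanism huggingface_hub uses; `setdefault` keeps an explicit user override (e.g. `HF_HUB_DISABLE_XET=0`) intact.

```python
import os

# Opt out of the Xet backend; huggingface_hub reads this environment
# variable, so it must be set before any download call is made.
os.environ.setdefault("HF_HUB_DISABLE_XET", "1")
```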

Test results

Worked without issue on 2 runs, but given my findings in their docs and repo, it's only ready for review (and then merge) after the following change:

Suggested modification to start-ui.bat:

REM Default HF_HUB_DISABLE_XET to 1 since XET is still buggy (set to 0 to enable)
if not defined HF_HUB_DISABLE_XET (
    set "HF_HUB_DISABLE_XET=1"
)
echo HF_HUB_DISABLE_XET=%HF_HUB_DISABLE_XET%
echo.
echo NOTE: XET (when enabled) allows higher speed parallel downloads which can increase throughput, however it is buggy.
echo NOTE: Only enable XET if your download speed is greater than 40 megabytes per second (MB/s), not megabits.
echo To enable XET, either export HF_HUB_DISABLE_XET=0 in the venv or modify start-ui.bat directly before running (the latter will break git fetch).
echo.

:launch
...

References on why we have to disable it on consumer systems:

huggingface/xet-core#446
huggingface/xet-core#409
huggingface/xet-core#448
huggingface/xet-core#400
Multiple reports in discord (HF official discord and others)


dxqb commented Aug 26, 2025

tested on all implemented models

@dxqb dxqb marked this pull request as ready for review August 26, 2025 15:48
@dxqb dxqb merged commit 3bf082f into Nerogar:master Aug 26, 2025
1 check passed
@dxqb dxqb mentioned this pull request Aug 26, 2025