Model loading from huggingface using parallel downloads #770
dxqb merged 41 commits into Nerogar:master
|
Merged the new master branch, which makes this PR easier to read; before the merge it was a combination. |
|
Please review this when you have time, even though it is still a draft: the only open point is copying it to all models. |
After that, it is ready for merge. |
|
For future maintainers/ref: HF_HUB_ENABLE_HF_TRANSFER is disabled by default. Don't ever enable it without a toggle. This PR does not enable it. Since huggingface_hub > 32.0.0, end users who want it need to make manual modifications themselves, but given the widespread reports it is not usable at this time.
Test results
Worked without issue on 2 runs, but given my findings in their docs and repo, it's only ready for review (and then merge) after:
start-ui.bat suggested modification:
REM Default HF_HUB_DISABLE_XET to 1 because XET is still buggy (set to 0 to enable)
if not defined HF_HUB_DISABLE_XET (
set "HF_HUB_DISABLE_XET=1"
)
echo HF_HUB_DISABLE_XET=%HF_HUB_DISABLE_XET%
echo.
echo NOTE: XET, when enabled, allows higher-speed parallel downloads that can increase throughput, but it is still buggy.
echo NOTE: Only enable XET if your download speed is greater than 40 megabytes per second (MB/s), not megabits.
echo To enable XET, either set HF_HUB_DISABLE_XET=0 in the venv or modify start-ui.bat directly before running (the latter will break git fetch).
echo.
:launch
...
References on why we have to disable it on consumer systems: huggingface/xet-core#446 |
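For reference, the same "default to disabled, but respect an existing value" toggle can be sketched on the Python side. This is only an illustration, not code from this PR: the helper name `apply_xet_default` is hypothetical, and it assumes the variable has to be set before huggingface_hub is imported for it to take effect.

```python
import os


def apply_xet_default(env=None, default="1"):
    """Set HF_HUB_DISABLE_XET only when the user has not set it.

    Hypothetical helper mirroring the `if not defined` check in the
    batch snippet: an unset or empty value is treated as "not defined".
    """
    env = os.environ if env is None else env
    if not env.get("HF_HUB_DISABLE_XET"):
        env["HF_HUB_DISABLE_XET"] = default
    return env["HF_HUB_DISABLE_XET"]


if __name__ == "__main__":
    # Assumption: call this before importing huggingface_hub.
    value = apply_xet_default()
    print(f"HF_HUB_DISABLE_XET={value}")
```

A user who exports HF_HUB_DISABLE_XET=0 beforehand keeps that value, so the toggle stays opt-in without anyone having to edit start-ui.bat.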
|
tested on all implemented models |
huggingface_hub API