Feature/parallelize model download #42675

RituAddepalli · 2025-12-06T16:04:26Z

This PR modifies the cached_files() function in hub.py to download multiple files (like sharded model weights) in parallel instead of sequentially.This change safely parallelizes multi-file downloads without altering any existing logic. Single-file downloads remain unchanged, and all files are still resolved to the same cache locations. This improves performance without introducing functional risks.

Details:

Before:
Single file → uses hf_hub_download().
Multiple files → uses snapshot_download(), downloading files sequentially.

After (this PR):
Single file → still uses hf_hub_download().
Multiple files → each file is downloaded in parallel using concurrent.futures.ThreadPoolExecutor with hf_hub_download().
This improves download speed for multiple files without changing any other functionality.

Example snippet of the change:
from concurrent.futures import ThreadPoolExecutor
def cached_files(...):
...
if len(full_filenames) == 1:
return [hf_hub_download(...)]
else:
resolved_files = []

def download_file(file):
    return hf_hub_download(
        path_or_repo_id,
        file,
        subfolder=None if len(subfolder) == 0 else subfolder,
        repo_type=repo_type,
        revision=revision,
        cache_dir=cache_dir,
        user_agent=user_agent,
        force_download=force_download,
        proxies=proxies,
        token=token,
        local_files_only=local_files_only,
    )

with ThreadPoolExecutor(max_workers=min(8, len(full_filenames))) as executor:
    for f in executor.map(download_file, full_filenames):
        resolved_files.append(f)

return resolved_files

Impact:

No functional changes for single-file downloads.
Multi-file downloads are faster due to parallelization.

Testing Done

Test 1: test_parallel_download.py
3 files (config, vocab, model) start downloading at the same time
Verified parallel start timestamps
output:
Directory: C:\gitcontributions\transformers

Mode LastWriteTime Length Name

d----- 04-12-2025 22:00 hf_cache_parallel_test2
Starting download: bert-base-uncased/config.json at 22:00:13
Starting download: bert-base-uncased/vocab.txt at 22:00:13
Starting download: bert-base-uncased/pytorch_model.bin at 22:00:13

Parallel download completed in 0.01 seconds

Test 2: test_parallel_multi_models.py

Downloaded 3 models in parallel:
bert-base-uncased
distilbert-base-uncased
roberta-base
All resolved files appeared in correct HF cache structure.
output:

Conclusion:

In this PR, we partially addressed the issue:

Implemented parallel downloads for multiple files in cached_files() using ThreadPoolExecutor with hf_hub_download().
Single-file downloads and actual weight-loading integration remain unchanged.
While this does not fully implement parallel weight loading, it demonstrates the approach and improves download speed for multiple files.
Value of this partial work:

Acts as a proof of concept for parallel downloads.
Shows performance gains for multi-file downloads, which is relevant to sharded weights.
Provides a foundation for integrating full parallel weight loading in future updates.
This PR is therefore helpful for demonstrating the technique, even if it does not completely resolve the original issue.
Request for Review
Tagging:
@huggingface/transformers-maintainers @Cyrilvallez

Please review when convenient — happy to improve or modify any part!

…face#42129) * fix * add test * fix test * fix the obvious * more fix * fix * continue to improve * more fix * more * fix * fix * finally * CI

@SunMarc

* ah actually we don't discard lm head if missing -> needs to be moved to correct device and etc * fix some tests * small fixes * up * up * dik why we tie weights twice but,..,,. * ups * removeunused * fix hunyuan * small fix * nits * ish * up * rev * fix more tie weights keys * small fixes * nit * update * fix and fix * fix a test * glubs * current shitty changes * ship validated ones * more * more update * more * more * more * mllama * more up * fix ernie * fix xopies * up more * more fixes * up * up * fix-copies * fix more * more updates * AI UPDATE * up * hoey * make it fast * fix * lol * fix asjusting * more fixes * _dtype nit * up * nit * update * update * remove semaphores * fix import to avoid jit execution * try to remove custom tiing logic when its stupid * fix more individual models * fix whisper as well * fix? * fox umt5 * improve tqdm bar * cleanup a bit * oupsi * some updates * improve * remove all buffering -> much faster without it * remove some tie_weights custome funcs when not needed * more fixes related to strict matching regex * remove ALL custom tie weights * small update * revert change to init scheme (no need for params) * mixtral init * try less strict source check * tied weight first shot to the fiiiixxxxxx * does this help? * :) * fix some ppolry defined tied_weights_keys for now * subclass nn.Parameters * up * lol * Ouiiii * fix led * fix long cat flash * fix qwen and long cat flash * properly fix qwen init * just push this for now * propnet is dumb * update * push * remove explict sharing of some tied keys. * update decoder.bias * moe case * more changes to untangle old hardcoded ting * fixup * fix big faileurs * fix prophnet * fix resize token embeddings * nits * fix xcodex * asyncio? * fix smart apply * fix data-2-vec * [build-ci-image] * checkout * uupdate * fix hunyuan * update error message * fix deformable detr * fixes * fix init weights for non param gate up projs * shared todo? * update some models * big revert, don't break this behaviour * ty @SunMarc this fixes the buffers Co-authored-by: SunMarc <SunMarc@users.noreply.github.com> * mt5 fuck * fix lxmbert * nuke slow test fetcher * fix zamba and deepcopy for now * fix zamba tied weight keys! ~ * fix-copies * update fetch terst * fix gradient for test modeling common! * break "shared" for now I will fix tomorrow changes are properly isoalted now :) * does this fix marian? probably not * fix some vlms * D fine seems to handle this well * glob is fine actually * fix dab detr * small steps * opusy * fix some more models? * yups * better erro * fix? * fix double escape * escape wehere it makes sense * ?? * fix ibert * fix tvp as well * more fxes * try always download ref PR * ONONONO * big fixup * more fixup * small step * small nits * nits * brut force some stuff * fix vilt * make sure special models that always need tie always tie * cleaning up * small nits * fix zamba and bridge tower! * just fixup * potential culprits * revert bark and fix bridgetower * remove now non existant tie_weights * ? * lol reformer actually had nothing tied! * wow these two fucking models were really not well made * fix sam family! * fix bark revision * fix speech2test ? * push this for now.... * upsy * the fuck * fix rtdetr * update * proper * wow that one 's annoying * update * try to find the culprit * get some help on common * nit about general init and cls.padding_idx * revert num workers update * remove old loading func * fix glob * add annotations * fix re * small improvements * clean some stuff * improvements * someone did not understannnnnnd what I tried to dooo or does BNB not support that either? * gluos * fix case when `.` is just not there * remove unused arg * recover orignal parameter/buffer using _original * fix glob issu * this? * deepspeed best-effort * remove unused stuff * Update tie weight keys as they were just wroong Co-authored-by: Benjamin Bossan <benjaminbossan@users.noreply.github.com>" * up * augustuc clauss, a gloubs gloups gloubs * fixup * fixup * there was fucking typo * mrain * nits * fix marian 3 remaining tests * one more * fix some of the copies, not all :) * small cleanup * one propertest * fix core model loadig tes * attempt a new test * fix some of the annoying tests by supporting reading .bin sometimes * push * push more small fixes * remove 1 useless test * up * fix audio flamingo post rebase * fixup * some small updatess * fix sam models * nits * up * updates * onem ore * skip this stupid test * some other fixes * fixup * update * skip more offloaded stuff * oups * ups * update mixtral * skip this one * LET"SGO * fixup * rope delta order * fix csm * small nit --------- Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com> Co-authored-by: SunMarc <SunMarc@users.noreply.github.com> Co-authored-by: Marc Sun <marc@huggingface.co>

…1681) * remove deprecations from v4 * delete those for v5 * delete these also * fix tests * add dummy test config * fix copies * SDPA raises warning but doesn't automatically change to eager * max size can't be deleted, sadly * oke, this should allow loading from-pretrained, but delete everything else * style * fix popping from kwargs * audios rename * padding defaults to self * modular fix * address comment * style

* WIP * WIP

fix checkpoint loading with DeepSpeed ZeRO3 Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com> Co-authored-by: Ferdinand Mom <47445085+3outeille@users.noreply.github.com>

* fix * try oh try * change fix

* add cross links * a few nits * last bit * Update CONTRIBUTING.md Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com> * Update docs/source/en/transformers_as_backend.md Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com> --------- Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

* Stop inheriting tests! * Just use a del instead * fixup

* refactor check_auto_docstring with AST * use dataclass for ASTIndexes * simplify and improve readability * fix missing imports * fix modular * fix modular issues

* fix * properly * fix tests

…rch 2.10 nightly) (huggingface#42212) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

…uggingface#42191) * everything untilo informer * everything until perceiver * all of them finally * style * replace by transformers init everywhere * use relative import instead * deprecated models * style * start contexts * small fixes * fix modular * remove class switch * do not initialize tied weights * typo * fix * improve * improve comments * improve * improve * fix zamba * fix import * add the post_init * more post_init * fix * protect * more post_init * fix * fixes * fix * fix * switch flag name * more fixes * fixes * fixes * copies * fix * finally find the culprit * style * last small * big bird * better * update init check * final touch * do it everywhere

* init * update * add * Update video_processing_glm46v.py * update doc * Update modular_glm46v.py * 2 * Update processing_glm46v.py * 21 * Update check_repo.py * Update check_repo.py * Update test_processor_glm46v.py * Update modeling_auto.py * update * Update glm46v.md * Update configuration_auto.py * 2 * update with glm46v import * uppercase * upload * upload * upload with modular * 1 * - * update * 1 * 2 * 1 * 2 * 2 * 1 * update config * 1 * update as automoel * 1 * try remove * delete * delete * test * update * 1 * Update modular_glm46v.py * Update test_modeling_glm46v.py * update 1513 * 1 * use PreTrainedConfig * Update modular_glm46v.py * Update configuration_glm46v.py * model_type = "glm46v" * remove glm46v_text * Update image_processing_auto.py * 1 * update readme * GLM-4.6V * update * update * Update __init__.py * update * update doc * Update check_docstrings.py * update doc * fix copies for tied weight keys! * more fixup --------- Co-authored-by: Raushan Turganbay <raushan@huggingface.co> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Arthur <arthur.zucker@gmail.com>

add copy

* start * more fixes

* Fix a bug in the CB memory calcuation * Nit in example * Replace _free_blocks with a proper object BlockManager * Removed dead code * Added hasing mechanism (wip) * Added de-duplication * Add de-initialization mechnaism * Add prefix detection * Ensure we always keep 1 token for decode start * Removed some todos and small fix * Update src/transformers/generation/continuous_batching/cache.py Co-authored-by: Luc Georges <McPatate@users.noreply.github.com> * Update src/transformers/generation/continuous_batching/continuous_api.py Co-authored-by: Luc Georges <McPatate@users.noreply.github.com> * DOCSSSS * Review comments * Style * Added a flag to allow prefix sharing * [IMPORTANT] bug fix for prefix length memoization * Added a test for Cb prefix sharing * Example, start of refactor * End of refactor for example script * Added a do sample arg * Added reporting on prefix sharing * Added a context managr option for CB manager * Nit and style * Review comment from ArthurZucker --------- Co-authored-by: Luc Georges <McPatate@users.noreply.github.com>

* remove loop over modules * no need for set casting

…re (huggingface#42230) sdpa Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>

* FIX Broken PEFT adapter loading For some time now, loading PEFT adapters directly with transformers is broken when using revisions or subfolders. To check, run: RUN_SLOW=1 pytest tests/peft_integration/test_peft_integration.py -k test_peft_from_pretrained_hub_kwargs This PR makes the PEFT tests pass. The PR causing this is huggingface#41445 (bad commit: b720faf, previous good comit: 80465c7). However, that PR also caused other errors (see huggingface#41604), which is why this error was not immediately obvious. * Fix for adapter_kwargs being None

fix arg name

* Fix UnboundLocalError in RT-DETR loss computation Initialize auxiliary_outputs to None before conditional use to prevent UnboundLocalError when config.auxiliary_loss is False. Fixes the error: UnboundLocalError: local variable 'auxiliary_outputs' referenced before assignment This occurs when auxiliary_loss is disabled but the variable is still referenced later in the function. * Update src/transformers/loss/loss_rt_detr.py --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Stop inheriting tests! * Just use a del instead * fixup * Stop using del! * make fixup

…e#42246) * fix device * fix * CI * simplify a bit

…uggingface#42206) * fix 1 * fix 2: bark * fix 2: mamba * fix 4: Speech2TextModelIntegrationTests * fix 5: Aria * fix 6: RTDetrModelIntegrationTest * fix 7: PLBartBaseIntegrationTest * fix 8: XLMRobertaModelIntegrationTest * fix 9: TvpModelIntegrationTests * fix 10: LlavaForConditionalGenerationIntegrationTest * fix 11: RTDetrV2ModelIntegrationTest * fix 12: HieraModelIntegrationTest * fix 13: Olmo2IntegrationTest * fix 14: BarkModelIntegrationTests * fix 15: Rag * fix 16: JambaModelIntegrationTest * run * fix 17: ImageGPTModelTest * fix 18: MBartEnroIntegrationTest * revert * style --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* i am so confused, too many circular dependencies. Delete and see what happens * pop if exists * fix a few tests * fix loading generation params from model config * oh no, revert this * replace audios with audio in docs * fix tests * fix last test * i am dumb, typo * Update src/transformers/generation/configuration_utils.py Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> * Update tests/utils/test_modeling_utils.py Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> --------- Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

* allow VLMs to have a correct `base_model` * fix copies * fix copies? * empty commit * fix copies * nits after rebase * fix copies * add a test * skip more tests * fiix copies, ig have to do it in all PRs after rebase

* fix * fix * 1 * 2 * 2 * 2 * 2 * 2 * 2 * 2 * 2 * 2 * so many batch_size=13 --> batch_size=2 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

…gface#42258) Revert "Make tests run in less time by reducing `batch_size` (huggingface#42213)" This reverts commit d708dd1.

…#42182) * Cleanup reference to TFBertTokenizer * Remove the GPT2 TF tokenizer too

…not always present (huggingface#42593) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

* [XPU] Fix fp8 UT patch * make style --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

…40691) * fix resume from epoch >= 1 * add test checking order of sampled data points * add require_torch_non_multi_accelerator decorator to test method * move the epoch setting of epoch_dataloader before iterating over it * make fixup

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

Fix three typos found in code comments: - 'avaoid' → 'avoid' in modeling_utils.py - 'weigth' → 'weight' in trainer_utils.py - 'Templace' → 'Template' in convert_slow_tokenizer.py These typos appeared in TODO comments and inline documentation.

Co-authored-by: marconaguib <marco.naguib@aphp.fr>

…uggingface#42436) * Fix mixed torch.Tensor and DTensor error by registering fsdp forward for trainer.mode.generate * Apply fsdp forward register when fsdp is enabled --------- Co-authored-by: yiminzme <yiminzme@gmail.com> Co-authored-by: Ferdinand Mom <47445085+3outeille@users.noreply.github.com>

* Matching FA2 naming under kernels * make style * convert model * Follow the comments

* fix * style * up

* transfer commit * Allow fullgraph and little fixes * fix when no measurements * CB is better at handling compile. Also can be benched. * Style * Add sumarrized by default * Doc and better CG logic * CG logic rollback * Update to Fa3 thx to Anton * style

) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

* Add SDPA and Flash Attention support for PatchTST model - Add _supports_sdpa = True and _supports_flash_attn = True to PatchTSTPreTrainedModel - The existing PatchTSTAttention class already uses ALL_ATTENTION_FUNCTIONS to select the attention implementation based on config._attn_implementation - Fix test_modeling_patchtst.py _prepare_for_class for dynamic batch sizes * Guard PatchTST positional init under ZeRO-3 * Force SDPA in PatchTST regression integration test * Use sdpa attn in PatchTST regression test * fixups re tests --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: vasqu <antonprogamer@gmail.com> Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

fix

* draft * draft * draft * draft * draft * draft * draft * draft * draft * draft * draft * fail to see the check * fail to see the check * fail to see the check * fail to see the check * fail to see the check * Apply style fixes * fail to see the check * fail to see the check * fail to see the check * Apply repo. consistency fixes * fail to see the check * Apply repo. consistency fixes * fail to see the check * delete * Apply repo. consistency fixes * comment * Apply repo. consistency fixes * comment * Apply repo. consistency fixes * comment * Apply repo. consistency fixes * comment * Apply repo. consistency fixes * back * check * check * check * check * check --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* fix * fix * rm unnecessary config * remove references

* Create cache when training in case generate needs being called * Align modular * fixes * cohere * fix modular * fix * review --------- Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>

* extend FA2 and other cases to XPU, we expect all model cases except CUDAGraph specific, CUDA compute capability specific and FA3 specific can run XPU. For FA3, we are develioping Signed-off-by: Yao, Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao, Matrix <matrix.yao@intel.com> * Update modeling_mimi.py --------- Signed-off-by: Yao, Matrix <matrix.yao@intel.com>

* Add **kwargs into every Model.forward() * Add the test back in * And the others I missed * Fix udop test * Fix fast2speech2conformer test * make fixup

huggingface#42609) * fix(Qwen3VLCausalLMOutputWithPast): missing `hidden_states` and `attentions` kwargs * revert kwargs for the base class * make regenerated model files * symmetrical change for `modular_qwen3_vl_moe.py` * regenerated `modeling_qwen3_vl_moe.py`

* intial * update * add convert * fix * style * rm comment * explain * loop * fix * fix * update * Apply style fixes * fix * style * update --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

fix

…ggingface#42615) Fix minibug

github-actions · 2025-12-06T16:56:22Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

Cyrilvallez and others added 30 commits November 13, 2025 15:44

[TP] Fix parameter detection issue and some invalid TP-plans (hugging…

8740b86

…face#42129) * fix * add test * fix test * fix the obvious * more fix * fix * continue to improve * more fix * more * fix * fix * finally * CI

Add AutoTokenizer mapping for mistral3 and ministral (huggingface#42198)

c7c5470

* WIP * WIP

Fix checkpoint loading with DeepSpeed ZeRO3 (huggingface#42201)

69f5671

fix checkpoint loading with DeepSpeed ZeRO3 Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com> Co-authored-by: Ferdinand Mom <47445085+3outeille@users.noreply.github.com>

[Pop2Piano] Fix tied weights (huggingface#42193)

1c15104

* fix * try oh try * change fix

New docker from AMD (huggingface#42208)

44da439

Stop inheriting tests! (huggingface#42192)

a7e0f20

* Stop inheriting tests! * Just use a del instead * fixup

Refactor check_auto_docstring using AST (huggingface#41432)

0da7f1a

* refactor check_auto_docstring with AST * use dataclass for ASTIndexes * simplify and improve readability * fix missing imports * fix modular * fix modular issues

[BLT] Fix cache usage (huggingface#42188)

a1abf26

* fix * properly * fix tests

Update test_dynamic_cache_exportability_multiple_run (failing on to…

bb731df

…rch 2.10 nightly) (huggingface#42212) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Fix initialization guard for pytest (huggingface#42234)

31785ab

add copy

Fix TP plans for MoE models (huggingface#42236)

e4b1d1c

* start * more fixes

Loading optimization (huggingface#42239)

113629a

* remove loop over modules * no need for set casting

calls AttentionMaskConverter._unmask_unattended for xpu device befo…

33c2d1d

…re (huggingface#42230) sdpa Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>

Fix processor test for glm (huggingface#42233)

4cadb07

fix arg name

Stop inheriting tests (again) (huggingface#42247)

dd287e8

* Stop inheriting tests! * Just use a del instead * fixup * Stop using del! * make fixup

[loading] Fix device when source and target are different (huggingfac…

4f1c38b

…e#42246) * fix device * fix * CI * simplify a bit

Allow VLMs to have a correct base_model (huggingface#41589)

14b8242

* allow VLMs to have a correct `base_model` * fix copies * fix copies? * empty commit * fix copies * nits after rebase * fix copies * add a test * skip more tests * fiix copies, ig have to do it in all PRs after rebase

Make tests run in less time by reducing batch_size (huggingface#42213)

d708dd1

* fix * fix * 1 * 2 * 2 * 2 * 2 * 2 * 2 * 2 * 2 * 2 * so many batch_size=13 --> batch_size=2 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Revert "Make tests run in less time by reducing batch_size" (huggin…

67feae6

…gface#42258) Revert "Make tests run in less time by reducing `batch_size` (huggingface#42213)" This reverts commit d708dd1.

Cleanup reference to TFBertTokenizer and TFGPT2Tokenizer (huggingface…

cf3a994

…#42182) * Cleanup reference to TFBertTokenizer * Remove the GPT2 TF tokenizer too

hmellor and others added 27 commits December 3, 2025 12:04

Use getattr in standardize_rope_params because rope_parameters …

7028fe2

…not always present (huggingface#42593) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

[XPU] Fix fp8 UT patch func (huggingface#42584)

7aca519

* [XPU] Fix fp8 UT patch * make style --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

fix the FSDP1 default for reshard_after_forward (huggingface#42578)

e3de444

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

fix : cast into floats AFTER all assignments (huggingface#42587)

a8cd70f

Co-authored-by: marconaguib <marco.naguib@aphp.fr>

Fix the FA2 logic in the longcat_flash model (huggingface#42549)

dc2822a

* Matching FA2 naming under kernels * make style * convert model * Follow the comments

[Quantization] Remove dequant fp8 config (huggingface#42596)

23fb942

* fix * style * up

Allow validation_fn to be None in validate_rope (huggingface#42601

84f8eee

) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Align tie weights in Idefics (huggingface#42551)

b749132

fix

Fix Ernie Moe Test (huggingface#42595)

636ea1c

* fix * fix * rm unnecessary config * remove references

Fix some models cache initialization (huggingface#42586)

09a67b8

* Create cache when training in case generate needs being called * Align modular * fixes * cohere * fix modular * fix * review --------- Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>

Update revision so that there is a safetensors model (huggingface#42618)

2346a1a

Every model forward() should have **kwargs (huggingface#42603)

b6961c0

* Add **kwargs into every Model.forward() * Add the test back in * And the others I missed * Fix udop test * Fix fast2speech2conformer test * make fixup

[core] Fix quark (huggingface#42457)

d8e82bd

* intial * update * add convert * fix * style * rm comment * explain * loop * fix * fix * update * Apply style fixes * fix * style * update --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

Fix small weight loading example (huggingface#42622)

fa6a247

fix

Fix _is_package_available to handle underscore/hyphen equivalence (hu…

0ce40b2

…ggingface#42615) Fix minibug

Fix huggingface#42559: Parallelize model downloads in hub.py

d274cbe

Fix circular import in hub.py

3a90337

Fix import order, add hf_logging, skip hub tests without HF_TOKEN

c8bbc81

Parallelize hub.py model download and fix lint issues for CircleCI

6c41366

RituAddepalli closed this Dec 6, 2025

RituAddepalli force-pushed the feature/parallelize-model-download branch from 192ec03 to 6c41366 Compare December 6, 2025 16:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/parallelize model download #42675

Feature/parallelize model download #42675

RituAddepalli commented Dec 6, 2025

Uh oh!

github-actions bot commented Dec 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Feature/parallelize model download #42675

Feature/parallelize model download #42675

Conversation

RituAddepalli commented Dec 6, 2025

Details:

Impact:

Testing Done

Conclusion:

Uh oh!

github-actions bot commented Dec 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants