Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1808 commits
Select commit Hold shift + click to select a range
8405276
Merge branch 'mergeability-pr-45075' into features-and-defects-750
evalstate Apr 29, 2026
773209e
Merge branch 'mergeability-pr-45040' into features-and-defects-750
evalstate Apr 29, 2026
358420e
Merge branch 'mergeability-pr-44989' into features-and-defects-750
evalstate Apr 29, 2026
936f92c
Fix train_batch_size and eval_batch_size to respect split_batches config
MinuriRajapakse Apr 29, 2026
6377e75
Merge branch 'mergeability-pr-44952' into features-and-defects-750
evalstate Apr 29, 2026
9da1c7f
Merge branch 'mergeability-pr-44940' into features-and-defects-750
evalstate Apr 29, 2026
b2da860
Merge branch 'mergeability-pr-44923' into features-and-defects-750
evalstate Apr 29, 2026
374660d
Merge branch 'mergeability-pr-44907' into features-and-defects-750
evalstate Apr 29, 2026
309c85e
Merge branch 'mergeability-pr-44893' into features-and-defects-750
evalstate Apr 29, 2026
02e557f
Merge branch 'mergeability-pr-44891' into features-and-defects-750
evalstate Apr 29, 2026
2950cb8
Merge branch 'mergeability-pr-44889' into features-and-defects-750
evalstate Apr 29, 2026
552fba0
Merge branch 'new-model-original-code' into granite_speech_plus
zvik Apr 29, 2026
9019aa5
Merge branch 'mergeability-pr-45694' into features-and-defects-750
evalstate Apr 29, 2026
cb6444a
Merge branch 'mergeability-pr-44836' into features-and-defects-750
evalstate Apr 29, 2026
90d49d3
Merge branch 'mergeability-pr-44827' into features-and-defects-750
evalstate Apr 29, 2026
a28192f
Merge branch 'mergeability-pr-44793' into features-and-defects-750
evalstate Apr 29, 2026
d534dbe
Merge branch 'mergeability-pr-44781' into features-and-defects-750
evalstate Apr 29, 2026
e40fdff
Merge branch 'mergeability-pr-44771' into features-and-defects-750
evalstate Apr 29, 2026
5f46f04
add compressed-tensor fp8 integeration
jiqing-feng Apr 29, 2026
297b062
Merge branch 'mergeability-pr-44731' into features-and-defects-750
evalstate Apr 29, 2026
4102d18
Merge branch 'mergeability-pr-44724' into features-and-defects-750
evalstate Apr 29, 2026
74e7ea5
Merge branch 'mergeability-pr-44713' into features-and-defects-750
evalstate Apr 29, 2026
8e8a6bd
Merge branch 'mergeability-pr-44697' into features-and-defects-750
evalstate Apr 29, 2026
27be42a
Rerun modular_model_converter
zvik Apr 29, 2026
d6cb5cf
Merge branch 'mergeability-pr-44680' into features-and-defects-750
evalstate Apr 29, 2026
bf523b9
Merge branch 'mergeability-pr-44676' into features-and-defects-750
evalstate Apr 29, 2026
998596f
Merge branch 'mergeability-pr-45695' into features-and-defects-750
evalstate Apr 29, 2026
f5a3168
update
jiqing-feng Apr 29, 2026
25ab701
Merge branch 'mergeability-pr-44664' into features-and-defects-750
evalstate Apr 29, 2026
5c1770e
Merge branch 'mergeability-pr-44662' into features-and-defects-750
evalstate Apr 29, 2026
59762f0
update
jiqing-feng Apr 29, 2026
3479d85
Merge branch 'mergeability-pr-44660' into features-and-defects-750
evalstate Apr 29, 2026
8e634f2
Merge branch 'mergeability-pr-44650' into features-and-defects-750
evalstate Apr 29, 2026
023c96b
Merge branch 'mergeability-pr-44641' into features-and-defects-750
evalstate Apr 29, 2026
0facb18
update copyright
jiqing-feng Apr 29, 2026
5b3f1ae
Merge branch 'mergeability-pr-44635' into features-and-defects-750
evalstate Apr 29, 2026
119c0e5
Merge branch 'mergeability-pr-44626' into features-and-defects-750
evalstate Apr 29, 2026
e649bd2
Merge branch 'mergeability-pr-44615' into features-and-defects-750
evalstate Apr 29, 2026
d57b50a
Merge branch 'mergeability-pr-44606' into features-and-defects-750
evalstate Apr 29, 2026
92a6b57
Merge branch 'mergeability-pr-44603' into features-and-defects-750
evalstate Apr 29, 2026
a0cd47a
Merge branch 'mergeability-pr-44594' into features-and-defects-750
evalstate Apr 29, 2026
0aecbf7
Merge branch 'mergeability-pr-44587' into features-and-defects-750
evalstate Apr 29, 2026
d6392bb
Merge branch 'mergeability-pr-44585' into features-and-defects-750
evalstate Apr 29, 2026
e0e8e7f
Merge branch 'mergeability-pr-44569' into features-and-defects-750
evalstate Apr 29, 2026
e25d2db
Merge branch 'mergeability-pr-45699' into features-and-defects-750
evalstate Apr 29, 2026
64c1fe9
Merge branch 'mergeability-pr-44543' into features-and-defects-750
evalstate Apr 29, 2026
8fa92b1
Merge branch 'mergeability-pr-44438' into features-and-defects-750
evalstate Apr 29, 2026
898a7bf
Merge branch 'mergeability-pr-44408' into features-and-defects-750
evalstate Apr 29, 2026
0dcd40a
Merge branch 'mergeability-pr-44385' into features-and-defects-750
evalstate Apr 29, 2026
b4d9b79
Merge branch 'mergeability-pr-44369' into features-and-defects-750
evalstate Apr 29, 2026
76e5ba2
Merge branch 'mergeability-pr-44348' into features-and-defects-750
evalstate Apr 29, 2026
7d64052
Merge branch 'mergeability-pr-44270' into features-and-defects-750
evalstate Apr 29, 2026
6bc6046
Merge branch 'mergeability-pr-44259' into features-and-defects-750
evalstate Apr 29, 2026
a461941
Merge branch 'mergeability-pr-44257' into features-and-defects-750
evalstate Apr 29, 2026
4e34121
Merge branch 'mergeability-pr-44228' into features-and-defects-750
evalstate Apr 29, 2026
8c92a14
Merge branch 'mergeability-pr-44215' into features-and-defects-750
evalstate Apr 29, 2026
f7a921e
Merge branch 'mergeability-pr-44189' into features-and-defects-750
evalstate Apr 29, 2026
ae844aa
Merge branch 'mergeability-pr-44184' into features-and-defects-750
evalstate Apr 29, 2026
08f0c9d
Merge branch 'mergeability-pr-44171' into features-and-defects-750
evalstate Apr 29, 2026
2a46291
Merge branch 'mergeability-pr-44142' into features-and-defects-750
evalstate Apr 29, 2026
bc7f9f0
Merge branch 'mergeability-pr-44070' into features-and-defects-750
evalstate Apr 29, 2026
324862f
Merge branch 'mergeability-pr-44056' into features-and-defects-750
evalstate Apr 29, 2026
6f7c941
Merge branch 'mergeability-pr-44044' into features-and-defects-750
evalstate Apr 29, 2026
f2cb385
Merge branch 'mergeability-pr-44030' into features-and-defects-750
evalstate Apr 29, 2026
70cbed1
Merge branch 'mergeability-pr-44029' into features-and-defects-750
evalstate Apr 29, 2026
fccd5d1
Merge branch 'mergeability-pr-44028' into features-and-defects-750
evalstate Apr 29, 2026
20d96a8
Merge branch 'mergeability-pr-44027' into features-and-defects-750
evalstate Apr 29, 2026
d0e0c61
Merge branch 'mergeability-pr-44026' into features-and-defects-750
evalstate Apr 29, 2026
5b647ab
Merge branch 'mergeability-pr-44025' into features-and-defects-750
evalstate Apr 29, 2026
90dcaca
Merge branch 'mergeability-pr-44024' into features-and-defects-750
evalstate Apr 29, 2026
615af1e
Merge branch 'mergeability-pr-44019' into features-and-defects-750
evalstate Apr 29, 2026
fca4376
Merge branch 'mergeability-pr-44017' into features-and-defects-750
evalstate Apr 29, 2026
5d80355
Merge branch 'mergeability-pr-44013' into features-and-defects-750
evalstate Apr 29, 2026
49499f2
Merge branch 'mergeability-pr-44010' into features-and-defects-750
evalstate Apr 29, 2026
29c3c78
Merge branch 'mergeability-pr-44002' into features-and-defects-750
evalstate Apr 29, 2026
955c36a
Merge branch 'mergeability-pr-44001' into features-and-defects-750
evalstate Apr 29, 2026
ace7547
Merge branch 'mergeability-pr-44000' into features-and-defects-750
evalstate Apr 29, 2026
8772c95
Merge branch 'mergeability-pr-43999' into features-and-defects-750
evalstate Apr 29, 2026
8232096
Merge branch 'mergeability-pr-43998' into features-and-defects-750
evalstate Apr 29, 2026
4de8dda
Merge branch 'mergeability-pr-43997' into features-and-defects-750
evalstate Apr 29, 2026
c8989de
Merge branch 'mergeability-pr-43989' into features-and-defects-750
evalstate Apr 29, 2026
81e930f
Merge branch 'mergeability-pr-43967' into features-and-defects-750
evalstate Apr 29, 2026
58e454a
Merge branch 'mergeability-pr-43915' into features-and-defects-750
evalstate Apr 29, 2026
5ff7a36
Merge branch 'mergeability-pr-43911' into features-and-defects-750
evalstate Apr 29, 2026
19832fe
Merge branch 'mergeability-pr-43875' into features-and-defects-750
evalstate Apr 29, 2026
1b4bff1
Merge branch 'mergeability-pr-43863' into features-and-defects-750
evalstate Apr 29, 2026
6840389
Merge branch 'mergeability-pr-43838' into features-and-defects-750
evalstate Apr 29, 2026
377dea7
Merge branch 'mergeability-pr-43833' into features-and-defects-750
evalstate Apr 29, 2026
93c36e4
Merge branch 'mergeability-pr-43823' into features-and-defects-750
evalstate Apr 29, 2026
395701a
Merge branch 'mergeability-pr-43779' into features-and-defects-750
evalstate Apr 29, 2026
daf83e3
Merge branch 'mergeability-pr-43775' into features-and-defects-750
evalstate Apr 29, 2026
881f91f
Merge branch 'mergeability-pr-43747' into features-and-defects-750
evalstate Apr 29, 2026
8385cd1
Apply PR #43663 signature columns hook
evalstate Apr 29, 2026
565d184
Merge branch 'mergeability-pr-43654' into features-and-defects-750
evalstate Apr 29, 2026
2237a1c
Merge branch 'mergeability-pr-43651' into features-and-defects-750
evalstate Apr 29, 2026
2d42e54
Apply PR #43636: add Trainer custom metrics dict
evalstate Apr 29, 2026
46f5b4e
Merge branch 'mergeability-pr-43613' into features-and-defects-750
evalstate Apr 29, 2026
4166846
Merge branch 'mergeability-pr-43612' into features-and-defects-750
evalstate Apr 29, 2026
8ee975a
Merge branch 'mergeability-pr-43549' into features-and-defects-750
evalstate Apr 29, 2026
64d4729
Merge branch 'mergeability-pr-43543' into features-and-defects-750
evalstate Apr 29, 2026
c350757
Apply PR #43542: preserve MoE router logits
evalstate Apr 29, 2026
78aa64c
Merge branch 'mergeability-pr-43506' into features-and-defects-750
evalstate Apr 29, 2026
f8f07f3
Merge branch 'mergeability-pr-43498' into features-and-defects-750
evalstate Apr 29, 2026
0ca5b1f
Merge branch 'mergeability-pr-43492' into features-and-defects-750
evalstate Apr 29, 2026
806dfdd
Merge branch 'mergeability-pr-43484' into features-and-defects-750
evalstate Apr 29, 2026
a343f51
Merge branch 'mergeability-pr-43469' into features-and-defects-750
evalstate Apr 29, 2026
9a2ed36
Merge branch 'mergeability-pr-43466' into features-and-defects-750
evalstate Apr 29, 2026
6eee72f
Merge branch 'mergeability-pr-43451' into features-and-defects-750
evalstate Apr 29, 2026
dbe54b3
Merge branch 'mergeability-pr-43395' into features-and-defects-750
evalstate Apr 29, 2026
b4a6c9a
Merge branch 'mergeability-pr-43382' into features-and-defects-750
evalstate Apr 29, 2026
a399a87
Merge branch 'mergeability-pr-43378' into features-and-defects-750
evalstate Apr 29, 2026
e653cb9
Merge branch 'mergeability-pr-43363' into features-and-defects-750
evalstate Apr 29, 2026
70ff6b9
Merge branch 'mergeability-pr-43291' into features-and-defects-750
evalstate Apr 29, 2026
b2fb116
Merge branch 'mergeability-pr-43270' into features-and-defects-750
evalstate Apr 29, 2026
2682a32
Merge branch 'mergeability-pr-43254' into features-and-defects-750
evalstate Apr 29, 2026
05c0a11
Merge branch 'mergeability-pr-43212' into features-and-defects-750
evalstate Apr 29, 2026
dc47081
Merge branch 'mergeability-pr-43151' into features-and-defects-750
evalstate Apr 29, 2026
72108a7
Merge branch 'mergeability-pr-43133' into features-and-defects-750
evalstate Apr 29, 2026
06a631e
Fix save_pretrained for quantized models with custom serialization
480284856 Jan 4, 2026
9798054
Merge branch 'mergeability-pr-43094' into features-and-defects-750
evalstate Apr 29, 2026
5f684b2
Merge branch 'mergeability-pr-43088' into features-and-defects-750
evalstate Apr 29, 2026
5c62b01
Merge branch 'mergeability-pr-43085' into features-and-defects-750
evalstate Apr 29, 2026
f0d96f2
Merge branch 'mergeability-pr-43056' into features-and-defects-750
evalstate Apr 29, 2026
a589444
Merge branch 'mergeability-pr-43044' into features-and-defects-750
evalstate Apr 29, 2026
10edcd5
Apply PR #43028 ViT interpolation default
evalstate Apr 29, 2026
d6507a1
Merge branch 'mergeability-pr-43015' into features-and-defects-750
evalstate Apr 29, 2026
30aed07
Merge branch 'mergeability-pr-42979' into features-and-defects-750
evalstate Apr 29, 2026
114668a
Apply PR 42942: fix continuous batching result iteration
evalstate Apr 29, 2026
c4aebc8
Merge branch 'mergeability-pr-42881' into features-and-defects-750
evalstate Apr 29, 2026
b9a8ba8
Merge branch 'mergeability-pr-42865' into features-and-defects-750
evalstate Apr 29, 2026
fe55f6c
Merge branch 'mergeability-pr-42793' into features-and-defects-750
evalstate Apr 29, 2026
500741c
Merge branch 'mergeability-pr-42765' into features-and-defects-750
evalstate Apr 29, 2026
5b72469
Merge branch 'mergeability-pr-42717' into features-and-defects-750
evalstate Apr 29, 2026
70191e7
Merge branch 'mergeability-pr-42598' into features-and-defects-750
evalstate Apr 29, 2026
5b6ea02
Merge branch 'mergeability-pr-42493' into features-and-defects-750
evalstate Apr 29, 2026
8d1ed80
Merge branch 'mergeability-pr-42446' into features-and-defects-750
evalstate Apr 29, 2026
07b41a0
Merge branch 'mergeability-pr-42432' into features-and-defects-750
evalstate Apr 29, 2026
f89a6d4
Merge branch 'mergeability-pr-42424' into features-and-defects-750
evalstate Apr 29, 2026
4d5e680
Apply PR #42311: guard Blip2Processor num_query_tokens
evalstate Apr 29, 2026
2da06b7
Merge branch 'mergeability-pr-42310' into features-and-defects-750
evalstate Apr 29, 2026
0592cbf
Merge branch 'mergeability-pr-42256' into features-and-defects-750
evalstate Apr 29, 2026
5c05c8a
Merge branch 'mergeability-pr-42228' into features-and-defects-750
evalstate Apr 29, 2026
8b1728a
Merge branch 'mergeability-pr-42134' into features-and-defects-750
evalstate Apr 29, 2026
1344d92
Merge branch 'mergeability-pr-42133' into features-and-defects-750
evalstate Apr 29, 2026
10bf56b
Merge branch 'mergeability-pr-42098' into features-and-defects-750
evalstate Apr 29, 2026
451cefe
Merge branch 'mergeability-pr-42051' into features-and-defects-750
evalstate Apr 29, 2026
1f1608b
Apply huggingface_hub v1 import compatibility fix (#41973)
evalstate Apr 29, 2026
5330676
Apply Voxtral tokenizer dependency error fix (#41928)
evalstate Apr 29, 2026
835ef05
Apply variable batch loss averaging fix (#41904)
evalstate Apr 29, 2026
324663e
Merge branch 'mergeability-pr-41901' into features-and-defects-750
evalstate Apr 29, 2026
a5042f6
Merge branch 'mergeability-pr-41895' into features-and-defects-750
evalstate Apr 29, 2026
5174411
Merge branch 'mergeability-pr-41855' into features-and-defects-750
evalstate Apr 29, 2026
8aa7322
Apply PR 41844 FSDPv2 TPU checkpoint unwrap fix
evalstate Apr 29, 2026
b9eec02
Merge branch 'mergeability-pr-41827' into features-and-defects-750
evalstate Apr 29, 2026
f6e017b
Merge branch 'mergeability-pr-41798' into features-and-defects-750
evalstate Apr 29, 2026
d71b424
Merge branch 'mergeability-pr-41776' into features-and-defects-750
evalstate Apr 29, 2026
47a2381
Merge branch 'mergeability-pr-41734' into features-and-defects-750
evalstate Apr 29, 2026
f7f8c8e
Fix confusing warning in EncoderDecoderModel when training with label…
st81 Oct 19, 2025
3180b91
Delete unnecessary comments
st81 Oct 19, 2025
d91ef77
Delete unnecessary comments
st81 Oct 19, 2025
fc1149c
Merge branch 'mergeability-pr-41718' into features-and-defects-750
evalstate Apr 29, 2026
ec3287a
Merge branch 'mergeability-pr-41701' into features-and-defects-750
evalstate Apr 29, 2026
990a3b1
Fix tokenizer check script: safe dataset access, default checkpoints,…
aijadugar Oct 17, 2025
928ab5f
Merge branch 'mergeability-pr-41687' into features-and-defects-750
evalstate Apr 29, 2026
985bc4c
Merge branch 'mergeability-pr-41594' into features-and-defects-750
evalstate Apr 29, 2026
a74a105
Merge branch 'mergeability-pr-41593' into features-and-defects-750
evalstate Apr 29, 2026
aa3dfe6
Merge branch 'mergeability-pr-41561' into features-and-defects-750
evalstate Apr 29, 2026
2cf3e16
reorder and fix pe-audio-video
zucchini-nlp Apr 29, 2026
ce41ff3
Apply max_eval_batches evaluation limit
evalstate Apr 29, 2026
9a81445
Skip Qwen2.5-VL init for non-floating weights
evalstate Apr 29, 2026
0e51d47
Merge branch 'mergeability-pr-41521' into features-and-defects-750
evalstate Apr 29, 2026
cdbfa6f
Merge branch 'mergeability-pr-45702' into features-and-defects-750
evalstate Apr 29, 2026
dbc1980
Merge branch 'mergeability-pr-41458' into features-and-defects-750
evalstate Apr 29, 2026
441667c
Merge branch 'mergeability-pr-41441' into features-and-defects-750
evalstate Apr 29, 2026
f2d79f5
Merge branch 'mergeability-pr-41349' into features-and-defects-750
evalstate Apr 29, 2026
2c311d8
Merge branch 'mergeability-pr-41313' into features-and-defects-750
evalstate Apr 29, 2026
771521b
Merge branch 'mergeability-pr-41304' into features-and-defects-750
evalstate Apr 29, 2026
0924900
Apply PR #41239 T5Gemma config num_hidden_layers
evalstate Apr 29, 2026
26ff38e
Merge branch 'mergeability-pr-41224' into features-and-defects-750
evalstate Apr 29, 2026
a8cd291
Merge branch 'mergeability-pr-41169' into features-and-defects-750
evalstate Apr 29, 2026
c1a53e7
Merge branch 'mergeability-pr-41144' into features-and-defects-750
evalstate Apr 29, 2026
09656d8
Apply PR #41132: fix SpeechT5 inputs_to_logits_ratio property
evalstate Apr 29, 2026
df8a297
Apply PR #41105: honor NeuronCore device check
evalstate Apr 29, 2026
1ce51af
Merge branch 'mergeability-pr-41075' into features-and-defects-750
evalstate Apr 29, 2026
eb5d449
Merge branch 'mergeability-pr-41041' into features-and-defects-750
evalstate Apr 29, 2026
ba3a2df
Merge branch 'mergeability-pr-41033' into features-and-defects-750
evalstate Apr 29, 2026
9f2fe91
Merge branch 'mergeability-pr-40976' into features-and-defects-750
evalstate Apr 29, 2026
649b8b7
Merge branch 'mergeability-pr-40908' into features-and-defects-750
evalstate Apr 29, 2026
30d837e
Merge branch 'mergeability-pr-40898' into features-and-defects-750
evalstate Apr 29, 2026
57714f5
Merge branch 'mergeability-pr-40861' into features-and-defects-750
evalstate Apr 29, 2026
1e8dd05
Merge branch 'mergeability-pr-40840' into features-and-defects-750
evalstate Apr 29, 2026
3e6cb03
Apply checkpoint resume handling from PR #40790
evalstate Apr 29, 2026
0f50574
Merge branch 'mergeability-pr-40783' into features-and-defects-750
evalstate Apr 29, 2026
9778b88
Merge branch 'mergeability-pr-40756' into features-and-defects-750
evalstate Apr 29, 2026
ad750f6
Merge branch 'mergeability-pr-40755' into features-and-defects-750
evalstate Apr 29, 2026
fd0632c
Merge branch 'mergeability-pr-40740' into features-and-defects-750
evalstate Apr 29, 2026
3a49adb
Merge branch 'mergeability-pr-40587' into features-and-defects-750
evalstate Apr 29, 2026
0c39bb1
Merge branch 'mergeability-pr-40520' into features-and-defects-750
evalstate Apr 29, 2026
8fe862b
Merge branch 'mergeability-pr-40515' into features-and-defects-750
evalstate Apr 29, 2026
588d29a
Merge branch 'mergeability-pr-40492' into features-and-defects-750
evalstate Apr 29, 2026
1e30721
Merge branch 'mergeability-pr-40438' into features-and-defects-750
evalstate Apr 29, 2026
6abc2eb
Remove debug print statement from ShieldGemma2 conversion script
Prawal-Sharma Aug 22, 2025
2f22c86
Add fromjson filter to Jinja2 chat templates
Prawal-Sharma Aug 22, 2025
9705641
Fix typo: 'seperate' -> 'separate' in mm_grounding_dino conversion sc…
Prawal-Sharma Aug 22, 2025
5c743c1
Fix MXFP4 MLP output shape for 2D inputs
evalstate Apr 29, 2026
f480611
Merge branch 'mergeability-pr-40244' into features-and-defects-750
evalstate Apr 29, 2026
4e98599
Apply save_strategy best metric default from PR #40221
evalstate Apr 29, 2026
6775a91
Apply FSDP save_only_model sharded checkpoint fix from PR #40208
evalstate Apr 29, 2026
e0c328d
Merge branch 'mergeability-pr-40148' into features-and-defects-750
evalstate Apr 29, 2026
c628138
Port Mixtral torch export expert loop fix
evalstate Apr 29, 2026
01209a8
Merge branch 'mergeability-pr-40092' into features-and-defects-750
evalstate Apr 29, 2026
a62ad0a
Skip non-floating weight initialization
evalstate Apr 29, 2026
2b31948
Delay causal LM loss upcast until after label filtering
evalstate Apr 29, 2026
2c18eac
Apply PR #40059: use fused GPT-2 GELU
evalstate Apr 29, 2026
5566830
Apply PR #40058: support Qwen2VL GGUF
evalstate Apr 29, 2026
218bc70
Merge branch 'mergeability-pr-40055' into features-and-defects-750
evalstate Apr 29, 2026
5d2cfa8
Apply PR #40022: handle Doge MoE tuple outputs
evalstate Apr 29, 2026
bcc0a6b
Apply PR #39999: preserve meta device maps for TP
evalstate Apr 29, 2026
9a4da1e
Merge branch 'mergeability-pr-39997' into features-and-defects-750
evalstate Apr 29, 2026
7dd2aea
Merge branch 'mergeability-pr-39941' into features-and-defects-750
evalstate Apr 29, 2026
b29e162
Merge branch 'mergeability-pr-39895' into features-and-defects-750
evalstate Apr 29, 2026
e3f7ce6
Merge branch 'mergeability-pr-39866' into features-and-defects-750
evalstate Apr 29, 2026
6b5b380
Apply ProphetNet tuple encoder_outputs fix from PR 39794
evalstate Apr 29, 2026
075c2ba
Merge branch 'mergeability-pr-39793' into features-and-defects-750
evalstate Apr 29, 2026
c024ae9
Merge branch 'mergeability-pr-39785' into features-and-defects-750
evalstate Apr 29, 2026
b26a9ba
Merge branch 'mergeability-pr-39741' into features-and-defects-750
evalstate Apr 29, 2026
9c3cc50
Apply PR #39698: fix Exaone4 sliding window layer types
evalstate Apr 29, 2026
8f73042
Merge branch 'mergeability-pr-39697' into features-and-defects-750
evalstate Apr 29, 2026
2e2779e
Apply PR #39690: allow custom hf_quantizer
evalstate Apr 29, 2026
952a5f6
Apply PR #39683: respect disabled torch dynamo
evalstate Apr 29, 2026
4d1d1fc
Apply PR #39674: scale loss by data parallel size
evalstate Apr 29, 2026
39c0527
Apply PR #39599: tolerate missing trainer state
evalstate Apr 29, 2026
864cf5e
Apply PR #39560: save best checkpoints on eval
evalstate Apr 29, 2026
fcf8baf
Apply PR #39493: pin mistral-common extras
evalstate Apr 29, 2026
8c95ec9
Apply PR #39491: skip quantized weight init
evalstate Apr 29, 2026
7b6ed57
Apply PR #39468: skip bnb dispatch
evalstate Apr 29, 2026
1d68999
Apply PR #39435: add Bart mask regression
evalstate Apr 29, 2026
8205757
Merge branch 'mergeability-pr-39309' into features-and-defects-750
evalstate Apr 29, 2026
8d8de6f
Merge branch 'mergeability-pr-39257' into features-and-defects-750
evalstate Apr 29, 2026
74def78
Port Qwen3 MoE empty router logits fix (#39206)
evalstate Apr 29, 2026
d2dfb4c
Port chat extra dependency group (#39183)
evalstate Apr 29, 2026
f5c57c0
Port MoE fullgraph compile disable (#39108)
evalstate Apr 29, 2026
bb2f6c4
Apply PR #39103: fix Gemma3n audio config naming
evalstate Apr 29, 2026
618429c
Merge branch 'mergeability-pr-39047' into features-and-defects-750
evalstate Apr 29, 2026
32b7952
Merge branch 'mergeability-pr-39037' into features-and-defects-750
evalstate Apr 29, 2026
33afdec
Merge branch 'mergeability-pr-39009' into features-and-defects-750
evalstate Apr 29, 2026
27c3403
Merge branch 'mergeability-pr-38888' into features-and-defects-750
evalstate Apr 29, 2026
9ffa016
Merge branch 'mergeability-pr-38886' into features-and-defects-750
evalstate Apr 29, 2026
199afea
Merge branch 'mergeability-pr-38884' into features-and-defects-750
evalstate Apr 29, 2026
85172b4
Add PR classifications artifact for feature-defect run
evalstate Apr 29, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
16 changes: 16 additions & 0 deletions .ai/AGENTS.md
Original file line number Diff line number Diff line change
Expand Up @@ -59,6 +59,22 @@ Do not raise PRs without human validation.
- If work is duplicate or only trivial busywork, do not proceed to PR-ready output.
- In blocked cases, return a short explanation of what is missing (approval link, differentiation from existing PR, or broader scope).

## Learning transformers primitives by example

The `src/transformers/cli/agentic/` directory contains concise, self-contained
examples of how to use the core transformers primitives (`AutoModel`,
`AutoTokenizer`, `AutoProcessor`, `AutoImageProcessor`, etc.) for a wide
range of tasks — text classification, NER, QA, summarization, translation,
image classification, object detection, segmentation, depth estimation,
speech recognition, audio classification, text-to-speech, video
classification, visual QA, captioning, OCR, and more.

Each file (`text.py`, `vision.py`, `audio.py`, `multimodal.py`) follows the
same pattern: load a model and processor with `from_pretrained`, preprocess
inputs, run a forward pass or `generate`, and post-process the outputs. If
you need to write code that uses transformers and are unsure how to get
started, read the relevant command in that folder first.

## Copies and Modular Models

We try to avoid direct inheritance between model-specific files in `src/transformers/models/`. We have two mechanisms to manage the resulting code duplication:
Expand Down
11 changes: 10 additions & 1 deletion .circleci/create_circleci_config.py
Original file line number Diff line number Diff line change
Expand Up @@ -398,6 +398,15 @@ def job_name(self):
parallelism=6,
)

training_distributed_ci_job = CircleCIJob(
"training_distributed_ci",
additional_env={"RUN_TRAINING_TESTS": True},
docker_image=[{"image": "huggingface/transformers-torch-light"}],
install_steps=["uv pip install ."],
marker="is_training_distributed_test",
parallelism=6,
)

# We also include a `dummy.py` file in the files to be doc-tested to prevent edge case failure. Otherwise, the pytest
# hangs forever during test collection while showing `collecting 0 items / 21 errors`. (To see this, we have to remove
# the bash output redirection.)
Expand Down Expand Up @@ -427,7 +436,7 @@ def job_name(self):
PIPELINE_TESTS = [pipelines_torch_job]
REPO_UTIL_TESTS = [repo_utils_job]
DOC_TESTS = [doc_test_job]
TRAINING_CI_TESTS = [training_ci_job]
TRAINING_CI_TESTS = [training_ci_job, training_distributed_ci_job]
TENSOR_PARALLEL_CI_TESTS = [tensor_parallel_ci_job]
ALL_TESTS = REGULAR_TESTS + EXAMPLES_TESTS + PIPELINE_TESTS + REPO_UTIL_TESTS + DOC_TESTS + [custom_tokenizers_job] + [exotic_models_job] + TRAINING_CI_TESTS + TENSOR_PARALLEL_CI_TESTS # fmt: skip

Expand Down
1 change: 0 additions & 1 deletion .github/scripts/assign_reviewers.py
Original file line number Diff line number Diff line change
@@ -1,4 +1,3 @@
# coding=utf-8
# Copyright 2025 the HuggingFace Inc. team. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
Expand Down
13 changes: 12 additions & 1 deletion .github/workflows/model_jobs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -186,7 +186,18 @@ jobs:
env:
report_name_prefix: ${{ inputs.report_name_prefix }}
run: |
cat "/transformers/reports/${machine_type}_${report_name_prefix}_${matrix_folders}_test_reports/captured_info.txt"
shopt -s nullglob
captured_info_files=("/transformers/reports/${machine_type}_${report_name_prefix}_${matrix_folders}_test_reports"/captured_info*.txt)

if [ ${#captured_info_files[@]} -eq 0 ]; then
echo "No captured information files found."
exit 0
fi

for captured_info_file in "${captured_info_files[@]}"; do
echo "===== ${captured_info_file##*/} ====="
cat "$captured_info_file"
done

- name: Copy test_outputs.txt
if: ${{ always() }}
Expand Down
3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -134,8 +134,9 @@ pipeline("the secret to baking a really good cake is ")
To chat with a model, the usage pattern is the same. The only difference is you need to construct a chat history (the input to `Pipeline`) between you and the system.

> [!TIP]
> You can also chat with a model directly from the command line, as long as [`transformers serve` is running](https://huggingface.co/docs/transformers/main/en/serving).
> You can also chat with a model directly from the command line, as long as the `chat` extra is installed and [`transformers serve` is running](https://huggingface.co/docs/transformers/main/en/serving).
> ```shell
> pip install .[chat] # or pip install transformers[chat]
> transformers chat Qwen/Qwen2.5-0.5B-Instruct
> ```

Expand Down
98 changes: 98 additions & 0 deletions all_requirements.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,98 @@
gpustat==1.1.1
psutil==6.0.0
psycopg2==2.9.9
pandas>=1.5.0
numpy>=1.21.0
psutil>=5.8.0
nvidia-ml-py>=12.0.0
torch>=2.0.0
datasets>=2.10.0
huggingface_hub>=0.16.0
amdsmi>=7.0.2
git+https://github.com/huggingface/transformers.git@main # install main or adjust it with vX.X.X for installing version specific transforms
datasets==1.8.0accelerate >= 0.12.0
datasets >= 1.8.0
torch >= 1.3.0
evaluateaccelerate >= 0.21.0
sentencepiece != 0.1.92
protobuf
torch >= 1.3
datasets[audio]>=1.14.0
evaluate
librosa
torchaudio
torch>=1.6
accelerate >= 0.12.0
datasets >= 1.8.0
sentencepiece != 0.1.92
protobuf
sacrebleu >= 1.4.12
py7zr
torch >= 1.3
evaluatedatasets >= 2.0.0
torch >= 1.3
accelerate
evaluate
Pillow
albumentations >= 1.4.16
accelerate >= 0.12.0
datasets >= 1.8.0
sentencepiece != 0.1.92
protobuf
rouge-score
nltk
py7zr
torch >= 1.3
evaluate
torch>=1.5.0
torchvision>=0.6.0
datasets>=1.8.0accelerate >= 0.12.0
datasets >= 1.8.0
sentencepiece != 0.1.92
scipy
scikit-learn
protobuf
torch >= 1.3
evaluateaccelerate>=0.12.0
torch>=1.5.0
torchvision>=0.6.0
datasets>=2.14.0
evaluate
scikit-learnaccelerate >= 0.12.0
torch >= 1.3
datasets >= 2.14.0
sentencepiece != 0.1.92
protobuf
evaluate
scikit-learn
accelerate >= 0.12.0
seqeval
datasets >= 1.8.0
torch >= 1.3
evaluatealbumentations >= 1.4.16
timm
datasets>=4.0
torchmetrics
pycocotools
datasets[audio] >= 1.18.0
torch >= 1.5
torchaudio
librosa
jiwer
evaluate
datasets[audio] >= 1.12.0
torch >= 1.5
torchaudio
accelerate >= 0.12.0
librosatorch>=1.5.0
torchvision>=0.6.0
datasets>=1.8.0albumentations >= 1.4.16
timm
datasets
torchmetrics
pycocotools
accelerate >= 0.12.0
sentencepiece != 0.1.92
protobuf
torch >= 1.3
evaluate
Loading