Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
466 commits
Select commit Hold shift + click to select a range
0b066be
Revert PR 32299, flag users when Zero-3 was missed (#32851)
muellerzr Aug 16, 2024
1c36db6
fix multi-gpu with static cache (#32543)
SunMarc Aug 16, 2024
8ec028a
Reduce the error log when using core models that need their weights r…
muellerzr Aug 16, 2024
6806d33
Make beam_constraints.Constraint.advance() docstring more accurate (#…
alex-calderwood Aug 16, 2024
52cb403
generate: missing `to` in DoLa body, causing exceptions in multi-gpu …
gante Aug 17, 2024
843e5e2
Add Flax Dinov2 (#31960)
MHRDYN7 Aug 19, 2024
8260cb3
Add Descript-Audio-Codec model (#31494)
kamilakesbi Aug 19, 2024
54b7703
support torch-speech (#32537)
itazap Aug 19, 2024
e55b33c
[tests] make `test_sdpa_can_compile_dynamic` device-agnostic (#32519)
faaany Aug 19, 2024
f1b720e
Add __repr__ for Conv1D (#32425)
AaronZLT Aug 19, 2024
8a4857c
Support save/load ckpt for XLA FSDP (#32311)
yitongh Aug 19, 2024
5f6c080
RT-DETR parameterized batchnorm freezing (#32631)
AlanBlanchet Aug 19, 2024
59e8f19
Fix incorrect vocab size retrieval in GGUF config (#32551)
Isotr0py Aug 19, 2024
93e538a
Mamba / FalconMamba: Fix mamba left padding (#32677)
younesbelkada Aug 19, 2024
61d89c1
Fix: Mamba2 generation mismatch between input_ids and inputs_embeds (…
vasqu Aug 19, 2024
3720484
Docs: Fixed `whisper-large-v2` model link in docs (#32871)
Sai-Suraj-27 Aug 19, 2024
85345bb
Add tip to clarify tool calling (#32883)
Rocketknight1 Aug 19, 2024
13e645b
Allow-head-dim (#32857)
ArthurZucker Aug 20, 2024
fd06ad5
🚨🚨🚨 Update min version of accelerate to 0.26.0 (#32627)
SunMarc Aug 20, 2024
65f4bc9
Fix repr for conv (#32897)
ArthurZucker Aug 20, 2024
01c4fc4
fix: jamba cache fails to use torch.nn.module (#32894)
xgal Aug 20, 2024
c63a3d0
Fix: Mamba2 `norm_before_gate` usage (#32686)
vasqu Aug 20, 2024
9800e6d
Bump nltk from 3.7 to 3.9 in /examples/research_projects/decision_tra…
dependabot[bot] Aug 20, 2024
078d5a8
Replace `tensor.norm()` with decomposed version for CLIP executorch e…
qubvel Aug 20, 2024
1dde50c
link for optimizer names (#32400)
nbroad1881 Aug 20, 2024
8713466
[i18n-ar] add README_ar.md to README.md (#32583)
AhmedAlmaghz Aug 20, 2024
c6d484e
fix: [whisper] don't overwrite GenerationConfig's `return_timestamps`…
hrl Aug 21, 2024
3bb7b05
Update docker image building (#32918)
ArthurZucker Aug 21, 2024
f6e2586
Jamba: update integration tests (#32250)
gante Aug 22, 2024
af638c4
fix: Added missing `huggingface_hub` installation to workflows (#32891)
Sai-Suraj-27 Aug 22, 2024
6baa6f2
fix: no need to dtype A in jamba (#32924)
xgal Aug 22, 2024
c42d264
FEAT / Trainer: Add adamw 4bit optimizer (#31865)
SunMarc Aug 22, 2024
8b94d28
CI: separate step to download nltk files (#32935)
gante Aug 22, 2024
eeea712
FIX / Hub: Also catch for `exceptions.ConnectionError` (#31469)
younesbelkada Aug 22, 2024
9282413
Add SynCode to llm_tutorial (#32884)
shubhamugare Aug 22, 2024
bf97d4a
Fix benchmark script (#32635)
ydshieh Aug 22, 2024
99d67f1
Improve greedy search memory usage (#32895)
regisss Aug 22, 2024
ee8c01f
Add chat_template for tokenizer extracted from GGUF model (#32908)
Isotr0py Aug 22, 2024
f1d822b
fix: (issue #32689) `AttributeError` raised when using `Trainer` with…
fshp971 Aug 22, 2024
975b988
Gemma2: eager attention by default (#32865)
gante Aug 22, 2024
18199b3
[run_slow] idefics2 (#32840)
andimarafioti Aug 22, 2024
273c0af
Fix regression on `Processor.save_pretrained` caused by #31691 (#32921)
leloykun Aug 22, 2024
09e6579
🌐 [i18n-KO] Translated `knowledge_distillation_for_image_classificati…
JinukHong Aug 22, 2024
a26de15
Generate: Deprecate returning legacy cache by default; Handle `use_ca…
gante Aug 22, 2024
d806fa3
docs: fix outdated link to TF32 explanation (#32947)
anakin87 Aug 22, 2024
22e6f14
Reducing memory usage: removing useless logits computation in generat…
Cyrilvallez Aug 23, 2024
970a16e
Forbid `PretrainedConfig` from saving `generate` parameters; Update d…
gante Aug 23, 2024
adb9117
Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer (#…
JasonZhu1313 Aug 23, 2024
371b9c1
Enable some Jinja extensions and add datetime capabilities (#32684)
Rocketknight1 Aug 23, 2024
1dbd9d3
DeviceGuard added to use Deformable Attention more safely on multi-GP…
DonggeunYu Aug 23, 2024
e3a5f35
added doctring to SchedulerType class (#32898)
Arunprakash-A Aug 23, 2024
0a7af19
Update Jinja docs with new functions and general cleanup (#33097)
Rocketknight1 Aug 23, 2024
8defc95
Updated the custom_models.md changed cross_entropy code (#33118)
S-M-J-I Aug 26, 2024
19e6e80
support qwen2-vl (#32318)
simonJJJ Aug 26, 2024
93e0e1a
CI: add torchvision to the consistency image (#32941)
gante Aug 26, 2024
894d421
Test: add higher `atol` in `test_forward_with_num_logits_to_keep` (#3…
gante Aug 26, 2024
72d4a3f
mps: add `isin_mps_friendly`, a wrapper function for `torch.isin` (#3…
gante Aug 26, 2024
a378a54
Add changes for uroman package to handle non-Roman characters (#32404)
nandwalritik Aug 26, 2024
3562772
fix: Fixed `pydantic` required version in dockerfiles to make it comp…
Sai-Suraj-27 Aug 26, 2024
26f043b
quickfix documentation (#32566)
molbap Aug 26, 2024
9578c25
Fixup py 38 type hints for mps friendly (#33128)
muellerzr Aug 26, 2024
3bf6dd8
fix: Fixed CodeGenTokenizationTest::test_truncation failing test (#32…
Sai-Suraj-27 Aug 27, 2024
7562366
fix: multilingual midel convert to tflite get wrong token (#32079)
Ayaa17 Aug 27, 2024
3806faa
disable scheduled daily CI temporarily (#33136)
ydshieh Aug 27, 2024
ab0ac3b
CI: fix `efficientnet` pipeline timeout and prevent future similar is…
gante Aug 27, 2024
746e114
Bump torch from 1.13.1 to 2.2.0 in /examples/research_projects/jax-pr…
dependabot[bot] Aug 27, 2024
892d51c
Log additional test metrics with the CometCallback (#33124)
Lothiraldan Aug 27, 2024
6f0ecf1
[docs] add quick usage snippet to Whisper. (#31289)
Vaibhavs10 Aug 27, 2024
d1f39c4
Update stateful_callbacks state before saving checkpoint (#32115)
brs-pt Aug 27, 2024
834ec7b
fix Idefics2VisionConfig type annotation (#33103)
chenzizhao Aug 27, 2024
9956c2b
Add a fix for custom code tokenizers in pipelines (#32300)
Rocketknight1 Aug 27, 2024
c6b23fd
Llama: make slow tests green 🟢 (#33138)
gante Aug 27, 2024
d47a9e8
fix redundant checkpointing in example training scripts (#33131)
eminorhan Aug 27, 2024
7ee4363
update torch req for 4-bit optimizer (#33144)
SunMarc Aug 27, 2024
6101d93
🌐 [i18n-KO] Translated `conversations.md` to Korean (#32468)
newfull5 Aug 27, 2024
27903de
Very small change to one of the function parameters (#32548)
alisalamatian1 Aug 27, 2024
7591ca5
🚨 Add Blip2ForImageTextRetrieval (#29261)
jpizarrom Aug 27, 2024
c35d2cc
Granite language models (#31502)
mayank31398 Aug 27, 2024
386931d
fix model name and copyright (#33152)
mayank31398 Aug 28, 2024
3bfd3e4
Fix: Jamba batched generation (#32914)
vasqu Aug 28, 2024
e0b87b0
[whisper] pass attention_mask to generate_with_fallback() (#33145)
benniekiss Aug 28, 2024
f1a385b
[RoBERTa-based] Add support for sdpa (#30510)
hackyon Aug 28, 2024
f9ed05d
Fix import paths for test_module (#32888)
rasmi Aug 28, 2024
f4c86d0
Zero-shot pipelines: minor doc changes (#33127)
pcuenca Aug 28, 2024
5c84682
Customise the separator used for splicing in DataCollatorWithFlatteni…
beep-bebop Aug 28, 2024
74e19e8
Fix spell mistakes (#33149)
matsuo1234567 Aug 28, 2024
3d79dcb
update push CI workflow files for security (#33142)
ydshieh Aug 28, 2024
5c1027b
added quick clarification (#33166)
DuyguA Aug 28, 2024
39bfb2f
pass module to Params4bit.from_prequantized to ensure quant_state (#3…
winglian Aug 29, 2024
92a75ff
Mamba2 conversion script for original models (#32580)
vasqu Aug 29, 2024
5129671
Add a static cache that offloads to the CPU or other device (#32161)
gerbenvv Aug 29, 2024
c409cd8
use a single for loop (#33148)
ArthurZucker Aug 29, 2024
b127fb8
Pipeline: fix bad generation kwargs docs (#33205)
gante Aug 30, 2024
4987463
Bump torch from 1.13.1 to 2.2.0 in /examples/research_projects/codepa…
dependabot[bot] Aug 30, 2024
9a6956b
Bump torch from 1.13.1 to 2.2.0 in /examples/research_projects/decisi…
dependabot[bot] Aug 30, 2024
e259d6d
Add missing quotes in modeling_llava_next_video.py (#33214)
juliendenize Aug 30, 2024
fbff276
Add warning for stop string edge case (#33169)
Rocketknight1 Aug 30, 2024
38d58a4
Fix local repos with remote code not registering for pipelines (#33100)
Rocketknight1 Aug 30, 2024
b017a9e
Refactor CI: more explicit (#30674)
ArthurZucker Aug 30, 2024
c79bfc7
Create local Transformers Engine (#33218)
aymeric-roucher Aug 30, 2024
db70426
🌐 [i18n-KO] Translated `llm_optims.md` to Korean (#32325)
yijun-lee Aug 30, 2024
51e6526
Fix red amin (#33220)
ArthurZucker Aug 30, 2024
746104b
Test fetcher: missing return on filtered tests; don't write empty fil…
gante Aug 30, 2024
eb5b968
Generate: throw warning when `return_dict_in_generate` is False but s…
gante Aug 31, 2024
2e3f8f7
Add video text to text docs (#33164)
merveenoyan Sep 1, 2024
b9bc691
Add GraniteRMSNorm (#33177)
NielsRogge Sep 2, 2024
1ca9ff5
Add duckduckgo search tool (#32882)
aymeric-roucher Sep 2, 2024
409fcfd
Fix: Suppressed 'use_reentrant=False' warning (#33208)
ankush13r Sep 2, 2024
963ed98
docs: Replace package abbreviations with full name(`bitsandbytes`) in…
rapsealk Sep 2, 2024
2d37085
Bump opencv-python from 4.4.0.42 to 4.8.1.78 in /examples/research_pr…
dependabot[bot] Sep 2, 2024
52a0213
Add assistant prefill for chat templates and TextGenerationPipeline (…
Rocketknight1 Sep 2, 2024
97c0f45
Generate: fix assistant in different device (#33257)
gante Sep 2, 2024
9ea1eac
remove to restriction for 4-bit model (#33122)
SunMarc Sep 2, 2024
2895224
Fixed typo repeated word in DETR docs (#33250)
sergiopaniego Sep 2, 2024
cff06aa
Fix: use `torch.from_numpy()` to create tensors for np.ndarrays (#33201)
shinyano Sep 2, 2024
5663026
remove torch input dependant control flow (#33245)
ArthurZucker Sep 3, 2024
7ed9789
Fix: `num_logits_to_keep` in composite models (#33168)
zucchini-nlp Sep 3, 2024
979f477
Fix Bark saving (#33266)
ylacombe Sep 3, 2024
edeca43
🚨 Support dequantization for most GGML types (#32625)
Isotr0py Sep 3, 2024
0d86727
Update chat template docs to remove Blenderbot (#33254)
Rocketknight1 Sep 3, 2024
e969d88
Bump opencv-python from 4.4.0.42 to 4.8.1.78 in /examples/research_pr…
dependabot[bot] Sep 3, 2024
03c12d0
Add sdpa support for Albert (#32092)
OmarManzoor Sep 3, 2024
6b7d64a
Only disallow DeepSpeed Zero-3 for auto bs finder (#31731)
muellerzr Sep 3, 2024
979d24e
fix the parallel number of CI nodes when it is smaller than number of…
ArthurZucker Sep 3, 2024
d6534f9
Repo checks: check documented methods exist (#32320)
gante Sep 3, 2024
ecd61c6
Add OLMoE (#32406)
Muennighoff Sep 3, 2024
35f72eb
Fix: multigpu training (#33271)
zucchini-nlp Sep 4, 2024
ebbe8d8
Cache docs: update (#32929)
zucchini-nlp Sep 4, 2024
d750b50
Config: unified logic to retrieve text config (#33219)
gante Sep 4, 2024
d703477
[fix] LlavaNextProcessor '_get_unpadded_features' method (#33263)
laurentd-lunit Sep 4, 2024
178cb6b
wait 15m before SSH into runner workflow stops (#33300)
ydshieh Sep 4, 2024
122ded0
Bugfix/alexsherstinsky/fix none check for attention factor in rope sc…
alexsherstinsky Sep 4, 2024
5731dc8
Bump cryptography from 42.0.0 to 43.0.1 in /examples/research_project…
dependabot[bot] Sep 4, 2024
d2dcff9
[InstructBLIP] qformer_tokenizer is required input (#33222)
amyeroberts Sep 4, 2024
2cb543d
Multi agents with manager (#32687)
aymeric-roucher Sep 4, 2024
01c8c6c
Add a warning to the chat template docs about the tool_calls format (…
Rocketknight1 Sep 4, 2024
cfd92c6
Add new documentation page for advanced agent usage (#33265)
aymeric-roucher Sep 4, 2024
a1faf22
[BUG] fix upper nltk version (#33301)
ylacombe Sep 4, 2024
b390998
Fix excessive CPU memory usage with FSDP and cpu_ram_efficient_loadin…
matthewdouglas Sep 4, 2024
9230d78
Add validate images and text inputs order util for processors and tes…
yonigozlan Sep 4, 2024
43df47d
Llava Onevision: add model (#32673)
zucchini-nlp Sep 5, 2024
47b0964
Fix: Fix `FalconMamba` training issues due to incompatible kernels (#…
younesbelkada Sep 5, 2024
03164ba
Add paper link (#33305)
Muennighoff Sep 5, 2024
c6d2848
🚨 Fix `torch.jit.trace` for `interpolate_pos_encoding` in all vision …
xenova Sep 5, 2024
132e875
Update SECURITY.md (#32680)
Michellehbn Sep 5, 2024
5d11de4
Add Qwen2Moe GGUF loading support (#33264)
VladOS95-cyber Sep 5, 2024
21fac7a
simple align qwen2vl kv_seq_len calculation with qwen2 (#33161)
simonJJJ Sep 5, 2024
5792c45
Add a community notebook for fine-tuning with QLoRA, PEFT, and MLflow…
daniellok-db Sep 6, 2024
1759bb9
Fix: StaticCache & `inputs_embeds` (#32932)
zucchini-nlp Sep 6, 2024
2b789f2
Docs: add more cross-references to the KV cache docs (#33323)
gante Sep 6, 2024
51d15eb
[whisper] alternative fix for long-form timestamps (#32131)
sanchit-gandhi Sep 6, 2024
1bd9d1c
fix qwen2vl vision eager-attention (#33213)
simonJJJ Sep 6, 2024
e1c2b69
Load dynamic module (remote code) only once if code isn't change (#33…
XuehaiPan Sep 6, 2024
363301f
support loading model without config.json file (#32356)
itazap Sep 6, 2024
3314fe1
Add validation for maximum sequence length in modeling_whisper.py (#3…
AmirMohammadFakhimi Sep 6, 2024
2b18354
add self.head_dim for VisionAttention in Qwen2-VL (#33211)
GeLee-Q Sep 6, 2024
342e800
support 3D attention mask in bert (#32105)
gathierry Sep 6, 2024
e48e5f1
Support reading tiktoken tokenizer.model file (#31656)
itazap Sep 6, 2024
2d75700
red-ci on main, fix copies (#33356)
ArthurZucker Sep 6, 2024
6ff6069
RoPE: fix BC warning (#33331)
gante Sep 6, 2024
d7b04ea
Fix Prefill docs (#33352)
Rocketknight1 Sep 6, 2024
a70286f
Update author for QLorA/PEFT community notebook (#33338)
daniellok-db Sep 6, 2024
66bc4de
add sdpa mbart (#32033)
nbroad1881 Sep 7, 2024
60226fd
Fix quantized cache tests (#33351)
zucchini-nlp Sep 9, 2024
62aecd8
schedulefree optimizers (#30079)
winglian Sep 9, 2024
489cbfd
Add visit webpage tool (#33353)
aymeric-roucher Sep 9, 2024
eedd21b
Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)
nnilayy Sep 9, 2024
65bb284
Compile compatibilty for decoder-only models (#32617)
zucchini-nlp Sep 9, 2024
0574fa6
Adjust templates (#33384)
LysandreJik Sep 9, 2024
f745e7d
Remove repeated prepare_images in processor tests (#33163)
amyeroberts Sep 9, 2024
7f112ca
Fix import of `FalconMambaForCausalLM` (#33381)
younesbelkada Sep 10, 2024
f24f084
Import structure & first three model refactors (#31329)
LysandreJik Sep 10, 2024
7d2d6ce
VLM: fixes after refactor (#32907)
zucchini-nlp Sep 10, 2024
8e8e7d8
fixed Mask2Former image processor segmentation maps handling (#33364)
maciej-adamiak Sep 10, 2024
96429e7
Add support for GGUF Phi-3 (#31844)
a8nova Sep 10, 2024
6ed2b10
Bug Fix: Update hub.py to fix NoneType error (#33315)
rishiraj Sep 10, 2024
dfee4f2
Update WhisperTokenizer Doc: Timestamps and Previous Tokens Behaviour…
bruno-hays Sep 10, 2024
f38590d
Make StaticCache configurable at model construct time (#32830)
guangy10 Sep 10, 2024
781bbc4
use diff internal model in tests (#33387)
itazap Sep 11, 2024
e719b65
Fix `FbgemmFp8Linear` not preserving tensor shape (#33239)
vgel Sep 11, 2024
91f19a5
Fix failing windows (#33436)
LysandreJik Sep 11, 2024
42babe8
Remove deprecated task in load_dataset (#33433)
albertvillanova Sep 11, 2024
7a51cbc
Dynamic number of speculative tokens in order to accelerate speculati…
jmamou Sep 11, 2024
ecf7024
Fix: Cast prefetch_bucket_size to integer for deepspeed >= 0.15 (#33402)
kiddj Sep 11, 2024
c403441
[docs] add the missing huggingface hub username (#33431)
faaany Sep 11, 2024
cea9ec0
[docs] add the missing tokenizer when pushing models to huggingface h…
faaany Sep 11, 2024
d7a553b
Update stale.yml (#33434)
LysandreJik Sep 12, 2024
e0ff432
Docs - update formatting of llama3 model card (#33438)
MichaelCurrin Sep 12, 2024
516ee6a
Fix incomplete sentence in `Zero-shot object detection` documentation…
sergiopaniego Sep 12, 2024
8ed6352
Fix flax whisper tokenizer bug (#33151)
hannan72 Sep 12, 2024
c8ea675
Clean-up deprecated code (#33446)
zucchini-nlp Sep 12, 2024
d71d6cb
Fix default revision for pipelines (#33395)
ankane Sep 12, 2024
5334b61
Revive AMD scheduled CI (#33448)
ydshieh Sep 12, 2024
e688996
Allow send `SSH into runner` info. to DM (#33346)
ydshieh Sep 12, 2024
8f8af0f
Correct Whisper's beam search scores computation (#32336)
ylacombe Sep 12, 2024
2f611d3
Qwen2-VL: clean-up and add more tests (#33354)
zucchini-nlp Sep 12, 2024
5c6257d
[whisper] Clarify error message when setting max_new_tokens (#33324)
benniekiss Sep 12, 2024
a05ce55
[docs] refine the doc for `train with a script` (#33423)
faaany Sep 12, 2024
9c4639b
Return image hidden states (#33426)
zucchini-nlp Sep 13, 2024
1027a53
add a callback hook right before the optimizer step (#33444)
winglian Sep 13, 2024
4b0418d
Enable `padding_side` as call time kwargs (#33385)
zucchini-nlp Sep 13, 2024
7a56598
Mitigate a conflict when using sentencepiece (#33327)
tengomucho Sep 13, 2024
dfd3115
[Phi-3] Bug on stale kv cache (#33129)
garg-amit Sep 13, 2024
6cc4dfe
Fix the initialization of the cache when we have multi gpu (#33303)
SunMarc Sep 13, 2024
0963229
Enable finetuning with torchao quantized model (#33361)
SunMarc Sep 13, 2024
e39b6c1
Corrected `Agents and tools` documentation links typos (#33471)
sergiopaniego Sep 13, 2024
7bb1c99
chore: fix typo in comment in tokenization_utils_base.py (#33466)
DavidLemayian Sep 13, 2024
8bd2b1e
Add support for Pixtral (#33449)
ArthurZucker Sep 14, 2024
95e816f
Cohere: update RoPE structure (#33408)
gante Sep 16, 2024
5ce0a11
Fix SSH workflow (#33451)
ydshieh Sep 16, 2024
ce62a41
Add keypoint-detection task guide (#33274)
merveenoyan Sep 16, 2024
2f62146
Uniformize kwargs for LLaVa processor and update docs (#32858)
yonigozlan Sep 16, 2024
c7a91f5
`Agents, supercharged - Multi-agents, External tools, and more` docs …
sergiopaniego Sep 16, 2024
c2d0589
[i18n-ar] Add File : `docs/source/ar/_toctree.yml` (#32696)
AhmedAlmaghz Sep 16, 2024
98adf24
[Whisper test] Fix some failing tests (#33450)
ylacombe Sep 16, 2024
4ba531c
Fix: Qwen2-VL training on video datasets (#33307)
hiyouga Sep 17, 2024
ba1f1dc
Updated Trainer's liger-kernel integration to call correct patching A…
shimizust Sep 17, 2024
9f196ef
Replace `accelerator.use_fp16` in examples (#33513)
hlky Sep 17, 2024
18e1a9c
Fix parametrization-based weight norm (#33275)
ylacombe Sep 17, 2024
bcf8946
Fix number of patch check for different vision feature select strateg…
insujang Sep 17, 2024
642256d
chore: migrate coverage cfg to pyproject.toml (#32650)
SauravMaheshkar Sep 17, 2024
74026b4
idefics2 enable_input_require_grads not aligned with disable_input_re…
sywangyi Sep 17, 2024
ac5a055
Update chameleon.md — fix runtime type error (#33494)
maxwbuckley Sep 17, 2024
7635484
Add explicit example for RAG chat templating (#33503)
A-Duss Sep 17, 2024
3476c19
CI Build image - move runners (#33530)
glegendre01 Sep 17, 2024
46c2757
fix to jamba config, asserting attention and expert offset (#33316)
ErezSC42 Sep 17, 2024
c29a869
Fix missing `sequences_scores` in the Whisper beam search output (#3…
Nik-Kras Sep 17, 2024
d8500cd
Uniformize kwargs for Pixtral processor (#33521)
yonigozlan Sep 17, 2024
6c051b4
Add revision to trainer push_to_hub (#33482)
teamclouday Sep 17, 2024
454a0f2
fix patch_attention_mask incorrect setting which leads to the differe…
sywangyi Sep 17, 2024
fee8651
Support LLaVa-OV-Chat (#33532)
zucchini-nlp Sep 18, 2024
e6d9f39
Decorator for easier tool building (#33439)
aymeric-roucher Sep 18, 2024
52e22cb
Fix for slow the bug tokenizer adding spaces to single id decodes (#3…
DuyguA Sep 18, 2024
db72894
Chat template: save and load correctly for processors (#33462)
zucchini-nlp Sep 18, 2024
9f2b8cc
Fix missing head_dim in llama config from gguf model (#33526)
Isotr0py Sep 18, 2024
5427eaa
[i18n-ur] Added README_ur.md file (#33461)
akkefa Sep 18, 2024
4f1e9ba
fix the wandb logging issue (#33464)
ZIYU-DEEP Sep 18, 2024
f883827
Fix tests in ASR pipeline (#33545)
ylacombe Sep 18, 2024
fc83a4d
Added support for bfloat16 to zero-shot classification pipeline (#33554)
umarbutler Sep 18, 2024
7542fac
Pipeline: no side-effects on `model.config` and `model.generation_con…
gante Sep 18, 2024
8efc06e
Return attention mask in ASR pipeline to avoid warnings (#33509)
Rocketknight1 Sep 18, 2024
9db963a
enforce original size to be a list (#33564)
dom-dziela Sep 18, 2024
7b1ce63
Improve compiled RT-DETR inference speed (#33412)
yonigozlan Sep 18, 2024
6019f3f
Fix bnb dequantization (#33546)
SunMarc Sep 18, 2024
5af7d41
Codec integration (#33565)
ylacombe Sep 18, 2024
e40bb48
Load and save video-processor from separate folder (#33562)
zucchini-nlp Sep 19, 2024
d7975a5
VLMs: enable generation tests (#33533)
zucchini-nlp Sep 19, 2024
f3b3810
rag: fix CI (#33578)
gante Sep 19, 2024
80b774e
Cache: don't show warning in forward passes when `past_key_values` is…
gante Sep 19, 2024
e7aa36d
Merge downstream main into tmp-develop_test_2-20240919 with conflicts
github-actions[bot] Sep 19, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
85 changes: 33 additions & 52 deletions .circleci/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -34,64 +34,44 @@ jobs:
- run: echo 'export "GIT_COMMIT_MESSAGE=$(git show -s --format=%s)"' >> "$BASH_ENV" && source "$BASH_ENV"
- run: mkdir -p test_preparation
- run: python utils/tests_fetcher.py | tee tests_fetched_summary.txt
- store_artifacts:
path: ~/transformers/tests_fetched_summary.txt
- run: |
if [ -f test_list.txt ]; then
cp test_list.txt test_preparation/test_list.txt
else
touch test_preparation/test_list.txt
fi
- run: |
if [ -f examples_test_list.txt ]; then
mv examples_test_list.txt test_preparation/examples_test_list.txt
else
touch test_preparation/examples_test_list.txt
fi
- run: |
if [ -f filtered_test_list_cross_tests.txt ]; then
mv filtered_test_list_cross_tests.txt test_preparation/filtered_test_list_cross_tests.txt
else
touch test_preparation/filtered_test_list_cross_tests.txt
fi
- run: |
if [ -f doctest_list.txt ]; then
cp doctest_list.txt test_preparation/doctest_list.txt
else
touch test_preparation/doctest_list.txt
fi
- run: |
if [ -f test_repo_utils.txt ]; then
mv test_repo_utils.txt test_preparation/test_repo_utils.txt
else
touch test_preparation/test_repo_utils.txt
fi
- run: python utils/tests_fetcher.py --filter_tests
- run: export "GIT_COMMIT_MESSAGE=$(git show -s --format=%s)" && echo $GIT_COMMIT_MESSAGE && python .circleci/create_circleci_config.py --fetcher_folder test_preparation
- run: |
if [ -f test_list.txt ]; then
mv test_list.txt test_preparation/filtered_test_list.txt
else
touch test_preparation/filtered_test_list.txt
if [ ! -s test_preparation/generated_config.yml ]; then
echo "No tests to run, exiting early!"
circleci-agent step halt
fi

- store_artifacts:
path: test_preparation/test_list.txt
- store_artifacts:
path: test_preparation/doctest_list.txt
- store_artifacts:
path: ~/transformers/test_preparation/filtered_test_list.txt
- store_artifacts:
path: test_preparation/examples_test_list.txt
- run: export "GIT_COMMIT_MESSAGE=$(git show -s --format=%s)" && echo $GIT_COMMIT_MESSAGE && python .circleci/create_circleci_config.py --fetcher_folder test_preparation
- run: |
if [ ! -s test_preparation/generated_config.yml ]; then
echo "No tests to run, exiting early!"
circleci-agent step halt
fi
path: test_preparation

- run:
name: "Retrieve Artifact Paths"
env:
CIRCLE_TOKEN: ${{ secrets.CI_ARTIFACT_TOKEN }}
command: |
project_slug="gh/${CIRCLE_PROJECT_USERNAME}/${CIRCLE_PROJECT_REPONAME}"
job_number=${CIRCLE_BUILD_NUM}
url="https://circleci.com/api/v2/project/${project_slug}/${job_number}/artifacts"
curl -o test_preparation/artifacts.json ${url}
- run:
name: "Prepare pipeline parameters"
command: |
python utils/process_test_artifacts.py

# To avoid too long generated_config.yaml on the continuation orb, we pass the links to the artifacts as parameters.
# Otherwise the list of tests was just too big. Explicit is good but for that it was a limitation.
# We used:

# https://circleci.com/docs/api/v2/index.html#operation/getJobArtifacts : to get the job artifacts
# We could not pass a nested dict, which is why we create the test_file_... parameters for every single job

- store_artifacts:
path: test_preparation/generated_config.yml
path: test_preparation/transformed_artifacts.json
- store_artifacts:
path: test_preparation/filtered_test_list_cross_tests.txt
path: test_preparation/artifacts.json
- continuation/continue:
parameters: test_preparation/transformed_artifacts.json
configuration_path: test_preparation/generated_config.yml

# To run all tests for the nightly build
Expand Down Expand Up @@ -142,6 +122,7 @@ jobs:
- run: python utils/custom_init_isort.py --check_only
- run: python utils/sort_auto_mappings.py --check_only
- run: python utils/check_doc_toc.py
- run: python utils/check_docstrings.py --check_all

check_repository_consistency:
working_directory: ~/transformers
Expand Down Expand Up @@ -190,4 +171,4 @@ workflows:
- check_circleci_user
- check_code_quality
- check_repository_consistency
- fetch_all_tests
- fetch_all_tests
Loading