Skip to content

[FA] Refactor FA CB kwargs#44553

Draft
vasqu wants to merge 8 commits intohuggingface:mainfrom
vasqu:cleanup-fa-paged
Draft

[FA] Refactor FA CB kwargs#44553
vasqu wants to merge 8 commits intohuggingface:mainfrom
vasqu:cleanup-fa-paged

Conversation

@vasqu
Copy link
Copy Markdown
Contributor

@vasqu vasqu commented Mar 10, 2026

As per title, wip and really unsure if we really want this

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@github-actions
Copy link
Copy Markdown
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: bark, diffllama, falcon, gpt_neo, gptj, kyutai_speech_to_text, mimi, moshi, nemotron

@vasqu vasqu changed the title [FA] Move CB support to main modeling [FA] Refactor FA CB kwarg Mar 16, 2026
@vasqu vasqu changed the title [FA] Refactor FA CB kwarg [FA] Refactor FA CB kwargs Mar 16, 2026
@github-actions
Copy link
Copy Markdown
Contributor

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=44553&sha=a63e4e

inputs=input_ids, generation_config=gen_config, continuous_batching_config=continuous_batching_config
)
self.assertTrue(mock_get_block_table_key.called, "get_block_table_key method was not called.")
outputs_kvcache = model.generate_batch(
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we keep the patch to check the function is indeed called? But the patch would be on _prepare_kvcache_fa

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants