Skip to content

Update configuration_Clarify rotary_pct reset behavior in GPTNeoXConfiggpt_neox.py#45025

Closed
layla1824 wants to merge 1 commit intohuggingface:mainfrom
layla1824:patch-1
Closed

Update configuration_Clarify rotary_pct reset behavior in GPTNeoXConfiggpt_neox.py#45025
layla1824 wants to merge 1 commit intohuggingface:mainfrom
layla1824:patch-1

Conversation

@layla1824
Copy link
Copy Markdown

This PR adds a clarification comment regarding the behavior of rotary_pct.

Currently, rotary_pct may reset to its default value (0.25) after reload due to the use of kwargs.pop.

This note helps developers better understand this behavior.

Add note about rotary_pct reset behavior
@github-actions
Copy link
Copy Markdown
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: gpt_neox

@zucchini-nlp
Copy link
Copy Markdown
Member

Duplicate of #44985, will close as I am merging the other PR today. It got stale due to CI issues

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants