[Fix] Add `original_max_position_embeddings` to YARN rope_scaling optional keys #36877

JustinTong0323 · 2025-03-21T09:17:51Z

What does this PR do?

This PR adds support for the original_max_position_embeddings parameter in YARN rope scaling configurations, addressing compatibility issues with Qwen-32B series models.

The Qwen team requires this parameter for their YARN implementation in Qwen-32B models, ref: link.

Previously, transformers would raise warnings about unrecognized keys despite this being a valid configuration parameter:

"rope_scaling": {
    "factor": 4.0,
    "original_max_position_embeddings": 32768,  # Previously unrecognized
    "type": "yarn"
}

Impact of this PR:

Eliminates spurious warnings for Qwen-32B users
Enables proper configuration validation for YARN-based models

This PR could also solves downstream issues:
sgl-project/sglang#4145
vllm-project/vllm#10293

Clarify #33783

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

…iginal_max_position_embeddings

github-actions · 2025-03-21T09:18:01Z

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the Ready for review button (at the bottom of the PR page).

ArthurZucker

Sure ! Thanks for finding this 🤗

…ional keys (huggingface#36877) [fix] Update optional keys in _validate_yarn_parameters to include original_max_position_embeddings

[fix] Update optional keys in _validate_yarn_parameters to include or…

0651788

…iginal_max_position_embeddings

github-actions bot marked this pull request as draft March 21, 2025 09:18

Merge branch 'main' into xinyuan/yarn_original_max_position_embeddings

63649b6

JustinTong0323 marked this pull request as ready for review March 21, 2025 09:18

github-actions bot requested review from ArthurZucker and Rocketknight1 March 21, 2025 09:18

Merge branch 'main' into xinyuan/yarn_original_max_position_embeddings

ed3f110

ArthurZucker approved these changes Mar 24, 2025

View reviewed changes

ArthurZucker merged commit e28be7a into huggingface:main Mar 24, 2025
19 of 21 checks passed

This was referenced Apr 22, 2025

[Bug] Unrecognized keys in rope_scaling for 'rope_type'='yarn': {'original_max_position_embeddings'} sgl-project/sglang#2943

Closed

[Bug]: Can't use yarn rope config for long context in Qwen2 model vllm-project/vllm#10293

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Fix] Add `original_max_position_embeddings` to YARN rope_scaling optional keys #36877

[Fix] Add `original_max_position_embeddings` to YARN rope_scaling optional keys #36877

Uh oh!

JustinTong0323 commented Mar 21, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Mar 21, 2025

Uh oh!

ArthurZucker left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Fix] Add original_max_position_embeddings to YARN rope_scaling optional keys #36877

[Fix] Add original_max_position_embeddings to YARN rope_scaling optional keys #36877

Uh oh!

Conversation

JustinTong0323 commented Mar 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

github-actions bot commented Mar 21, 2025

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Fix] Add `original_max_position_embeddings` to YARN rope_scaling optional keys #36877

[Fix] Add `original_max_position_embeddings` to YARN rope_scaling optional keys #36877

JustinTong0323 commented Mar 21, 2025 •

edited

Loading