Add initial support for RotaryEmbedding fusion for onnx opset 23 #2450

gramalingam · 2025-07-14T02:47:46Z

Add initial support for RotaryEmbedding fusion for onnx opset 23

Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>

codecov · 2025-07-14T02:51:27Z

❌ 9 Tests Failed:

Tests completed	Failed	Passed	Skipped
16456	9	16447	3852

View the top 3 failed test(s) by shortest run time

::onnxscript.tools.training_helper

Stack Traces | 0s run time

ImportError while importing test module '.../onnxscript/tools/training_helper.py'.
Hint: make sure your test modules/packages have valid Python names.
Traceback:
.../Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/importlib/__init__.py:126: in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
onnxscript/tools/training_helper.py:6: in <module>
    from torch.onnx import _OrtBackend, _OrtBackendOptions
E   ImportError: cannot import name '_OrtBackend' from 'torch.onnx' (.../onnxscript/onnxscript/.nox.../test_torch_nightly/lib/python3.11.../torch/onnx/__init__.py)

::onnxscript.tools.transformers_models.llama_test

Stack Traces | 0s run time

ImportError while importing test module '.../tools/transformers_models/llama_test.py'.
Hint: make sure your test modules/packages have valid Python names.
Traceback:
.../Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/importlib/__init__.py:126: in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.../tools/transformers_models/llama_test.py:12: in <module>
    import onnxscript.tools.training_helper
onnxscript/tools/training_helper.py:6: in <module>
    from torch.onnx import _OrtBackend, _OrtBackendOptions
E   ImportError: cannot import name '_OrtBackend' from 'torch.onnx' (.../onnxscript/onnxscript/.nox.../test_torch_nightly/lib/python3.11.../torch/onnx/__init__.py)

::onnxscript.tools.transformers_models.mistral_test

Stack Traces | 0s run time

ImportError while importing test module '.../tools/transformers_models/mistral_test.py'.
Hint: make sure your test modules/packages have valid Python names.
Traceback:
.../Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/importlib/__init__.py:126: in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.../tools/transformers_models/mistral_test.py:14: in <module>
    import onnxscript.tools.training_helper
onnxscript/tools/training_helper.py:6: in <module>
    from torch.onnx import _OrtBackend, _OrtBackendOptions
E   ImportError: cannot import name '_OrtBackend' from 'torch.onnx' (.../onnxscript/onnxscript/.nox.../test_torch_nightly/lib/python3.11.../torch/onnx/__init__.py)
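
All three failures above come from the same unconditional import at module load time. One possible way to keep such a module importable when a private name is absent is a guarded import. This is a minimal sketch, not the fix adopted in this PR; `optional_import` and `HAS_ORT_BACKEND` are hypothetical names introduced here for illustration.

```python
import importlib
from typing import Any, Optional


def optional_import(module_name: str, *attr_names: str) -> Optional[tuple[Any, ...]]:
    """Return the requested attributes, or None if the module or any name is missing."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Module not installed (ModuleNotFoundError is a subclass of ImportError).
        return None
    try:
        return tuple(getattr(module, name) for name in attr_names)
    except AttributeError:
        # Module exists but no longer exposes one of the names.
        return None


# Hypothetical use in a module like training_helper.py (assumption, not repo code):
ort_backend = optional_import("torch.onnx", "_OrtBackend", "_OrtBackendOptions")
HAS_ORT_BACKEND = ort_backend is not None
```

Tests that depend on the backend could then be skipped when `HAS_ORT_BACKEND` is false, instead of failing at collection time.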


onnxscript/rewriter/onnx_fusions/_onnx_fusions_test.py

onnxscript/rewriter/onnx_fusions/_rotary_embedding.py

+#    def rotate_half(x):
+#        """Rotates half the hidden dims of the input."""
+#        x1 = x[..., : x.shape[-1] // 2]
+#        x2 = x[..., x.shape[-1] // 2 :]
+#        return torch.cat((-x2, x1), dim=-1)
+# and
+#        q_embed = (q * cos) + (rotate_half(q) * sin)


The best way to address the issue is to remove the commented-out code and replace it with a concise, well-structured explanation of the referenced logic. The explanation can include a link to the external function's implementation in Hugging Face's repository and a summary of what the function does, ensuring clarity without including raw commented-out code.

Specifically:

Remove the commented-out rotate_half function code (lines 13-20).

Replace it with a concise comment explaining the logic and its relevance to _rotate_half_pattern.

Retain the link to the external repository for further reference.
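
For reference, the logic that the commented-out snippet describes can be sketched in plain Python on a single vector. This is an illustrative sketch only: it uses lists instead of tensors, and `apply_rotary` is a hypothetical helper name, not a function from the PR or from Hugging Face.

```python
import math
from typing import List


def rotate_half(x: List[float]) -> List[float]:
    """Split x into halves (x1, x2) and return (-x2, x1) concatenated."""
    half = len(x) // 2
    x1, x2 = x[:half], x[half:]
    return [-v for v in x2] + x1


def apply_rotary(q: List[float], cos: List[float], sin: List[float]) -> List[float]:
    """Elementwise q_embed = (q * cos) + (rotate_half(q) * sin)."""
    rotated = rotate_half(q)
    return [qi * c + ri * s for qi, c, ri, s in zip(q, cos, rotated, sin)]


# Rotating the 2-element vector [1, 0] by 90 degrees gives approximately [0, 1].
theta = math.pi / 2
q = [1.0, 0.0]
q_embed = apply_rotary(q, [math.cos(theta)] * 2, [math.sin(theta)] * 2)
```

Each pair of elements (one from the first half, one from the second half) is rotated by the angle encoded in `cos` and `sin`, which is the computation the fusion pattern needs to recognize.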

justinchuby · 2025-07-18T23:13:21Z

Looks like there are merge conflicts.


gramalingam · 2025-07-21T19:23:38Z

Looks like there are merge conflicts.

Resolved

gramalingam added 3 commits July 13, 2025 17:44

Add rotary embedding fusion for opset 23

82c6493

Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>

Add test case

4dc9d4c

Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>

Call rotary fusion

2b4b473

Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>

github-project-automation bot added this to ONNX Script Review Board Jul 14, 2025

github-project-automation bot moved this to Todo in ONNX Script Review Board Jul 14, 2025