Skip to content

Conversation

@mycpuorg
Copy link

New example under examples/attention_optimization/ with docs, configs, evaluator, MLIR IRs, scripts, and tests. Evolves transformation parameters to optimize attention kernels; supports IR-sim and real execution. Aims for 15–32% speedup; usage documented in README.

TODO:

  • integration with mlir-opt for automated profiling to complete the validation loop
  • GPU support

New example under examples/attention_optimization/ with docs, configs, evaluator, MLIR IRs, scripts, and tests.
Evolves transformation parameters to optimize attention kernels; supports IR-sim and real execution.
Aims for 15–32% speedup; usage documented in README.

TODO:
* integration with mlir-opt for automated profiling to complete the validation loop
* GPU support
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant