Skip to content

Conversation

@yut23
Copy link
Collaborator

@yut23 yut23 commented Jun 10, 2025

This decreases the runtime on CPU from 1.81s to 1.45s (-20%), compared to current development (f3ca377). It's still 25% slower than the original hand-written derivatives, which I believe is mostly due to the creation of extra temporary GradArrays in the autodiff code.

This decreases the runtime on CPU from 1.81s to 1.45s (-20%), compared
to PR AMReX-Astro#1803. It's still 25% slower than the original hand-written
derivatives, which I believe is mostly due to the creation of extra
temporary GradArrays in the autodiff code.
@yut23 yut23 force-pushed the sneut5-partial-arrays branch from 5ebf787 to 0fa11b4 Compare June 12, 2025 19:27
@yut23 yut23 marked this pull request as ready for review June 12, 2025 19:28
@zingale zingale merged commit a6ce7b7 into AMReX-Astro:development Jun 23, 2025
31 of 32 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants