Skip to content

[Perf Improvement] Stage monolith linesearch search vector#52

Merged
yaoliu13 merged 1 commit intoamd-integrationfrom
perf/monolith-linesearch
May 1, 2026
Merged

[Perf Improvement] Stage monolith linesearch search vector#52
yaoliu13 merged 1 commit intoamd-integrationfrom
perf/monolith-linesearch

Conversation

@peizhang56
Copy link
Copy Markdown

Use linesearch in monolith to reduce the register pressure.

@peizhang56
Copy link
Copy Markdown
Author

/run_ci

@peizhang56
Copy link
Copy Markdown
Author

/run-ci

@peizhang56
Copy link
Copy Markdown
Author

Looks the perf gain is gone with latest amd-integration. Should investigate further.

Avoid rereading the linesearch search vector from global memory on AMDGPU while preserving the default CPU path and test correctness.

[Perf Improvement] Reduce monolith linesearch state traffic

Keep linesearch counters and convergence state local inside the monolith loop to avoid hot global field round-trips while preserving solver behavior.

[Perf Improvement] Restore solver previous cost state

Keep the monolith linesearch traffic reductions while preserving prev_cost as observable solver state for CPU correctness checks.
@peizhang56 peizhang56 force-pushed the perf/monolith-linesearch branch from 44d96dd to e81265e Compare April 30, 2026 23:14
@peizhang56
Copy link
Copy Markdown
Author

/run-ci

@nilapate nilapate self-requested a review May 1, 2026 07:18
Copy link
Copy Markdown

@nilapate nilapate left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Copy Markdown
Collaborator

@yaoliu13 yaoliu13 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

1303095 and 5273

@yaoliu13 yaoliu13 merged commit 097ceaf into amd-integration May 1, 2026
@yaoliu13 yaoliu13 deleted the perf/monolith-linesearch branch May 1, 2026 07:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants