GPU implementation of sparse equality constraint Jacobian#49

Closed
pelesh wants to merge 3 commits into olcf-hackathon-2026-dev from kasia/equality-constraint-jac
Conversation

@pelesh
Collaborator

@pelesh pelesh commented Apr 23, 2026

Merge request type

  • New feature
  • Resolves bug
  • Documentation
  • Other

Relates to

  • OPFLOW
  • SOPFLOW
  • SCOPFLOW
  • TCOPFLOW
  • CMake build system
  • Spack configuration
  • Manual
  • Web docs
  • Other

This MR updates

  • Header files
  • Source code
  • CMake build system
  • Spack configuration
  • Web docs
  • Manual
  • Other

Summary

Merges the branch by @kswirydo, rebased onto the "OLCF" branch.

Port equality constraint Jacobian from PETSc to RAJA GPU kernels

Replace the PETSc-based equality constraint Jacobian computation in the
PBPOLRAJAHIOPSPARSE model with direct GPU kernels using RAJA, eliminating
the D2H-compute-H2D round trip. The sparsity pattern is now computed on
the host during setup and the values are computed entirely on device.
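
The host/device split described above can be sketched as follows. This is an illustrative outline only, not code from the PR: the names (`SparsityPattern`, `build_pattern`, `compute_values`) and the placeholder derivative are hypothetical, and a plain loop stands in for the `RAJA::forall` device kernel.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical, simplified sketch of the setup/evaluation split.
struct SparsityPattern {
    std::vector<int> rows, cols;        // COO pattern, fixed at setup time
    std::size_t nnz() const { return rows.size(); }
};

// Setup phase (host): walk the network once and record where each
// Jacobian element lives in the flat value array.
SparsityPattern build_pattern(int nbus) {
    SparsityPattern sp;
    for (int b = 0; b < nbus; ++b) {    // e.g. one diagonal entry per bus
        sp.rows.push_back(b);
        sp.cols.push_back(b);
    }
    return sp;
}

// Evaluation phase (on device via RAJA::forall in the real code; a plain
// loop stands in here): values are written straight into the flat array
// using the precomputed indices -- no D2H-compute-H2D round trip.
void compute_values(const SparsityPattern& sp,
                    const std::vector<double>& x,
                    std::vector<double>* vals) {
    vals->assign(sp.nnz(), 0.0);
    for (std::size_t k = 0; k < sp.nnz(); ++k)
        (*vals)[k] = 2.0 * x[sp.cols[k]];  // placeholder partial derivative
}
```

Because the pattern never changes after setup, only the value array needs to be touched per evaluation, which is what makes the pure-device value kernel possible.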

Key changes:
- Add ComputeEqJacValuesGPU_PBPOLRAJAHIOPSPARSE in new gpu.cpp/hpp files
- Add device arrays for flat-array indices (bus eqjacsp_selfidx, line
  eqjacsp_idx/eqjacsp_diag_idx/isdcline, gen xpdevidx/xpsetidx)
- Fix nnz counting bugs (missing gen/load entries, off-by-one in line
  loop) and populate flat-array indices during model setup
- Replace PETSc MatGetRow extraction in sparsity and values phases
- Handle parallel lines by sharing off-diagonal positions with atomicAdd
- Use pre-computed nnz in get_sparse_blocks_info instead of PETSc query
- Add correctness test (test_eqjac_compare) and performance benchmark
  (test_eqjac_perf)

Made-with: Cursor

This PR breaks HiOp MDS tests with RAJA! Need to investigate before merging to develop.
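
The parallel-lines point in the change list can be illustrated with a small sketch: two transmission lines between the same bus pair map to the same off-diagonal Jacobian slot, so concurrent kernel iterations must accumulate into it atomically. The real code uses `RAJA::atomicAdd` on device; the portable CAS-loop equivalent below is only an analogue, and all names in it are illustrative, not taken from the PR.

```cpp
#include <atomic>
#include <thread>
#include <vector>

// Portable atomic add for double (std::atomic<double>::fetch_add is C++20;
// this compare-exchange loop works from C++11 on).
inline void atomic_add(std::atomic<double>& slot, double v) {
    double old = slot.load(std::memory_order_relaxed);
    while (!slot.compare_exchange_weak(old, old + v)) {
        // 'old' is refreshed on failure; retry until the add lands.
    }
}

// Each "line" contributes its term to the shared off-diagonal slot from
// its own thread, mimicking concurrent kernel iterations.
double accumulate_parallel_lines(const std::vector<double>& contributions) {
    std::atomic<double> slot{0.0};
    std::vector<std::thread> workers;
    for (double c : contributions)
        workers.emplace_back([&slot, c] { atomic_add(slot, c); });
    for (auto& t : workers) t.join();
    return slot.load();
}
```

Without the atomic, two lines writing the same slot would race and one contribution could be lost.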

@pelesh pelesh self-assigned this Apr 23, 2026
@pelesh pelesh added the "enhancement" (New feature or request) and "opflow" (Related to ACOPF computations) labels Apr 23, 2026
@pelesh
Collaborator Author

pelesh commented Apr 23, 2026

Closing in favor of #48.

@pelesh pelesh closed this Apr 23, 2026