
Papilo primal/dual crush#1104

Open
aliceb-nv wants to merge 7 commits into NVIDIA:main from aliceb-nv:papilo-crush

Conversation

@aliceb-nv
Contributor

This PR adds support for crushing primal incumbents into the reduced Papilo problem space in MIP mode, and for crushing primal/dual vectors in LP mode.

A bugfix is also included to allow consecutive solves to be run in the same GTest process without corrupting the OpenMP runtime.

Closes #513
Closes #1060

Description

Issue

Checklist

  • I am familiar with the Contributing Guidelines.
  • Testing
    • New or existing tests cover these changes
    • Added tests
    • Created an issue to follow-up
    • NA
  • Documentation
    • The documentation is up to date with these changes
    • Added new documentation
    • NA

@aliceb-nv aliceb-nv added this to the 26.04 milestone Apr 15, 2026
@aliceb-nv aliceb-nv requested a review from rg20 April 15, 2026 10:18
@aliceb-nv aliceb-nv requested review from a team as code owners April 15, 2026 10:18
@aliceb-nv aliceb-nv requested a review from chris-maes April 15, 2026 10:18
@aliceb-nv aliceb-nv added non-breaking Introduces a non-breaking change improvement Improves an existing functionality labels Apr 15, 2026
@aliceb-nv aliceb-nv requested a review from tmckayus April 15, 2026 10:18
@copy-pr-bot

copy-pr-bot bot commented Apr 15, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@aliceb-nv
Contributor Author

/ok to test d47d709

@coderabbitai

coderabbitai bot commented Apr 15, 2026

📝 Walkthrough

Walkthrough

This change adds support for user-provided initial solutions in conjunction with Papilo presolve by implementing forward solution transformation ("crushing") from original to reduced variable space. It includes presolve method refinements, extended initial solution handling in the solver, comprehensive test coverage for round-trip solution transformations, and dataset expansion.

Changes

Cohort / File(s) Summary
Solution Crushing Implementation
cpp/src/mip_heuristics/presolve/third_party_presolve.hpp, cpp/src/mip_heuristics/presolve/third_party_presolve.cpp
Added crush_primal_solution and crush_primal_dual_solution methods that transform solutions from original to reduced Papilo space. The implementation validates PaPILO usage, replays reductions in forward order with explicit handling for kParallelCol, kRowBoundChangeForcedByRow, kCoefficientChange, and kSubstitutedColWithDual reduction types, and projects results onto reduced indices. Also conditionally applies SingletonStuffing presolve method only when !dual_postsolve and restricts integer integrality flags to MIP problems.
Initial Solution Integration
cpp/src/mip_heuristics/diversity/diversity_manager.cu, cpp/src/mip_heuristics/solve.cu
Integrated Papilo crushing for user-provided initial solutions in the diversity manager by checking Papilo data availability, copying assignments to host, calling crush methods, and expanding back to device. In the solver, removed the presolve disable condition for empty initial solutions, introduced early_incumbent_pool to collect multiple early heuristics, and ensured initial user solutions are processed in GPU-disabled paths. Changed success logging from CUOPT_LOG_INFO to CUOPT_LOG_DEBUG.
Test Coverage & Utilities
cpp/tests/linear_programming/unit_tests/presolve_test.cu, cpp/tests/mip/incumbent_callback_test.cu
Added KKT validation helpers (check_reduced_cost_consistency, check_dual_sign_consistency) and CSR transpose support for dual crushing verification. Introduced parameterized tests for dual crushing round-trip and warmstart workflows. Added scoped_env_restore_t RAII helper for environment variable management and new test validating initial solutions survive Papilo crushing.
Infrastructure & Resources
cpp/src/mip_heuristics/solve.cu (additional), datasets/mip/download_miplib_test_dataset.sh
Added OMP resource pause guard (omp_pause_resource_all) to prevent affinity/deadlock issues across consecutive solves. Extended MIPLIB test dataset with 27 additional instance identifiers to increase presolve test coverage.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~75 minutes

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

  • Docstring Coverage (⚠️ Warning): Docstring coverage is 30.77%, which is insufficient; the required threshold is 80.00%. Resolution: write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)
  • Title check (✅ Passed): The title 'Papilo primal/dual crush' directly and concisely describes the main implementation focus of the PR: adding support for crushing primal and dual solutions into Papilo's reduced problem space.
  • Description check (✅ Passed): The description clearly articulates the PR's purpose: crushing primal incumbents in MIP mode and primal/dual vectors for LP, plus an OpenMP bugfix, and links to related issues #513 and #1060.
  • Linked Issues check (✅ Passed): The PR fully addresses both linked issues: it implements crushing of original-space solutions into Papilo reduced space (resolving #513) and provides primal/dual crushing APIs (resolving #1060) via new methods in third_party_presolve and integration in solve paths.
  • Out of Scope Changes check (✅ Passed): All changes are directly aligned with the stated objectives: Papilo solution crushing for MIP/LP (third_party_presolve.cpp/hpp, diversity_manager.cu, solve.cu), an OMP bugfix (solve.cu), tests, and dataset extension; no unrelated changes detected.



Comment @coderabbitai help to get the list of available commands and usage tips.


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@cpp/src/mip_heuristics/presolve/third_party_presolve.cpp`:
- Around line 1092-1096: The loop that updates z for removed rows uses an exact
check y[i] == 0 which is too strict; replace that check with the
function/epsilon-based test used elsewhere in this function (i.e., treat y as
zero when fabs(y[i]) <= the existing numeric tolerance or by calling the
existing isZero/is_near_zero helper) so that only truly near-zero y[i] are
skipped; locate the loop referencing storage.nRowsOriginal, row_survives, y,
A_offsets, A_indices, z and get_coeff and change the condition to use the same
tolerance variable or helper used elsewhere in this file to avoid injecting
noise into z (and therefore z_reduced).
- Around line 973-980: The kParallelCol case only updates the primal x but fails
to fold reduced costs; update the corresponding dual/reduced-cost vector (z)
analogously so eliminated parallel-column contributions aren't lost: inside the
ReductionType::kParallelCol branch (where col1 = indices[first], col2 =
indices[first+2], scale = values[first+4] and x[col2] += scale * x[col1]) also
perform z[col2] += scale * z[col1] (using the same index mapping and scale),
ensuring you access the z array from the same problem context and respect any
index-mapping helpers used elsewhere in this function.

In `@cpp/tests/linear_programming/unit_tests/presolve_test.cu`:
- Around line 859-861: The test currently uses EXPECT_LT(warm_iters, cold_iters)
which enforces a strict decrease and makes the test flaky; change the assertion
to EXPECT_LE(warm_iters, cold_iters) so the warm-started PDLP is allowed to take
the same number of iterations as the cold run. Update the failure message if
desired but keep the same variables (warm_iters, cold_iters) and replace the
EXPECT_LT macro with EXPECT_LE in the presolve_test assertion.

In `@cpp/tests/mip/incumbent_callback_test.cu`:
- Around line 41-44: The destructor of scoped_env_restore_t always calls
::setenv(name_, prev_value_.c_str(), 1) which re-creates the variable as an
empty string if it was originally unset; change scoped_env_restore_t to record
whether the environment var existed (e.g. a bool prev_exists_ set in the
constructor when std::getenv(env_name) != nullptr) and in
~scoped_env_restore_t() call ::unsetenv(name_) when prev_exists_ is false,
otherwise restore the original value via ::setenv using prev_value_; update the
constructor and member fields (prev_exists_ and prev_value_) accordingly and
ensure behavior is consistent for CUOPT_DISABLE_GPU_HEURISTICS and similar uses.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 81564330-0c16-431c-9cbc-81de259afadb

📥 Commits

Reviewing files that changed from the base of the PR and between 9fe95f2 and d47d709.

📒 Files selected for processing (7)
  • cpp/src/mip_heuristics/diversity/diversity_manager.cu
  • cpp/src/mip_heuristics/presolve/third_party_presolve.cpp
  • cpp/src/mip_heuristics/presolve/third_party_presolve.hpp
  • cpp/src/mip_heuristics/solve.cu
  • cpp/tests/linear_programming/unit_tests/presolve_test.cu
  • cpp/tests/mip/incumbent_callback_test.cu
  • datasets/mip/download_miplib_test_dataset.sh

Comment on lines +973 to +980
case ReductionType::kParallelCol: {
// Storage layout: [orig_col1, flags1, orig_col2, flags2, -1]
// [col1lb, col1ub, col2lb, col2ub, col2scale]
int col1 = indices[first];
int col2 = indices[first + 2];
const f_t& scale = values[first + 4];
x[col2] += scale * x[col1];
break;

⚠️ Potential issue | 🟠 Major

Fold reduced costs through kParallelCol as well.

This forward replay updates the survivor primal value, but it leaves z in the original basis. After the final projection, any eliminated parallel column contribution is dropped, so z_reduced is wrong whenever dual/reduced-cost crushing hits a parallel-column reduction.

🐛 Suggested fix
       case ReductionType::kParallelCol: {
         // Storage layout: [orig_col1, flags1, orig_col2, flags2, -1]
         //                 [col1lb,    col1ub, col2lb,    col2ub, col2scale]
         int col1         = indices[first];
         int col2         = indices[first + 2];
         const f_t& scale = values[first + 4];
         x[col2] += scale * x[col1];
+        if (crush_rc) { z[col2] += scale * z[col1]; }
         break;
       }

As per coding guidelines, **/*.{cu,cuh,cpp,hpp,h}: Validate algorithm correctness in optimization logic and ensure variables and constraints are accessed from the correct problem context (original vs presolve vs folded vs postsolve); verify index mapping consistency across problem transformations.


Comment on lines +1092 to +1096
for (int i = 0; i < (int)storage.nRowsOriginal; ++i) {
if (row_survives[i] || y[i] == 0) continue;
for (i_t p = A_offsets[i]; p < A_offsets[i + 1]; ++p) {
z[A_indices[p]] += y[i] * get_coeff(i, A_indices[p]);
}

⚠️ Potential issue | 🟡 Minor

Avoid exact-zero checks on removed-row duals.

y[i] == 0 is too strict for approximate PDLP/DualSimplex duals. Near-zero removed-row multipliers will still enter this correction path and inject noise into z_reduced. Use the same numeric tolerance machinery you already use elsewhere in this function.

As per coding guidelines, **/*.{cu,cuh,cpp,hpp,h}: Check numerical stability: prevent overflow/underflow, precision loss, division by zero/near-zero, and use epsilon comparisons for floating-point equality checks.


Comment on lines +859 to +861
EXPECT_LT(warm_iters, cold_iters)
<< "warmstarted solve should not take more iterations than cold solve"
<< " (cold=" << cold_iters << ", warm=" << warm_iters << ")";

⚠️ Potential issue | 🟡 Minor

Don’t require a strict iteration drop here.

Warm-started PDLP can legitimately converge in the same number of iterations as the cold run because of fixed startup/restart behavior. EXPECT_LT makes this test flaky even when the warm start is working.

💡 Suggested fix
-  EXPECT_LT(warm_iters, cold_iters)
+  EXPECT_LE(warm_iters, cold_iters)
     << "warmstarted solve should not take more iterations than cold solve"
     << " (cold=" << cold_iters << ", warm=" << warm_iters << ")";

Comment on lines +41 to +44
if (const char* prev = std::getenv(env_name)) { prev_value_ = prev; }
::setenv(env_name, new_value, 1);
}
~scoped_env_restore_t() { ::setenv(name_, prev_value_.c_str(), 1); }

⚠️ Potential issue | 🟡 Minor

Restore the original unset state, not an empty string.

If CUOPT_DISABLE_GPU_HEURISTICS was originally unset, the destructor currently leaves it defined as "". That leaks process-global state across tests and can change behavior for code that branches on std::getenv(...) != nullptr.

💡 Suggested fix
 class scoped_env_restore_t {
  public:
   scoped_env_restore_t(const char* env_name, const char* new_value) : name_(env_name)
   {
-    if (const char* prev = std::getenv(env_name)) { prev_value_ = prev; }
+    if (const char* prev = std::getenv(env_name)) {
+      had_prev_value_ = true;
+      prev_value_     = prev;
+    }
     ::setenv(env_name, new_value, 1);
   }
-  ~scoped_env_restore_t() { ::setenv(name_, prev_value_.c_str(), 1); }
+  ~scoped_env_restore_t()
+  {
+    if (had_prev_value_) {
+      ::setenv(name_, prev_value_.c_str(), 1);
+    } else {
+      ::unsetenv(name_);
+    }
+  }
   scoped_env_restore_t(const scoped_env_restore_t&)            = delete;
   scoped_env_restore_t& operator=(const scoped_env_restore_t&) = delete;

  private:
   const char* name_;
+  bool had_prev_value_ = false;
   std::string prev_value_;
 };

As per coding guidelines, **/*test*.{cpp,cu,py}: Ensure test isolation: prevent GPU state, cached memory, and global variables from leaking between test cases; verify each test independently initializes its environment.

