[BUG] Critical logic errors, OOB hazards, and validation gaps in cuOpt Python layer

A comprehensive audit of the `cuOpt` Python codebase has identified several critical logic bugs, validation gaps, and usability issues.

**Notably, some of these validation gaps (Issues 1 & 2) may allow out-of-range indices / malformed CSR inputs to pass the Python layer, which could lead to out-of-bounds access or undefined behavior in downstream CUDA/native layers, depending on the robustness of lower-level validation. Additionally, Issue 4 may cause incorrect MIP detection, which can lead to unexpected failures or severe performance degradation in production pipelines (potential DoS via crash/timeout), though this is not an overflow issue by itself.**

---

### **1. [Critical] Off-by-one index validation (OOB Hazard)**

* **Problem:** `validate_range` accepts indices equal to `num_locations` (max).
* **Code Evidence:**

```python
validate_range(locations, ..., 0, self.get_num_locations())

```

* **Impact:** Allows invalid index  for 0-based indexing, potentially leading to out-of-bounds access or undefined behavior in downstream CUDA/native layers.

---

### **2. [Critical] Key Shadowing in `construct_rerouting_model` (State Corruption)**

* **Problem:** The function uses a leaked loop variable `name` as a dictionary key instead of the intended `demand_name`.
* **Code Evidence:**

```python
# 'name' accidentally holds the last value of a previous loop
capacity_dimensions_h[name] = {"demand": order_demand, "capacity": vehicle_cap}

```

* **Impact:** Critical state corruption. Base demand data is stored under an incorrect key, breaking the rerouting logic and downstream solver configuration.

---

### **3. [Robustness] Missing CSR Invariant & dtype Validation**

* **Problem:** `set_csr_constraint_matrix` forwards raw arrays without validating CSR invariants (monotonicity, nnz match) or dtypes (int64 vs int32).
* **Code Evidence:**

```python
# No validation for dtype or CSR structure before calling super()
super().set_csr_constraint_matrix(A_values, A_indices, A_offsets)

```

* **Impact:** Confusing low-level C++/CUDA errors or silent failures for common NumPy default types (int64).

---

### **4. [Performance/Robustness] `is_mip()` type normalization & inefficient iteration**

* **Problem:** Performs  Python iteration on `cudf.Series` and fails to recognize `numpy.bytes_` or `numpy.str_` scalars.
* **Impact:** Inefficient GPU-to-CPU data transfer and potential incorrect solver mode selection (LP vs MIP).

---

### **5. [Usability] Inconsistent Environment Variable Parsing**

* **Problem:** `CUOPT_EXTRA_TIMESTAMPS` ignores standard truthy values like `"1"`, `"yes"`, or `"on"`.
* **Code Evidence:**

```python
os.environ.get(..., False) in (True, "True", "true")

```

* **Impact:** Silent misconfiguration and inconsistent behavior in containerized/shell environments.

---

### **6. [Docs] Docstring/API Mismatch**

* **Problem:** Docstring instructs users to call `set_maximization()`, but the actual method name is `set_maximize()`.
* **Impact:** Avoidable `AttributeError` and user confusion.

---

### **Suggested Fix Strategies**

* **Fix Key Access:** Replace `capacity_dimensions_h[name]` with `capacity_dimensions_h[demand_name]` in `routing.py`.
* **Strict Indexing:** Update call sites to use `max = get_num_locations() - 1` or modify `validate_range` to support exclusive bounds.
* **Robust CSR Handling:** Add explicit `dtype=np.int32` casting and basic CSR invariant checks in the Python layer.
* **Shared Env Helper:** Use a normalized utility function to check for `{"1", "true", "yes", "on", "y"}`.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Critical logic errors, OOB hazards, and validation gaps in cuOpt Python layer #869

1. [Critical] Off-by-one index validation (OOB Hazard)

2. [Critical] Key Shadowing in `construct_rerouting_model` (State Corruption)

3. [Robustness] Missing CSR Invariant & dtype Validation

4. [Performance/Robustness] `is_mip()` type normalization & inefficient iteration

5. [Usability] Inconsistent Environment Variable Parsing

6. [Docs] Docstring/API Mismatch

Suggested Fix Strategies

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[BUG] Critical logic errors, OOB hazards, and validation gaps in cuOpt Python layer #869

Description

1. [Critical] Off-by-one index validation (OOB Hazard)

2. [Critical] Key Shadowing in construct_rerouting_model (State Corruption)

3. [Robustness] Missing CSR Invariant & dtype Validation

4. [Performance/Robustness] is_mip() type normalization & inefficient iteration

5. [Usability] Inconsistent Environment Variable Parsing

6. [Docs] Docstring/API Mismatch

Suggested Fix Strategies

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

2. [Critical] Key Shadowing in `construct_rerouting_model` (State Corruption)

4. [Performance/Robustness] `is_mip()` type normalization & inefficient iteration