
Functionality to patch models with changes#1031

Merged
tsmbland merged 33 commits into main from patch_model_v2 on Dec 19, 2025

Conversation

@tsmbland (Collaborator) commented Dec 16, 2025

Description

Following on from the discussion in #937, this PR introduces a way to make changes to existing models for testing purposes. For example, if you wanted to make sure that the validation checks flag a certain parameter combination, you could introduce these changes as a patch to an existing example model, then run the validate command (handle_validate_command) on the patched model. At the moment this is only for testing purposes - rather than letting users patch models with "_diff" files as discussed previously (I've explored this in #1026, but it proved very fiddly).

The approach is to build a ModelPatch object, which can be made up of one or multiple FilePatch objects, and/or a toml table to indicate changes to model.toml. The ModelPatch gets applied to an existing model, outputting a new set of files to a temporary directory. The temporary directory can then be passed to load_model the same way that permanent model directory would, and will be deleted after the command has finished.

I've included some examples of how you might use this in tests/patch.rs (not necessarily good/useful examples, just examples to check everything is working). There's currently one annoyance which I'll comment below.
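The flow described above (build a patch, write the patched files out, then load the model from the output directory) could be sketched roughly like this. This is a hypothetical stand-alone sketch, not the PR's actual implementation: the ModelPatch and FilePatch names come from the description, while the fields and the apply signature are assumptions.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Hypothetical sketch: replace the contents of one model file.
struct FilePatch {
    /// Path relative to the model directory, e.g. "assets.csv" (assumed field)
    rel_path: PathBuf,
    /// Replacement file contents (assumed field)
    content: String,
}

/// Hypothetical sketch: a set of file patches applied to a copy of a model.
struct ModelPatch {
    file_patches: Vec<FilePatch>,
}

impl ModelPatch {
    /// Copy the files in `base_dir` into `out_dir`, then overwrite the
    /// patched files. `out_dir` stands in for the temporary directory that
    /// would later be passed to load_model.
    fn apply(&self, base_dir: &Path, out_dir: &Path) -> io::Result<()> {
        fs::create_dir_all(out_dir)?;
        // Copy every regular file from the base model directory
        for entry in fs::read_dir(base_dir)? {
            let entry = entry?;
            if entry.file_type()?.is_file() {
                fs::copy(entry.path(), out_dir.join(entry.file_name()))?;
            }
        }
        // Overwrite the patched files with their replacement contents
        for patch in &self.file_patches {
            fs::write(out_dir.join(&patch.rel_path), &patch.content)?;
        }
        Ok(())
    }
}
```

In a test, the output directory would then be passed to load_model in place of a permanent model directory and deleted afterwards.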

Fixes # (issue)

Type of change

  • Bug fix (non-breaking change to fix an issue)
  • New feature (non-breaking change to add functionality)
  • Refactoring (non-breaking, non-functional change to improve maintainability)
  • Optimization (non-breaking change to speed up the code)
  • Breaking change (whatever its nature)
  • Documentation (improve or add documentation)

Key checklist

  • All tests pass: $ cargo test
  • The documentation builds and looks OK: $ cargo doc

Further checks

  • Code is commented, particularly in hard-to-understand areas
  • Tests added that prove fix is effective or that feature works

@tsmbland (Collaborator, Author) commented

@alexdewar This is nearly working. Before I finish it up, are you happy with the general approach (i.e. the approach for building patches, saving to a temp directory etc.)?

One annoying thing is that it doesn't work with multiple tests in the same file because of an issue with the logger (this is why the tests are currently failing). I guess we could just keep all these tests in separate files, but if we're going to do this a lot then that could get a bit annoying.

Review comment on the following code:

    /// Merge a TOML patch into a base TOML string and return the merged TOML.
    fn merge_model_toml(base_toml: &str, patch: &toml::value::Table) -> Result<String> {
Collaborator:

Could this function just consume patch instead?

Collaborator (Author):

Possibly, but since it's called by ModelPatch::build on self.toml_patch you'd have to clone it first
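For reference, the kind of recursive merge being discussed might look like this minimal sketch, which mirrors toml::value::Table with a plain BTreeMap (the Value enum and the merge signature are assumptions, not the PR's code). Taking patch by value is what would force the clone at the ModelPatch::build call site:

```rust
use std::collections::BTreeMap;

/// Minimal stand-in for a TOML value: a leaf string or a nested table.
#[derive(Clone, Debug, PartialEq)]
enum Value {
    Str(String),
    Table(BTreeMap<String, Value>),
}

/// Merge `patch` into `base`, consuming the patch: nested tables merge
/// recursively, any other patch value overwrites the base entry.
fn merge(base: &mut BTreeMap<String, Value>, patch: BTreeMap<String, Value>) {
    for (key, patch_val) in patch {
        // Remove-then-reinsert avoids holding a mutable borrow of `base`
        // across the insert in the fallback arm.
        match (base.remove(&key), patch_val) {
            (Some(Value::Table(mut b)), Value::Table(p)) => {
                merge(&mut b, p);
                base.insert(key, Value::Table(b));
            }
            (_, v) => {
                base.insert(key, v);
            }
        }
    }
}
```

Nested tables merge recursively while anything else in the patch simply replaces the base entry, which matches the usual semantics for patching a TOML document.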

@alexdewar (Collaborator) left a comment

I think this looks great in general. Would be super handy!

We could probably add more helper functions/macros for the common cases, e.g. "if you add this line to assets.csv of the simple example, validation should fail with this error message", but we can add those as we go along.

I think it is a problem if it can only be used in integration tests though -- ideally we'll want to do this in normal unit tests. Can we rework things so we can optionally skip setting up the logger for validating/running models? For validation, we could actually just call the load_model function rather than handle_validate_command, because we also don't need or want to load settings.toml.

@tsmbland (Collaborator, Author) commented

Thanks

I think this looks great in general. Would be super handy!

We could probably add more helper functions/macros for the common cases, e.g. "if you add this line to assets.csv of the simple example, validation should fail with this error message", but we can add those as we go along.

Good idea

I think it is a problem if it can only be used in integration tests though -- ideally we'll want to do this in normal unit tests. Can we rework things so we can optionally skip setting up the logger for validating/running models? For validation, we could actually just call the load_model function rather than handle_validate_command, because we also don't need or want to load settings.toml.

Agree we can just call load_model

The other problem is with the BROKEN_OPTIONS_ALLOWED global variable which gets set when we run load_model and can only be set once per session

    BROKEN_OPTIONS_ALLOWED
        .set(model_params.allow_broken_options)
        .unwrap(); // Will only fail if there is a race condition, which shouldn't happen

What do you think we should do about this?

@alexdewar (Collaborator) commented Dec 18, 2025

The other problem is with the BROKEN_OPTIONS_ALLOWED global variable which gets set when we run load_model and can only be set once per session

    BROKEN_OPTIONS_ALLOWED
        .set(model_params.allow_broken_options)
        .unwrap(); // Will only fail if there is a race condition, which shouldn't happen

What do you think we should do about this?

It's a bit gross but I think we'll just have to disable the check when running tests 😞. Maybe something like this?

        // Set flag signalling whether broken model options are allowed or not
        let result = BROKEN_OPTIONS_ALLOWED.set(model_params.allow_broken_options);
        if result.is_err() {
            if cfg!(test) {
                // Sanity check
                assert_eq!(
                    model_params.allow_broken_options,
                    broken_model_options_allowed()
                );
            } else {
                panic!("Attempted to set BROKEN_OPTIONS_ALLOWED twice");
            }
        }

It should probably live in its own function. I guess this is the downside of having mutable global variables...

The problem with this approach is that we'll only be able to have unit tests for non-broken or broken options, but not both. We can work around this by having any tests for broken options as an integration test instead so it runs in a separate process.

@codecov codecov bot commented Dec 18, 2025

Codecov Report

❌ Patch coverage is 87.13235% with 35 lines in your changes missing coverage. Please review.
✅ Project coverage is 81.07%. Comparing base (73a59b0) to head (589fbcd).
⚠️ Report is 36 commits behind head on main.

Files with missing lines     Patch %   Lines
src/patch.rs                 85.96%    8 Missing and 24 partials ⚠️
src/fixture.rs               94.28%    0 Missing and 2 partials ⚠️
src/model/parameters.rs      88.88%    1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1031      +/-   ##
==========================================
+ Coverage   80.71%   81.07%   +0.35%     
==========================================
  Files          52       53       +1     
  Lines        6924     7193     +269     
  Branches     6924     7193     +269     
==========================================
+ Hits         5589     5832     +243     
- Misses       1063     1066       +3     
- Partials      272      295      +23     

☔ View full report in Codecov by Sentry.

@tsmbland (Collaborator, Author) commented

Thanks @alexdewar. I think this is ready for another look now.

I've added a couple of macros that could prove useful for testing. Currently they're just for applying file patches to the simple model, but they could be adapted for TOML patches and other example models.

As you've mentioned, we won't be able to test broken options like this. I initially thought we could set allow_broken_options to true as part of the patch within build_patched_simple_tempdir so that it's always turned on in the tests (I figured that's better than always having it turned off), but one other test didn't like this (test_model_params_from_path), so I backtracked.

@tsmbland tsmbland marked this pull request as ready for review December 18, 2025 17:34
@alexdewar (Collaborator) left a comment

This is great!

I've got a couple of small suggestions about the API, but these aren't important and could be done later.

  1. I'd add a ModelPatch::from_example helper, as that's what we'll want to do 99% of the time
  2. I'd provide a macro to check the actual error message in case of error, so the test doesn't pass because some other error has occurred. I know there are a lot of places where we don't check the error message, but I think this is pretty fragile. More recently I've been trying to use the assert_error! macro in fixture.rs for tests. Example: we rename a commodity in the simple example, now a test which checks for errors related to commodities continues to pass, but because the commodity doesn't exist, not because the region was wrong (or something). We then break the functionality for checking regions are valid for commodities, but the test continues to pass, because all we're checking is whether there is an error, not which error occurred.

We could have assert_validation_succeeds! and assert_validation_fails_with! macros, or something.

Up to you whether you fancy doing this now or opening an issue. I'm sure we'll want to tweak the API as we go along anyway.

Now I think we need to use this in anger! A lot of the code doesn't have tests (or doesn't test all possible cases). We might not have time to retrofit tests for old code, but it would be good to add new tests along with new code as much as possible. Now it should be a lot easier 😄
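A macro along the lines suggested above could look like this hypothetical sketch (the assert_validation_fails_with! name comes from the comment; the substring-matching behaviour and the shape of the macro are assumptions):

```rust
/// Hypothetical sketch: assert that a Result is an error whose
/// message contains the expected substring, rather than just
/// asserting that *some* error occurred.
macro_rules! assert_validation_fails_with {
    ($result:expr, $expected:expr) => {
        match $result {
            Ok(_) => panic!("expected validation to fail, but it succeeded"),
            Err(e) => {
                let msg = e.to_string();
                assert!(
                    msg.contains($expected),
                    "error message {:?} does not contain {:?}",
                    msg,
                    $expected
                );
            }
        }
    };
}
```

Matching on a substring of the message (rather than the full text) keeps tests from breaking on incidental rewording while still pinning down which check actually fired.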

@tsmbland (Collaborator, Author) commented

Thanks!

This is great!

I've got a couple of small suggestions about the API, but these aren't important and could be done later.

  1. I'd add a ModelPatch::from_example helper, as that's what we'll want to do 99% of the time

Good idea, I've added this

  2. I'd provide a macro to check the actual error message in case of error, so the test doesn't pass because some other error has occurred. I know there are a lot of places where we don't check the error message, but I think this is pretty fragile. More recently I've been trying to use the assert_error! macro in fixture.rs for tests. Example: we rename a commodity in the simple example, now a test which checks for errors related to commodities continues to pass, but because the commodity doesn't exist, not because the region was wrong (or something). We then break the functionality for checking regions are valid for commodities, but the test continues to pass, because all we're checking is whether there is an error, not which error occurred.

We could have assert_validation_succeeds! and assert_validation_fails_with! macros, or something.

Yeah, could be useful. I figured it's best for now just to return the error and leave it up to the caller whether to check the specific error message. But if we find we're doing this a lot then yes, a dedicated macro would be helpful.

I've also found with this kind of thing that the specific error message isn't always predictable or what you might expect. For example, deleting a commodity from the simple example you get

Can only provide demand data for SVD commodities. Found entry for 'RSHEAT'

Not necessarily a bad error message, but also if you're writing a test for this you wouldn't necessarily predict that that's the error message you'd get. I could also imagine fairly innocent changes to the code such as changing the order that files are read could lead to a completely different error message.

That said, maybe the takeaway here is that we need to be a bit more intentional with error messages. In this case, I think something more direct like Unrecognised commodity: 'RSHEAT' would be a bit clearer.

Up to you whether you fancy doing this now or opening an issue. I'm sure we'll want to tweak the API as we go along anyway.

Now I think we need to use this in anger! A lot of the code doesn't have tests (or doesn't test all possible cases). We might not have time to retrofit tests for old code, but it would be good to add new tests along with new code as much as possible. Now it should be a lot easier 😄

👍

@tsmbland tsmbland merged commit 78916ad into main Dec 19, 2025
8 checks passed
@tsmbland tsmbland deleted the patch_model_v2 branch December 19, 2025 12:13