Split leaf and root dynamics optimization notebooks by SarahAlidoost · Pull Request #69 · WUR-AI/diffWOFOST

SarahAlidoost · 2025-12-17T15:00:05Z

closes #66
closes #68

🔴 After merging #61, I will update the docs.

SarahAlidoost · 2025-12-17T15:30:30Z

@michielkallenberg @ronvree @SCiarella @fnattino In this pull request, the straight-through estimator (STE) method is used for the sigmoid approximation of the parameter SPAN in leaf_dynamics; see issue #68. Please let me know what you think about this approach, or if you have any other ideas or suggestions. Thanks!

SCiarella

Thanks @SarahAlidoost, I really prefer this new approach to the sigmoid because the gradient now is much more controlled.

Later, we could even allow the user to set a lower sharpness to have a better gradient for parameter optimization, but right now the value of 1000 that you have chosen looks ok.

I approve this PR for merge 👍

ronvree · 2025-12-18T13:24:53Z

I also think this is a really good idea! During the development it's much better to remove the ambiguity of whether some sigmoid approximates the original model well enough. Great suggestion!

Although the soft threshold might in some cases have a more realistic biophysical interpretation (as also mentioned by Francesco in issue #68), I feel these discussions should be separated from the initial model development.

If I understand correctly, the sharpness should ideally still be set based on the magnitude of the input if we want to guarantee there are no issues during optimization?

Would it make sense to implement the thresholds as nn.Module instances so it's easier to inspect their behavior and see the impact of any potential adjustments?

Thanks @SarahAlidoost! I also approve the PR for merge

SarahAlidoost · 2026-01-05T11:15:18Z

> I also think this is a really good idea! During the development it's much better to remove the ambiguity of whether some sigmoid approximates the original model well enough. Great suggestion!

> Although the soft threshold might in some cases have a more realistic biophysical interpretation (as also mentioned by Francesco in issue #68), I feel these discussions should be separated from the initial model development.

> If I understand correctly, the sharpness should ideally still be set based on the magnitude of the input if we want to guarantee there are no issues during optimization?

Yes, smaller sharpness means larger gradients (DALV is more sensitive to changes in SPAN), and vice versa. But there is a practical challenge: if training is unstable or the gradients vanish, the sharpness should be adjusted accordingly.

> Would it make sense to implement the thresholds as nn.Module instances so it's easier to inspect their behavior and see the impact of any potential adjustments?

If I understood your comment correctly, you mean wrapping the threshold in an nn.Module. But that is just a wrapper; we still need a quantization operation, whether a sigmoid or something else. A good way to handle this, I think, would be to expose sharpness as a learnable parameter in the leaf_dynamics model. Please see my reply here.

> Thanks @SarahAlidoost! I also approve the PR for merge

Thanks for reviewing it 👍
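
The sharpness trade-off mentioned in this reply can be checked numerically. The sketch below is not taken from the PR: it assumes the approximation has the form sigmoid(sharpness * x), where x is the distance of the input from the threshold, and the helper names are hypothetical. It compares the gradient at the threshold with the gradient half a unit away from it.

```python
import math


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def sigmoid_grad(x: float, sharpness: float) -> float:
    """Derivative of sigmoid(sharpness * x) with respect to x."""
    s = sigmoid(sharpness * x)
    return sharpness * s * (1.0 - s)


# Hypothetical sharpness values; 1000 is the value mentioned in this thread.
for sharpness in (1.0, 10.0, 1000.0):
    at_threshold = sigmoid_grad(0.0, sharpness)
    away = sigmoid_grad(0.5, sharpness)
    print(f"sharpness={sharpness}: at threshold {at_threshold:.3g}, 0.5 away {away:.3g}")
```

Under this assumption the gradient exactly at the threshold grows with the sharpness (it equals sharpness / 4), while the gradient away from the threshold shrinks quickly and underflows to zero in floating point for a sharpness of 1000. This is one way to read why the sharpness may need to be matched to the magnitude of the input.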

fnattino

Thanks @SarahAlidoost! I have commented on this as part of #68. Looks good to me. My main concern was the large value of the sharpness parameter but, as also mentioned in the issue, we can probably come back later to what a reasonable default value for it is!

SarahAlidoost · 2026-01-08T15:26:42Z

@michielkallenberg when you have time, can you please have a look at this PR and discussion in #68? Thanks!

michielkallenberg · 2026-01-08T19:30:40Z

Thanks @SarahAlidoost. I did not know this STE trick. Nice.
Just a double check: the zero-gradient that was found here does not result from the external-state-fixing? (An issue we identified before)
In any case I figure STE looks like a good approach to me, also after having read the discussion.

SarahAlidoost · 2026-01-09T09:14:14Z

> Thanks @SarahAlidoost. I did not know this STE trick. Nice. Just a double check: the zero-gradient that was found here does not result from the external-state-fixing? (An issue we identified before)

No, that was not the issue here. LAI should have a gradient with respect to SPAN. The cause was a sharp sigmoid that acted like a step function in the previous changes. This is now fixed with the STE method.

> In any case I figure STE looks like a good approach to me, also after having read the discussion.

Thanks for reviewing.
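
For readers who, like the reviewer, have not seen the STE trick before, here is a minimal PyTorch sketch of the general idea. It is not the code from this PR: the function name `ste_threshold`, the variable names, and the numbers are assumptions for illustration. The forward pass returns the hard step, and the backward pass uses the gradient of a sigmoid instead of the zero gradient of the step.

```python
import torch


def ste_threshold(x: torch.Tensor, sharpness: float = 1.0) -> torch.Tensor:
    """Hard step in the forward pass, sigmoid gradient in the backward pass."""
    soft = torch.sigmoid(sharpness * x)
    hard = (x > 0).to(x.dtype)
    # The value equals `hard` up to floating-point rounding. The detached
    # term carries no gradient, so autograd differentiates only `soft`.
    return soft + (hard - soft).detach()


# Hypothetical values: a leaf slightly older than its life span.
span = torch.tensor(30.0, requires_grad=True)
leaf_age = torch.tensor(30.5)

is_dead = ste_threshold(leaf_age - span, sharpness=1.0)
is_dead.backward()

print(is_dead.item())   # forward value of the hard step
print(span.grad.item())  # gradient taken from the sigmoid surrogate
```

With a plain step function `span.grad` would be zero everywhere; with the surrogate it is nonzero near the threshold, and its size depends on the chosen sharpness.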

sonarqubecloud · 2026-01-09T12:21:41Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

ronvree · 2026-01-12T14:06:56Z

> If I understood your comment correctly, you mean wrapping the threshold in an nn.Module. But that is just a wrapper; we still need a quantization operation, whether a sigmoid or something else. A good way to handle this, I think, would be to expose sharpness as a learnable parameter in the leaf_dynamics model. Please see my reply here.

Thanks for your response! What I meant was that this soft threshold pattern will recur a lot in diffWOFOST, and understanding its behavior will matter a lot for understanding the model. I agree that an nn.Module is in a way just a wrapper and doesn't provide any functional behavior, but it makes things more modular and makes refactoring easier. For me it would make sense to implement it that way, but of course it's just a suggestion.
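
To make the suggestion concrete, here is a rough sketch of what such a module could look like. Everything in it is hypothetical: the class name `SoftThreshold`, its arguments, and the choice to combine it with the STE pattern are assumptions, not the diffWOFOST API. The `learnable` flag illustrates the idea of exposing sharpness as a learnable parameter.

```python
import torch
from torch import nn


class SoftThreshold(nn.Module):
    """Hypothetical reusable threshold: hard step forward, sigmoid gradient backward."""

    def __init__(self, sharpness: float = 1000.0, learnable: bool = False):
        super().__init__()
        value = torch.tensor(float(sharpness))
        if learnable:
            # Registered as a parameter so an optimizer can update it.
            self.sharpness = nn.Parameter(value)
        else:
            # Stored as a buffer: moves with the module but is not trained.
            self.register_buffer("sharpness", value)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        soft = torch.sigmoid(self.sharpness * x)
        hard = (x > 0).to(x.dtype)
        return soft + (hard - soft).detach()


threshold = SoftThreshold(sharpness=5.0, learnable=True)
x = torch.linspace(-1.0, 1.0, 5, requires_grad=True)
y = threshold(x)
y.sum().backward()

print(y.detach())  # hard 0/1 values in the forward pass
print(x.grad)      # surrogate gradients from the sigmoid
```

Keeping the pattern in one place like this would let its behavior be inspected and changed without touching each model that uses a threshold.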

SarahAlidoost · 2026-01-13T15:58:58Z

> > If I understood your comment correctly, you mean wrapping the threshold in an nn.Module. But that is just a wrapper; we still need a quantization operation, whether a sigmoid or something else. A good way to handle this, I think, would be to expose sharpness as a learnable parameter in the leaf_dynamics model. Please see my reply here.

> Thanks for your response! What I meant was that this soft threshold pattern will recur a lot in diffWOFOST, and understanding its behavior will matter a lot for understanding the model. I agree that an nn.Module is in a way just a wrapper and doesn't provide any functional behavior, but it makes things more modular and makes refactoring easier. For me it would make sense to implement it that way, but of course it's just a suggestion.

Thanks for the clarification. I submitted issue #72 for this.

SarahAlidoost added 10 commits December 15, 2025 09:38

0b6a937 update nb
103377e fix docstring in leaf_dynamics
bd9ffde fix a test in test_leaf_dynamics
9ea633b add STE method
68f1244 split optimization nb to root and leaf nbs
7dbd88e Merge branch 'main' into split_nbs
a249712 rerun the notebooks
d4365df add STE method to sigmoid in leaf_dynamics
3fbf6d7 fix test in leaf_dynamics
284c639 fix linter errors

SarahAlidoost marked this pull request as ready for review December 17, 2025 15:26

SarahAlidoost changed the title ~~Split leaf and root dynamics~~ Split leaf and root dynamics optimization notebooks Dec 17, 2025

SarahAlidoost requested review from SCiarella, fnattino, michielkallenberg and ronvree December 17, 2025 15:27

SCiarella approved these changes Dec 18, 2025

fnattino mentioned this pull request Dec 18, 2025

sigmoid approximation and Straight-Through Estimator: SPAN in leaf_dynamics #68

Closed

ronvree approved these changes Dec 18, 2025

fnattino approved these changes Jan 5, 2026

fnattino mentioned this pull request Jan 7, 2026

Implement access to device and dtype #65

Merged

michielkallenberg approved these changes Jan 8, 2026

SarahAlidoost added 2 commits January 9, 2026 12:57

61ae94e Merge branch 'main' into split_nbs
a5201cf rerun nbs

SarahAlidoost merged commit 4a25b90 into main Jan 9, 2026
11 checks passed

SarahAlidoost deleted the split_nbs branch January 9, 2026 12:38

SarahAlidoost mentioned this pull request Jan 13, 2026

Expose sharpness in leaf_dynamics as a learnable parameter #72

Open
