Close form optimisation for block linear -> activation -> linear #231

Edarfix wants to merge 20 commits into growingnet:main from …
Conversation
Codecov Report: ❌ Patch coverage is …
Pull request overview
Adds a new closed-form optimizer for Linear -> activation -> Linear blocks (alternating closed-form updates for the two linear layers), along with a dedicated unit test suite, and updates TensorStatistic typing / naming to remove a Pylance warning.
Changes:
- Add src/gromo/utils/optimisation.py implementing the closed-form block optimizer, statistics collection, and helper solvers.
- Add tests/test_optimisation.py covering supported block layouts, solver branches, early-stopping paths, and end-to-end teacher/student improvement.
- Fix naming/typing around TensorStatisticWithEstimationError and broaden the accepted update function signature.
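To illustrate the idea behind alternating closed-form updates (this is a minimal sketch, not the PR's implementation in optimisation.py): with the first linear layer held fixed, the second layer of a Linear -> activation -> Linear block admits an exact least-squares solution over the hidden activations.

```python
# Sketch of one half of the alternation: closed-form update of the second
# linear layer while the first is frozen. All shapes and names here are
# illustrative assumptions, not the PR's actual API.
import torch

torch.manual_seed(0)
x = torch.randn(128, 5)   # inputs
y = torch.randn(128, 3)   # regression targets
w1 = torch.randn(8, 5)    # first linear layer, held fixed for this step

h = torch.relu(x @ w1.T)  # hidden activations after the nonlinearity

# Closed-form least squares: W2 = argmin_W || H W^T - Y ||^2
w2 = torch.linalg.lstsq(h, y).solution.T  # shape (3, 8)

residual = ((h @ w2.T - y) ** 2).mean()
```

The alternation then freezes this second layer and updates the first one, repeating until the loss stops improving; the exact scheme for the first layer is what the new module implements.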
Reviewed changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated 6 comments.
Summary per file:
| File | Description |
|---|---|
| tests/test_tensor_statistics.py | Renames the estimation-error statistic class usage to the corrected spelling. |
| tests/test_optimisation.py | New tests for the closed-form optimizer module and its helper functions. |
| src/gromo/utils/tensor_statistic.py | Adds type aliases for update functions and renames TensorStatisticWithEstimationError. |
| src/gromo/utils/optimisation.py | New closed-form optimizer implementation for 2-layer linear blocks. |
| docs/source/whats_new.rst | Adds an entry (currently contains unresolved merge-conflict markers). |
| docs/source/sphinxext/gh_substitutions.py | Adjusts gh_role definition (currently introduces mutable-default risk). |
| docs/source/conf.py | Minor doc config formatting / noqa cleanup. |
```python
update_function=lambda batch_tensor, batch_weight: (
    (batch_tensor * batch_weight.unsqueeze(1)).sum(dim=0),
    float(batch_weight.sum().item()),
),
```
These weighted-statistic update functions return float(batch_weight.sum().item()) as the sample count/mass. However, TensorStatistic and TensorStatisticWithEstimationError currently type and store samples as an int and expect nb_sample to be an integer count. Either change the statistics helpers to support a float-valued sample mass (and type it accordingly), or return an int count and track the weight mass separately; otherwise the typing (Pyright/Pylance) and the semantics of samples become inconsistent.
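A minimal sketch of the first option the comment suggests (a float-valued sample mass). The class and attribute names here are hypothetical stand-ins, not the actual TensorStatistic API:

```python
# Hypothetical accumulator tracking a float-valued sample mass instead of
# an int count, as the review comment suggests. Illustrative only.
import torch


class WeightedMeanStatistic:
    def __init__(self) -> None:
        self.total: torch.Tensor | None = None  # running weighted sum
        self.mass: float = 0.0                  # float mass, not an int count

    def update(self, batch_tensor: torch.Tensor, batch_weight: torch.Tensor) -> None:
        # Same shape convention as the quoted update_function above.
        weighted = (batch_tensor * batch_weight.unsqueeze(1)).sum(dim=0)
        self.total = weighted if self.total is None else self.total + weighted
        self.mass += float(batch_weight.sum().item())

    def mean(self) -> torch.Tensor:
        assert self.total is not None, "update() must be called first"
        return self.total / self.mass


stat = WeightedMeanStatistic()
stat.update(torch.ones(4, 2), torch.full((4,), 0.5))
```

With this normalization, the accumulated mean is correct even when per-sample weights do not sum to the batch size.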
```python
StatisticUpdateResult: TypeAlias = tuple[torch.Tensor, int]
StatisticUpdateFunction: TypeAlias = Callable[..., StatisticUpdateResult]
```
StatisticUpdateResult fixes the returned sample count to int, but the new closed-form optimizer uses TensorStatisticWithEstimationError to accumulate weighted moments where the natural normalization is a float-valued sample mass. To avoid type-checker errors and clarify semantics, consider changing this alias (and TensorStatistic.samples) to accept float (or float | int).
@copilot apply changes based on this feedback
```python
def optimize(
    self,
    block: nn.Module | TwoLayerLinearBlockView,
    dataloader: list[Any] | Any,
) -> OptimizationResult:
```
ClosedFormBlockOptimizer.optimize iterates over dataloader multiple times (once per alternating step, plus a final pass). If callers pass an iterator/generator (single-pass), subsequent iterations will silently see no batches and raise/produce incorrect updates. Consider requiring a re-iterable Iterable/DataLoader (and document it), or materialize/copy batches when a one-shot iterator is provided.
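One possible guard, sketched below; the helper name is hypothetical and the `Iterator` check is just one heuristic for detecting single-pass inputs (a list or a typical DataLoader passes through unchanged):

```python
# Sketch of the suggested guard: materialize a one-shot iterator so the
# optimizer's repeated passes all see the batches. Illustrative only.
from collections.abc import Iterable, Iterator


def ensure_reiterable(dataloader: Iterable) -> Iterable:
    if isinstance(dataloader, Iterator):  # generators/iterators are single-pass
        return list(dataloader)           # copy the batches once
    return dataloader                     # lists and DataLoaders re-iterate fine


batches = ensure_reiterable(b for b in range(3))
first_pass = list(batches)
second_pass = list(batches)  # non-empty thanks to the copy
```

Alternatively, the docstring could simply require a re-iterable input and raise on a second empty pass.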
@copilot apply changes based on this feedback
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
sylvchev
left a comment
An important PR; we could iterate in the next push to update the code if needed, but it seems mature enough. On top of the tests, it could be interesting to add an example demonstrating how to use the code.
Add optimisation.py and its corresponding tests.
This PR introduces closed-form optimization utilities for Linear -> activation -> Linear blocks, along with a dedicated test suite covering the main supported layouts and optimization paths.
It also includes a small typing update in TensorStatistic to suppress a Pylance error by broadening the accepted update_function callable signature.