
Add eigsolve-style rrule for CTMRG fixed-point gradient#126

Merged
leburgel merged 8 commits into master from lb/arnoldi_pullback
Feb 11, 2025
Conversation

@leburgel
Member

@leburgel leburgel commented Feb 3, 2025

@leburgel leburgel marked this pull request as draft February 3, 2025 22:34
@codecov

codecov bot commented Feb 3, 2025

Codecov Report

Attention: Patch coverage is 86.66667% with 2 lines in your changes missing coverage. Please review.

Files with missing lines | Patch % | Lines
...rithms/optimization/fixed_point_differentiation.jl | 86.66% | 2 Missing ⚠️

Files with missing lines | Coverage Δ
src/PEPSKit.jl | 87.50% <ø> (ø)
...rithms/optimization/fixed_point_differentiation.jl | 94.73% <86.66%> (-1.52%) ⬇️

... and 1 file with indirect coverage changes

@leburgel
Member Author

leburgel commented Feb 5, 2025

I had a go at adding this after a suggestion from @Jutho, for now mainly just so I didn't forget about it again. From what I can tell solving the fixed-point gradient linear problem like this is actually very fast and stable for all the examples. It's also the method of choice for the VUMPS pullback in VUMPSAutoDiff.jl.

However, there seems to be a problem with the gradient tests for iterscheme==:diffgauge. One possibility is that some residual gauge freedom is actually left, so the modified eigenvalue problem fails because there is no unique eigenvector with eigenvalue 1. I don't really understand what is going on, so I don't know of an easy fix, but I thought I should leave this here as a suggestion.

Do you have an idea why diffgauge would give so much trouble here @pbrehmer?

@pbrehmer
Collaborator

pbrehmer commented Feb 5, 2025

This is super interesting, thanks for adding this. I will try to take a closer look in the next few days.

Regarding :diffgauge: I am kind of surprised by these issues since I thought that :diffgauge is in principle more stable. In that scheme, we differentiate through the gauge fixing itself so I would expect AD to take care of any residual gauge freedom - but perhaps I'm wrong.

@pbrehmer
Collaborator

I noticed a few things but I can't claim I have really understood what's going on. The problem seems to be that :diffgauge, as you said, doesn't converge properly and outputs an eigenvalue different from 1. When looking at the output, I noticed that the :fixed mode converges after expanding the Krylov subspace to the specified dimension, whereas :diffgauge needs multiple Schur solves before it stops.

So when choosing :diffgauge, it seems that the Krylov dimension really matters. KrylovKit defaults to krylovdim=30 which might be too high in general - reducing the dimension to e.g. krylovdim=8 really speeds up the Arnoldi convergence and it will actually converge to the correct gradient with eigenvalue 1. There seems to be a certain Krylov dimension above which the :diffgauge Arnoldi will break; below that, the Arnoldi convergence is comparable to the :fixed mode.

One can also enable eager=true so that the Krylov subspace is not expanded to the full specified dimension and this also seems to repair the :diffgauge mode and even speeds it up. Perhaps this also speeds up the :fixed mode.

The difference between :fixed and :diffgauge might be a stability thing? After all, with :diffgauge each application of f(X) will differentiate through an eigenvalue problem inside gauge_fix whereas :fixed really just boils down to a fixed SVD.
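For concreteness, here is a minimal runnable sketch of how the krylovdim and eager parameters discussed above enter a KrylovKit Arnoldi call. The toy 2×2 map is an assumption standing in for the actual fixed-point Jacobian; this is not PEPSKit code.

```julia
using KrylovKit

# Toy stand-in for the fixed-point Jacobian: a map whose leading eigenvalue
# is exactly 1, mimicking the modified eigenvalue problem discussed above.
A = [1.0 0.1; 0.0 0.5]
f(x) = A * x
x0 = randn(2)

# A smaller Krylov dimension than the KrylovKit default of krylovdim=30, and
# eager=true so convergence is checked at every iteration rather than only
# after the subspace has been expanded to the full specified dimension.
alg = Arnoldi(; krylovdim=8, tol=1e-10, eager=true)
vals, vecs, info = eigsolve(f, x0, 1, :LM, alg)
# For a well-posed problem, vals[1] should converge to 1.
```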

Collaborator

@pbrehmer pbrehmer left a comment


Thanks again for the addition, this really seems like an efficient approach to differentiating CTMRG! Perhaps this should be the new default? I will try to benchmark against LinSolver and see what seems best in a different PR.

@leburgel leburgel marked this pull request as ready for review February 11, 2025 09:05
@leburgel
Member Author

One can also enable eager=true so that the Krylov subspace is not expanded to the full specified dimension and this also seems to repair the :diffgauge mode and even speeds it up. Perhaps this also speeds up the :fixed mode.

Good catch, I had not thought of that! I added a default gradient_eigsolve which sets eager=true, and enabled eager mode in the tests using iterscheme=:fixed as well.

pbrehmer
pbrehmer previously approved these changes Feb 11, 2025
lkdvos
lkdvos previously approved these changes Feb 11, 2025
@leburgel leburgel dismissed stale reviews from lkdvos and pbrehmer via 1b438a4 February 11, 2025 15:22
@leburgel
Member Author

Sorry to dismiss the reviews, but I was maybe too quick in un-drafting this since there were two things that still needed to be addressed before merging:

  • I expanded the warning given when the norm of the auxiliary component vanishes. This basically means that either the eigsolve is unconverged or the Jacobian doesn't have a unique leading eigenvalue 1. I struggled a bit with the tolerance for this check; I picked the simplest option that seemed sensible, but maybe we can do better
  • I switched to use realeigsolve for the same reason we switched to reallinsolve before.

If this doesn't break anything, it should be good to go.
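As an illustration of the realeigsolve switch (again with a hypothetical toy map, not PEPSKit code; as I understand it, KrylovKit's realeigsolve shares eigsolve's positional signature but stays in real arithmetic, analogous to the earlier switch from linsolve to reallinsolve):

```julia
using KrylovKit

# Hypothetical real linear map standing in for the fixed-point Jacobian.
A = [1.0 0.1; 0.0 0.5]
f(x) = A * x
x0 = randn(2)

# realeigsolve refuses to leave real arithmetic, erroring out instead of
# silently producing complex eigenvalues or eigenvectors for a real problem.
vals, vecs, info = realeigsolve(f, x0, 1, :LM, Arnoldi(; eager=true))
```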

@leburgel leburgel enabled auto-merge (squash) February 11, 2025 15:57
@leburgel leburgel merged commit dcf1bb2 into master Feb 11, 2025
27 checks passed
@leburgel leburgel deleted the lb/arnoldi_pullback branch February 11, 2025 16:02
@leburgel leburgel mentioned this pull request Mar 12, 2025