Mooncake testsuite refactor by lkdvos · Pull Request #175 · QuantumKitHub/MatrixAlgebraKit.jl

lkdvos · 2026-02-19T20:56:59Z

This is a somewhat large refactor of the Mooncake test suite that hinges on the idea to wrap the in-place functions in such a way that they become admissible to the internal Mooncake testing framework. (Based on ideas suggested by @Jutho, thanks!)

The basic idea is that for a function f!(A, input, alg) we need to:

scratch the A space to not count finite-differences results
avoid generating random tangents for the input variables

I also somewhat reorganized the tests to get a slightly better overview of the mooncake tests.
Finally, this also allowed me to find some small mistakes in the mutating rules, which I have here fixed.
This is also relevant for #174 and #173.

add QR gauge projection

codecov · 2026-02-19T21:17:14Z

Codecov Report

❌ Patch coverage is 88.88889% with 3 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/pullbacks/qr.jl	81.81%	2 Missing ⚠️
src/pullbacks/lq.jl	90.00%	1 Missing ⚠️

Files with missing lines	Coverage Δ
...gebraKitMooncakeExt/MatrixAlgebraKitMooncakeExt.jl	`62.57% <100.00%> (+1.81%)`	⬆️
src/pullbacks/lq.jl	`96.00% <90.00%> (+0.10%)`	⬆️
src/pullbacks/qr.jl	`94.73% <81.81%> (-1.16%)`	⬇️

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

test/testsuite/mooncake/eig.jl

ext/MatrixAlgebraKitMooncakeExt/MatrixAlgebraKitMooncakeExt.jl

src/pullbacks/qr.jl

ext/MatrixAlgebraKitMooncakeExt/MatrixAlgebraKitMooncakeExt.jl

Co-authored-by: Jutho <Jutho@users.noreply.github.com>

test/testsuite/mooncake/mooncake.jl

test/testsuite/mooncake/lq.jl

kshyatt · 2026-02-23T10:22:29Z

I guess I really don't see the point of removing all the ad_*_setup functions given we'll need something similar to add support for Enzyme? Why not just keep them but allow alg to be passed?

test/testsuite/mooncake/eigh.jl

test/testsuite/mooncake/eig.jl

test/testsuite/chainrules.jl

kshyatt · 2026-02-24T17:07:32Z

Once this merges, I'll update the other PR

Jutho · 2026-02-25T21:43:18Z

src/pullbacks/lq.jl

            Q2 = view(Q, (p + 1):size(Q, 1), :)
            ΔQ2 = view(ΔQ, (p + 1):size(Q, 1), :)
            ΔQ2Q1ᴴ = ΔQ2 * Q1'
            check_lq_full_cotangents(Q1, ΔQ2, ΔQ2Q1ᴴ; gauge_atol)


For a rank-deficient matrix p < minmn, check_lq_full_cotangents can also be called by lq_compact (i.e. size(Q,1) == minmn for lq_compact and size(Q,1) == n for lq_full). Since this is not a change from the current PR, I will try to look into when this was introduced, and whether everything is still fully correct.

Does that mean we can go ahead with this one? That seems somewhat unrelated to the testsuite refactor anyways, so might be clearer in a separate PR anyways

Jutho · 2026-02-26T12:17:12Z

test/testsuite/ad_utils.jl

+    remove_eig_gauge_dependence!(ΔV, D, V)
+
+Remove the gauge-dependent part from the cotangent `ΔV` of the eigenvector matrix `V`. The
+eigenvectors are only determined up to complex phase (and unitary mixing for degenerate


In the eig case there can also be non-unitary mixing. I think the only convention respected by LAPACK is that all the eigenvectors have norm 1, but that condition cannot easily be transformed into a restriction onto a restriction on the matrices. In fact, even for non-degenerate eigenvalues we actually require v' * Δv (the diagonal element of V' * ΔV) to be completely zero, not just the imaginary part of it (the imaginary part corresponds to phase rotations, the real part to norm changes).

Jutho · 2026-02-26T12:27:16Z

test/testsuite/ad_utils.jl

+    ΔU[:, (minmn + 1):end] .= 0
+    ΔVᴴ[(minmn + 1):end, :] .= 0


Do we also want to support rank deficient matrices, where we should use some p instead of minmn?

Jutho · 2026-02-26T12:29:04Z

test/testsuite/ad_utils.jl

+    Q₁ = @view Q[:, 1:r]
+    ΔQ₂ = @view ΔQ[:, (r + 1):end]


For consistency, since you do use regular view in the last line:

Suggested change

Q₁ = @view Q[:, 1:r]

ΔQ₂ = @view ΔQ[:, (r + 1):end]

Q₁ = view(Q, :, 1:r)

ΔQ₂ = view(ΔQ, :, (r + 1):end)

Jutho · 2026-02-26T12:31:13Z

test/testsuite/ad_utils.jl

+ambiguity. Additionally, rows of `ΔR` beyond the rank are zeroed out.
+"""
+function remove_qr_gauge_dependence!(ΔQ, ΔR, A, Q, R)
+    r = MatrixAlgebraKit.qr_rank(R)


Do we want to support a rank_atol keyword to be passed along from remove_qr_gauge_dependence! to qr_rank?

Jutho · 2026-02-26T12:31:41Z

test/testsuite/ad_utils.jl

+    Q₁ = @view Q[1:r, :]
+    ΔQ₂ = @view ΔQ[(r + 1):end, :]


Suggested change

Q₁ = @view Q[1:r, :]

ΔQ₂ = @view ΔQ[(r + 1):end, :]

Q₁ = view(Q, 1:r, :)

ΔQ₂ = view(ΔQ, (r + 1):end, :)

Jutho · 2026-02-26T12:32:05Z

test/testsuite/ad_utils.jl

+Additionally, columns of `ΔL` beyond the rank are zeroed out.
+"""
+function remove_lq_gauge_dependence!(ΔL, ΔQ, A, L, Q)
+    r = MatrixAlgebraKit.lq_rank(L)


Same question about rank_atol

Jutho · 2026-02-26T12:33:30Z

test/testsuite/ad_utils.jl

+    Q, _ = qr_compact(A)
+    mul!(ΔN, Q, Q' * ΔN)
+    return ΔN


Any reason to not simply pass this on to remove_qr_null_gauge_dependence in order to avoid code duplication?

Jutho · 2026-02-26T12:33:51Z

test/testsuite/ad_utils.jl

+null space basis is only determined up to a unitary rotation, so `ΔNᴴ` is projected onto the
+row span of the compact LQ factor `Q₁` of `A`.
+"""
+function remove_right_null_gauge_dependence!(ΔNᴴ, A, Nᴴ)


Same question with remove_lq_null_gauge_dependence!?

Jutho · 2026-02-26T12:52:01Z

test/testsuite/ad_utils.jl

-    ΔD2 = Diagonal(randn!(similar(A, complex(T), m)))
-    return DV, (ΔD, ΔV), (ΔD2, ΔV)
+    ΔV = remove_eig_gauge_dependence!(ΔV, D, V)
+    ΔD = Diagonal(randn!(similar(A, complex(T), m)))


Suggested change

ΔD = Diagonal(randn!(similar(A, complex(T), m)))

ΔD = Diagonal(randn!(similar(diagview(D))))

Jutho · 2026-02-26T12:53:23Z

test/testsuite/ad_utils.jl

    ΔV = randn!(similar(A.diag, T, m, m))
-    ΔV = remove_eiggauge_dependence!(ΔV, D, V)
+    ΔV = remove_eig_gauge_dependence!(ΔV, D, V)
    ΔD = Diagonal(randn!(similar(A.diag, T, m)))


Suggested change

ΔD = Diagonal(randn!(similar(A.diag, T, m)))

ΔD = Diagonal(randn!(similar(D.diag)))

or also using diagview(D) if we don't want to access .diag.

Jutho · 2026-02-26T12:54:11Z

test/testsuite/ad_utils.jl

-    ΔD2 = Diagonal(randn!(similar(A, real(T), m)))
-    return DV, (ΔD, ΔV), (ΔD2, ΔV)
+    ΔV = remove_eigh_gauge_dependence!(ΔV, D, V)
+    ΔD = Diagonal(randn!(similar(A, real(T), m)))


Suggested change

ΔD = Diagonal(randn!(similar(A, real(T), m)))

ΔD = Diagonal(randn!(similar(diagview(D))))

Jutho · 2026-02-26T12:54:54Z

test/testsuite/ad_utils.jl

    m, n = size(A)
    T = complex(eltype(A))
    D = eig_vals(A)
    ΔD = randn!(similar(A, complex(T), m))


Suggested change

ΔD = randn!(similar(D))

Jutho · 2026-02-26T12:55:21Z

test/testsuite/ad_utils.jl

    m, n = size(A)
    T = complex(eltype(A))
    D = eig_vals(A)
    ΔD = randn!(similar(A.diag, T, m))


Suggested change

ΔD = randn!(similar(D))

Jutho · 2026-02-26T12:56:57Z

test/testsuite/ad_utils.jl

    m, n = size(A)
    T = eltype(A)
    D = eigh_vals(A)
    ΔD = randn!(similar(A, real(T), m))


Suggested change

ΔD = randn!(similar(D))

Jutho · 2026-02-26T13:01:07Z

test/testsuite/ad_utils.jl

 function ad_svd_compact_setup(A)
    m, n = size(A)
    T = eltype(A)
    minmn = min(m, n)
    ΔU = randn!(similar(A, T, m, minmn))
-    ΔS = randn!(similar(A, real(T), minmn, minmn))
-    ΔS2 = Diagonal(randn!(similar(A, real(T), minmn)))
+    ΔS = Diagonal(randn!(similar(A, real(T), minmn)))
    ΔVᴴ = randn!(similar(A, T, minmn, n))
    U, S, Vᴴ = svd_compact(A)
-    ΔU, ΔVᴴ = remove_svdgauge_dependence!(ΔU, ΔVᴴ, U, S, Vᴴ)
-    return (U, S, Vᴴ), (ΔU, ΔS, ΔVᴴ), (ΔU, ΔS2, ΔVᴴ)
+    ΔU, ΔVᴴ = remove_svd_gauge_dependence!(ΔU, ΔVᴴ, U, S, Vᴴ)
+    return (U, S, Vᴴ), (ΔU, ΔS, ΔVᴴ)
 end


Not sure why I cannot make a suggestion here, but anyway

function ad_svd_compact_setup(A) U, S, Vᴴ = svd_compact(A) ΔU = randn!(similar(U)) ΔVᴴ = randn!(similar(Vᴴ)) ΔS = Diagonal(randn!(similar(diagview(S)))) ΔU, ΔVᴴ = remove_svd_gauge_dependence!(ΔU, ΔVᴴ, U, S, Vᴴ) return (U, S, Vᴴ), (ΔU, ΔS, ΔVᴴ) end

That would probably even remove the need to special case A::Diagonal below.

Jutho · 2026-02-26T13:06:50Z

test/testsuite/ad_utils.jl

@@ -324,7 +440,7 @@
    U, S, Vᴴ = svd_full(A)


It seems a bit wasteful to do svd_compact in ad_svd_compact_setup, and the again svd_full here.

Jutho · 2026-02-26T13:20:18Z

test/testsuite/ad_utils.jl

-    diagview(ΔSfull)[1:minmn] .= diagview(ΔS2)
+    diagview(ΔSfull)[1:minmn] .= diagview(ΔS)
    return (U, S, Vᴴ), (ΔUfull, ΔSfull, ΔVᴴfull)
 end


function ad_svd_full_setup(A) U, S, Vᴴ = svd_full(A) ΔU = randn!(similar(U)) ΔVᴴ = randn!(similar(Vᴴ)) ΔS = zero(S) rand!(diagview(ΔS)) # although I think nonzero random contributions in all of ΔS would also just work fine ΔU, ΔVᴴ = remove_svd_gauge_dependence!(ΔU, ΔVᴴ, U, S, Vᴴ) return (U, S, Vᴴ), (ΔU, ΔS, ΔVᴴ) end

Jutho · 2026-02-26T13:21:38Z

test/testsuite/ad_utils.jl

    m, n = size(A)
    T = eltype(A)
    WP = left_polar(A)
    ΔWP = (randn!(similar(A, T, m, n)), randn!(similar(A, T, n, n)))


Suggested change

ΔWP = randn!.(similar.(WP))

Since this is not about this PR, I should probably make some changes to ad_utils.jl in a separate PR.

Jutho

I left some suggestions, but I already approve!

kshyatt · 2026-02-26T13:24:59Z

Yay congrats!

Jutho · 2026-02-26T13:26:22Z

I didn't realize automerge was in effect.

Jutho · 2026-02-26T13:35:22Z

I will try to make a PR with the ad_utils suggestions right away.

lkdvos added 13 commits February 19, 2026 09:12

small refactor QR pullback

c69b57b

add QR gauge projection

testsuite reorganisation

734f390

add QR mooncake tests

023cfed

Genius suggestion by @Jutho fixes everything

159da4d

Refactor Mooncake LQ tests

deda605

Refactor Mooncake Eig tests

98c9fc0

fix pullback implementations!

64a4246

Refactor Mooncake SVD tests

8b209c2

Refactor Mooncake Polar tests

f8d7cc6

make testsets verbose

552559f

Refactor Mooncake OrthNull tests

9161bde

clean up

6550d8b

rename call_and_zero!

b3329de

kshyatt reviewed Feb 20, 2026

View reviewed changes

test/testsuite/mooncake/eig.jl Outdated Show resolved Hide resolved

lkdvos added 2 commits February 20, 2026 10:09

move gauge dependence removal to ad_utils again

61e8061

separate out mooncake tests

0de37f0

lkdvos mentioned this pull request Feb 22, 2026

AD rules for (anti-) hermitian projection #174

Merged

Jutho reviewed Feb 22, 2026

View reviewed changes

ext/MatrixAlgebraKitMooncakeExt/MatrixAlgebraKitMooncakeExt.jl Show resolved Hide resolved

Jutho reviewed Feb 22, 2026

View reviewed changes

src/pullbacks/qr.jl Outdated Show resolved Hide resolved

Jutho reviewed Feb 22, 2026

View reviewed changes

ext/MatrixAlgebraKitMooncakeExt/MatrixAlgebraKitMooncakeExt.jl Show resolved Hide resolved

Update src/pullbacks/qr.jl

2238bf8

Co-authored-by: Jutho <Jutho@users.noreply.github.com>