
fix: Parallel Leiden bugs #19

Open
adsharma wants to merge 22 commits into main from leiden_memory_convergence

Conversation

@adsharma
Contributor

Previous versions had several problems:

  • memory usage kept increasing
  • high lock contention and heavy lock memory allocation
  • the algorithm deviated from the original formulation and no longer converged

…ntention

Replace on-demand caching guarded by a single global mutex (29% runtime
overhead) with parallel pre-computation at construction time. Use a dense
vector instead of an unordered_map for O(1) lock-free neighbor lookups
during algorithm execution.
- Return the node move count from parallelMove() for progress monitoring
- Add INFO logs showing the inner iteration number, nodes moved, and community count
- Add a max inner iterations safety limit (100), with a log message when it is reached
- Remove broken early-termination checks that compromised algorithm correctness
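The changes above amount to a capped local-moving loop. A minimal C++ sketch (the name `runLocalMoving` and the callable shape of `parallelMove` are hypothetical; only the move count returned by parallelMove() and the 100-iteration cap come from the commit message):

```cpp
#include <cstdint>
#include <functional>

// Safety cap on inner iterations, per the commit message.
constexpr int maxInnerIterations = 100;

// Run the local-moving phase until no node moves or the cap is hit.
// parallelMove is assumed to return the number of nodes moved this pass.
uint64_t runLocalMoving(const std::function<uint64_t()>& parallelMove) {
    uint64_t totalMoved = 0;
    for (int iter = 0; iter < maxInnerIterations; ++iter) {
        uint64_t moved = parallelMove();
        totalMoved += moved;
        if (moved == 0)  // converged: no node improved its community
            break;
    }
    return totalMoved;
}
```

Returning the per-pass move count is what makes both the convergence check and the INFO progress logs possible.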

Pre-computing neighbor lists for millions of supernodes was consuming
too much memory. Switch back to on-demand computation without caching.

Tradeoff:
- Memory: Much lower (no pre-computed cache)
- Performance: ~2-3x slower (recompute neighbors on each access)

This allows the algorithm to run on large graphs without OOM.
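The on-demand side of the tradeoff can be sketched as follows (hypothetical names and graph representation; the point is that each call re-aggregates a supernode's neighbors from the underlying graph instead of reading a pre-computed cache, so no cache memory and no locks, at the cost of repeated CPU work):

```cpp
#include <cstdint>
#include <unordered_map>
#include <vector>

using Node = uint64_t;
struct Edge { Node target; double weight; };

// adjacency: original-graph adjacency lists; members: original nodes
// inside the supernode; community: original node -> supernode id.
std::unordered_map<Node, double> supernodeNeighbors(
        const std::vector<std::vector<Edge>>& adjacency,
        const std::vector<Node>& members,
        const std::vector<Node>& community) {
    std::unordered_map<Node, double> result;  // neighbor supernode -> total weight
    for (Node u : members)
        for (const Edge& e : adjacency[u])
            result[community[e.target]] += e.weight;
    return result;  // recomputed on every access: nothing is cached
}
```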

The coarsenedGraphs vector was accumulating all intermediate graph views,
each holding 4GB+ of memory (nodeMapping + supernodeToOriginal for 115M nodes).

Fix: Only keep current coarsened view, not historical ones.
The 'mappings' vector already stores everything needed for flattenPartition().

Instead of storing all intermediate mappings (each ~920MB for 115M nodes),
compose the mappings incrementally, keeping only one mapping at a time.
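Incremental composition can be sketched like this (hypothetical `composeInPlace`; the idea is that one original-node-to-current-supernode mapping is folded together with each level's mapping, so only a single ~920MB vector is ever live):

```cpp
#include <cstdint>
#include <vector>

using Node = uint64_t;

// composed: original node -> current supernode (one entry per original node).
// levelMapping: current supernode -> supernode at the next coarsening level.
// After the call, composed maps original nodes directly to the new level,
// and levelMapping can be discarded.
void composeInPlace(std::vector<Node>& composed,
                    const std::vector<Node>& levelMapping) {
    for (Node& target : composed)
        target = levelMapping[target];
}
```

flattenPartition() then only needs the single composed mapping rather than the whole history of per-level mappings.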

Memory usage is reduced, but the algorithm still OOMs due to:
- parallelRefine allocates 13M mutexes per iteration (~832MB)
- parallelMove allocates 115M atomics for inQueue (~115MB)
- CoarsenedGraphView uses ~4GB per level

Further optimization needed for these components.

After each coarsening step, compact the result partition to remap
community IDs to a contiguous range. This reduces the upper bound
from ~115M (the number of original nodes) to ~13M (the number of communities),
allowing smaller vectors for communityVolumes and per-thread cutWeights.

Also recalculate volumes after compacting to ensure correctness.

This reduces peak memory from >64GB (OOM) to ~40GB.
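Compaction can be sketched as a single renumbering pass (hypothetical `compactPartition`; the returned count is what lets communityVolumes and the per-thread cutWeights vectors be sized by ~13M communities instead of ~115M nodes, and volumes are recomputed afterwards as the commit notes):

```cpp
#include <cstdint>
#include <unordered_map>
#include <vector>

using Index = uint64_t;

// Remap arbitrary community IDs to a contiguous 0..k-1 range, in place.
// Returns k, the number of distinct communities.
Index compactPartition(std::vector<Index>& partition) {
    std::unordered_map<Index, Index> remap;
    for (Index& c : partition) {
        // First time we see this ID, assign it the next contiguous slot.
        auto [it, inserted] = remap.try_emplace(c, remap.size());
        c = it->second;
    }
    return remap.size();
}
```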

When countSelfLoopsTwice=true, self-loops should be counted twice
in the weighted degree (matching Graph::weightedDegree behavior).
This was causing incorrect volume calculations in the coarsened view.
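The convention can be sketched as follows (hypothetical free function; only the double-counting rule for countSelfLoopsTwice, matching Graph::weightedDegree, comes from the commit message):

```cpp
#include <cstdint>
#include <vector>

using Node = uint64_t;
struct Edge { Node target; double weight; };

// Weighted degree (volume contribution) of node u given its incident edges.
// With countSelfLoopsTwice, a self-loop contributes its weight twice,
// since both of its endpoints attach to u.
double weightedDegree(const std::vector<Edge>& neighbors, Node u,
                      bool countSelfLoopsTwice) {
    double degree = 0.0;
    for (const Edge& e : neighbors) {
        degree += e.weight;
        if (countSelfLoopsTwice && e.target == u)
            degree += e.weight;  // self-loop counted a second time
    }
    return degree;
}
```

Counting a self-loop once instead of twice understates a node's volume, which is why the coarsened view's volume calculations came out wrong.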