
Make Prio queues more compact #118

Merged
treeowl merged 8 commits into lspitzner:master from treeowl:compact-prio
May 31, 2023

Conversation

@treeowl
Collaborator

@treeowl treeowl commented May 4, 2023

Store the value associated with each key as its rightmost child,
which saves one word per element.

As a result, the binomial trees must become lazy, which should be
good for maps and lazy traversals. The downside is that we will
need tag checks to know that we have realized `Succ` constructors.

Benchmarks indicate this improves performance.

Closes #115.
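As a rough illustration of the layout change (type and constructor names here are invented for the sketch, not the library's actual definitions): in the old layout each element node carries a separate value field plus a pointer to a shared nullary `Zero` leaf; in the new one, the rank-0 child slot itself holds the value.

```haskell
-- Hypothetical sketch of the representation change (names invented
-- for illustration; not the actual pqueue definitions).

-- Old layout: every element node stores its key, its value, and a
-- child structure; at rank 0 the child structure is a nullary Zero
-- shared by every element.
data ZeroOld k a = ZeroOld
data TreeOld rk k a = TreeOld !k a (rk k a)   -- header + 3 fields

-- New layout: the value *is* the rightmost (rank-0) child, so the
-- separate value field disappears; the newtype adds no runtime box.
newtype ZeroNew k a = ZeroNew a
data TreeNew rk k a = TreeNew !k (rk k a)     -- header + 2 fields

-- Extracting the value of a rank-0 tree in each representation:
valueOld :: TreeOld ZeroOld k a -> a
valueOld (TreeOld _ v ZeroOld) = v

valueNew :: TreeNew ZeroNew k a -> a
valueNew (TreeNew _ (ZeroNew v)) = v
```

Under GHC's usual layout (one header word plus one word per boxed field), an old-style node takes four words and a new-style node three, which is where a one-word-per-element saving would come from; the rank-0 child field must stay lazy to keep values lazy, which is why the trees as a whole become lazy.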

@treeowl treeowl force-pushed the compact-prio branch 5 times, most recently from 5e6790d to 87bdf15 on May 4, 2023 22:07
@treeowl treeowl requested a review from konsumlamm May 4, 2023 22:08
@treeowl
Collaborator Author

treeowl commented May 4, 2023

@konsumlamm This is obviously not quite ready to actually merge, but before I make it so, I was hoping you'd give it a glance and see if it's something you'd be okay with if indeed it improves performance.

@treeowl treeowl force-pushed the compact-prio branch 4 times, most recently from c2df822 to 88c8c90 on May 4, 2023 22:25
@treeowl
Collaborator Author

treeowl commented May 4, 2023

Note: along with reducing the total queue size by $n$ words, this reduces the number of words allocated on insertion by $2$ (amortized) and the number of words allocated on deletion by $O(\log n)$, thanks to the slightly more compact Cons nodes.

[Comment thread on src/Data/PQueue/Min.hs]
@treeowl
Collaborator Author

treeowl commented May 7, 2023

@konsumlamm I believe this is ready to go.

@treeowl treeowl force-pushed the compact-prio branch 3 times, most recently from e77449e to 11e01c4 on May 7, 2023 05:30
@treeowl
Collaborator Author

treeowl commented May 7, 2023

I guess there's still the question of whether to let maps be lazy (better for performance in general) or whether to make them stricter (maintaining theoretical worst-case bounds). I'd lean toward the former in this instance, but I hate having to decide.

@treeowl
Collaborator Author

treeowl commented May 15, 2023

@konsumlamm, this still awaits your comments.

@treeowl
Collaborator Author

treeowl commented May 21, 2023

@konsumlamm It's been two weeks now.

Collaborator

@konsumlamm konsumlamm left a comment


Sorry for taking so long to review; this is not a simple PR.

I find the Nattish approach a bit confusing, but the numbers speak for themselves. I'm actually surprised that the benchmarks improve so much, though: all that really changed is Zero, wasn't it? I'd expect `data Zero k a = Zero` to be a single global value, so why does this make such a big difference?

Did you benchmark the folds? They look like they might get slower with this change.

[Comment threads on src/Data/PQueue/Prio/Internals.hs]
@treeowl
Collaborator Author

treeowl commented May 21, 2023

I don't remember the numbers I got from benchmarks. What did they look like to you? The space savings aren't in the Zero value itself; they're in reclaiming the space we previously used to point to it (all over the place) and using that space to store attached values instead.

@konsumlamm
Collaborator

> I don't remember the numbers I got from benchmarks. What did they look like to you?

The benchmarks take 10-20% less time for me.

> The space savings aren't in the Zero value itself; they're in reclaiming the space we previously used to point to it (all over the place) and using that space to store attached values instead.

I see, so we save one word per tree. This is $\log n$ in total though, not $n$, as you said, right?

@konsumlamm
Collaborator

Ok, the folds seem to improve as well.

@treeowl
Collaborator Author

treeowl commented May 21, 2023

> I see, so we save one word per tree. This is $\log n$ in total though, not $n$, as you said, right?

Incorrect. You can count out the Zeros in a tree of a given size to get a better understanding. Or you can just notice that we have enough room to fit all $n$ of the values where we used to have nullary Zeros.
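The count can be made concrete with a small sketch (illustration only, not library code): every element node's child list terminates in exactly one Zero, so a rank-$r$ tree, which holds $2^r$ elements, also holds $2^r$ Zeros.

```haskell
-- Sketch: count the Zero leaves in a binomial tree of rank r.
-- A rank-r tree is one element node whose children are trees of
-- ranks r-1, r-2, ..., 0, plus the node's own terminating Zero
-- at the end of its child list.
zeros :: Int -> Integer
zeros r = 1 + sum [zeros i | i <- [0 .. r - 1]]

-- Number of elements in a rank-r binomial tree.
elems :: Int -> Integer
elems r = 2 ^ r
```

Since `zeros r == elems r` for every rank, a queue of $n$ elements contains $n$ Zeros, and repurposing each one to hold a value recovers one word per element, not one per tree.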

@konsumlamm
Collaborator

> I see, so we save one word per tree. This is $\log n$ in total though, not $n$, as you said, right?

> Incorrect. You can count out the Zeros in a tree of a given size to get a better understanding. Or you can just notice that we have enough room to fit all $n$ of the values where we used to have nullary Zeros.

Ah right, the Zeros are the leaves, not the roots, that's where my mistake was.

@konsumlamm
Collaborator

> I guess there's still the question of whether to let maps be lazy (better for performance in general) or whether to make them stricter (maintaining theoretical worst-case bounds). I'd lean toward the former in this instance, but I hate having to decide.

You mean lazy in the spine? What is the current situation? Does this PR change that?

@treeowl
Collaborator Author

treeowl commented May 21, 2023

What was off were my allocation calculations, which I believe I overstated. I'll have to recalculate them if anyone cares.

@treeowl
Collaborator Author

treeowl commented May 21, 2023

Lazy all through. If we want to be strict instead, I can do that, but the code will be more complicated and, I expect, mostly slower.

@konsumlamm
Collaborator

So this basically reverts #103? Did you change your mind on #100? I'd be fine with lazy maps, if you say that's more efficient.

@treeowl
Collaborator Author

treeowl commented May 21, 2023

Well ... for now it keeps the plain (non-Prio) ones the same, as the strict constructors are better there. It more than reverts that for Prio queues, since it extends laziness into the trees (not just the spine). It all falls out of the representation change, which leads to different trade-offs.

@konsumlamm
Collaborator

I don't like having different laziness for Prio and non-Prio queues.

Can you explain why the new representation necessitates fully lazy maps? Why can't we just keep the old implementation?

I'm a bit worried that fully lazy maps might lead to situations where you build up a lot of computations which slow down a subsequent deleteMin, or will that not happen?

@treeowl
Collaborator Author

treeowl commented May 22, 2023

It doesn't necessitate fully lazy maps. Keeping them strict in the spine is easy, but not very useful for predictable performance. Keeping them entirely strict in the structure is quite possible too, but it's slightly trickier. That said, I just realized it should be somewhat less tricky with the Nattish maps than the old ones. We just need to stop the strict mapping at Succy Zeroy, and be lazy at Zeroy. It's still more code, but should be pretty readable. I'll give it a whirl and see how that goes.
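For anyone following along, the Succy/Zeroy matching can be pictured with a stripped-down rank witness (a hypothetical sketch of the Nattish idea; the names and shapes here are assumptions for illustration, not the actual definitions):

```haskell
{-# LANGUAGE GADTs #-}

-- Hypothetical sketch: a singleton witness for the rank of a tree,
-- letting code branch at runtime on whether the rank is zero or a
-- successor.
data Z
data S n

data Nattish n where
  Zeroy :: Nattish Z
  Succy :: Nattish n -> Nattish (S n)

rank :: Nattish n -> Int
rank Zeroy     = 0
rank (Succy n) = 1 + rank n

-- The boundary described above: while the witness matches Succy _,
-- the corresponding field holds tree structure that a strict map
-- would force; at Zeroy it holds a (lazy) value, so forcing stops.
holdsValue :: Nattish n -> Bool
holdsValue Zeroy = True
holdsValue _     = False
```

The point of the witness is that a structure-strict map can recurse eagerly as long as it sees `Succy`, and switch to the lazy case exactly when it reaches `Zeroy`.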

@konsumlamm
Collaborator

> Keeping them strict in the spine is easy, but not very useful for predictable performance.

So why was that useful before, but not anymore?

@treeowl
Collaborator Author

treeowl commented May 22, 2023

The old representation had entirely strict trees, which held lazy values, so calculating the spine strictly calculated everything (except the values) strictly. The new representation uses the same fields for values as for lists of trees, so those fields must be lazy. To calculate everything but the values strictly, we must calculate strictly down to Succy Zeroy, then calculate lazily. I suspect this won't be a big deal. With the old auxiliary class approach, it would've required an extra "layer" of computational structure, but I don't think we'll need that now.

Store the value associated with each key as its rightmost child,
which saves one word per element.

As a result, the binomial trees must become lazy, which should be
good for maps and lazy traversals. The downside is that we will
need tag checks to know that we have realized `Succ` constructors.

Benchmarking suggests this implementation is substantially faster
than the previous one.
treeowl added 2 commits May 22, 2023 22:22
* Make maps and unordered traversals build the structure
  eagerly again, and restore the key strictness of `mapKeysMonotonic`.

* Fix the documentation of `mapKeysMonotonic`. The given function need
  only be weakly monotonic; strict monotonicity is not required.
treeowl added 2 commits May 22, 2023 22:44
We no longer require the function to be *strictly* monotonic, so
we should test with weakly monotonic functions.
@treeowl
Collaborator Author

treeowl commented May 23, 2023

@konsumlamm I think maps should be back to the way they were now.

treeowl added 3 commits May 23, 2023 18:18
Document `mapU` and strengthen its test.
Collaborator

@konsumlamm konsumlamm left a comment


Great work!

[Comment thread on src/Data/PQueue/Prio/Internals.hs]
@treeowl
Collaborator Author

treeowl commented May 31, 2023

Sorry, I confused myself twice over. The allocation reduction is as big as I originally thought.

@treeowl treeowl merged commit e4fbf94 into lspitzner:master May 31, 2023
