Improve worst-case bounds by treeowl · Pull Request #26 · lspitzner/pqueue

treeowl · 2020-11-16T23:50:51Z

Previously, `minView` was amortized `O(log n)` but worst-case `O(n)`.
Improve that to amortized *and* worst-case `O(log n)` (ignoring the
impact of repeated applications of `mapMonotonic`). In informal
testing, these changes lead to large performance improvements.

* Previously, lots of things were suspended that didn't need to be.
  Document the actual laziness requirements with a debit invariant
  and be more eager where allowed.
* Rework `extractBin` to calculate the minimum on the way down instead
  of on the way up. This avoids building a chain of thunks that
  (if forced) actually rebuilds the queue.
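
The difference between deciding "on the way up" and "on the way down" can be sketched with a plain list instead of a binomial forest. This is only an analogy under that assumption, not the code from this PR; `extractUp` and `extractDown` are hypothetical names. The first version recurses before it decides anything, so each level's answer is a suspended computation that depends on the level below. The second carries the best candidate in a strict accumulator and leaves no suspensions behind.

```haskell
{-# LANGUAGE BangPatterns #-}

-- Hypothetical list analogy; pqueue's real extractBin walks a binomial forest.

-- "On the way up": recurse first, decide while returning. The pair is built
-- lazily, so forcing the minimum at the top forces every level below it.
extractUp :: Ord a => [a] -> Maybe (a, [a])
extractUp []       = Nothing
extractUp [x]      = Just (x, [])
extractUp (x : xs) = Just (if x <= m then (x, xs) else (m, x : rest))
  where
    -- Lazy pattern binding: nothing is evaluated until m or rest is demanded.
    (m, rest) = case extractUp xs of
      Just r  -> r
      Nothing -> error "unreachable: xs is non-empty here"

-- "On the way down": carry the smallest element seen so far in a strict
-- accumulator and collect the displaced elements as we go.
extractDown :: Ord a => [a] -> Maybe (a, [a])
extractDown []       = Nothing
extractDown (x : xs) = Just (go x [] xs)
  where
    go !m acc []       = (m, reverse acc)
    go !m acc (y : ys)
      | y < m     = go y (m : acc) ys   -- y is the new candidate; m is displaced
      | otherwise = go m (y : acc) ys
```

Both functions return the same minimum. The order of the remaining elements may differ between them in general, which does not matter when the remainder is treated as a bag, as it is in a heap.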

I chose to make the internal nodes of the binomial tree quite strict.
For most purposes, this is good. The only downside is that
`mapMonotonic` is now slower, since it cannot be "operationally fused"
with surrounding operations. This doesn't seem like a huge deal,
since I don't imagine mapping over priority queues is something
that happens all that much.

Closes #24

lspitzner

Looks good, all in all. Just some minor nitpicks and questions, in general this looks good to go.

But disclaimer: I did not benchmark this to any noteworthy degree, and I don't feel like I can make an educated comment on the "Debit invariants". So I am a bit lost on the exact implications of the strictness changes, sorry.

If you have any expressions to test the performance with / highlight the changes I can re-test them and play around a bit. But I would be fine with merging it; the optimisations make sense and otherwise I trust your judgement.

lspitzner · 2020-11-26T23:46:10Z

+-- The BinomTree and Succ constructors are entirely strict, primarily because
+-- that makes it easier to make sure everything is as strict as it should
+-- be. The downside is that this slows down `mapMonotonic`. If that's important,
+-- we can do all the forcing manually; it will be a pain.


I think this is fine. I tried looking for uses of `mapMonotonic` in the wild but could not find any (at least not in the pqueue reverse dependencies on Hackage or on GitHub, although the GitHub search may not be accurate). Anyway, I don't think anyone will use `mapMonotonic` in some (inner) loop.

For key-value queues, this also affects fmap, but again, that's kind of a niche operation for a priority queue.
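
To make the trade-off concrete, here is a minimal sketch of how strict constructor fields change what a map does. The types `LazyChain` and `StrictChain` and every function name below are hypothetical stand-ins, not pqueue's `BinomTree` or `Succ`. With lazy fields, mapping only allocates suspensions and the function runs when an element is demanded. With strict fields, building the first constructor forces the mapped element and the whole mapped tail, so the traversal happens up front.

```haskell
import Control.Exception (SomeException, evaluate, try)

-- Hypothetical stand-ins for tree nodes; not pqueue's actual types.
data LazyChain a   = LNil | LCons a (LazyChain a)
data StrictChain a = SNil | SCons !a !(StrictChain a)

mapL :: (a -> b) -> LazyChain a -> LazyChain b
mapL _ LNil        = LNil
mapL f (LCons x r) = LCons (f x) (mapL f r)   -- both fields stay suspended

mapS :: (a -> b) -> StrictChain a -> StrictChain b
mapS _ SNil        = SNil
mapS f (SCons x r) = SCons (f x) (mapS f r)   -- both fields are forced here

-- A function that fails on one element, used only to observe forcing.
boom :: Int -> Int
boom n = if n == 3 then error "element 3 was forced" else n + 1

-- True if evaluating the argument to weak head normal form succeeds.
whnfOk :: a -> IO Bool
whnfOk x = either onErr (const True) <$> try (evaluate x)
  where
    onErr :: SomeException -> Bool
    onErr _ = False

-- First component: lazy map; second component: strict map.
demo :: IO (Bool, Bool)
demo = do
  lazyOk   <- whnfOk (mapL boom (LCons 1 (LCons 2 (LCons 3 LNil))))
  strictOk <- whnfOk (mapS boom (SCons 1 (SCons 2 (SCons 3 SNil))))
  pure (lazyOk, strictOk)
```

Running `demo` yields `(True, False)`: the lazy map reaches weak head normal form without touching the third element, while the strict map has to apply the function to every element before it can return its first constructor.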

treeowl · 2020-11-28T00:28:38Z

> Looks good, all in all. Just some minor nitpicks and questions, in general this looks good to go.
>
> But disclaimer: I did not benchmark this to any noteworthy degree, and I don't feel like I can make an educated comment on the "Debit invariants". So I am a bit lost on the exact implications of the strictness changes, sorry.
>
> If you have any expressions to test the performance with / highlight the changes I can re-test them and play around a bit. But I would be fine with merging it; the optimisations make sense and otherwise I trust your judgement.

For the new approach to forcing on insert, look at the Hinze-Paterson finger tree paper. If you just build a new thunk around the old child thunk, you get thunk chains with `O(n)` worst-case resolution. An easy test is to build a big heap (some millions or tens of millions of elements) with `insert`, evaluate it, print a message, then print `take 5 . toAscList` applied to the heap. Previously, there'd be a significant lag (a few seconds) between the two outputs. With these changes, there is no lag.
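
The test described above could be written roughly as follows. This is a sketch that assumes the `pqueue` package is installed; `buildByInsert` and `lagTest` are hypothetical names, and the insertion order (ascending) is an arbitrary choice since the comment does not specify one. `evaluate` forces the queue only to weak head normal form, which is the point: any work still suspended inside the queue shows up as a pause between the two outputs.

```haskell
import Control.Exception (evaluate)
import Data.List (foldl')
import qualified Data.PQueue.Min as PQ

-- Build a queue of n elements using nothing but repeated insert.
buildByInsert :: Int -> PQ.MinQueue Int
buildByInsert n = foldl' (flip PQ.insert) PQ.empty [1 .. n]

-- Evaluate the queue, print a marker, then demand the five smallest elements.
-- Any delay between the marker and the list is work that was left suspended.
lagTest :: Int -> IO [Int]
lagTest n = do
  q <- evaluate (buildByInsert n)
  putStrLn "queue built"
  let firstFive = take 5 (PQ.toAscList q)
  print firstFive
  pure firstFive
```

Calling `lagTest 10000000` from `main` or GHCi and watching the gap between the two lines of output reproduces the experiment. The sketch makes no timing claims of its own; the figures quoted above are the author's.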

treeowl · 2020-11-28T00:43:17Z

Could you explain what I was unclear about regarding the debit invariant? That's kind of important to understanding what's going on!

treeowl · 2021-08-24T20:27:39Z

I think that should address all your concerns.

lspitzner · 2021-12-06T09:14:21Z

I have pushed a rebased version of this branch to this repository; there was only a very simple conflict. I can force-push to this branch as well, I just don't like to do that out of the blue.

lspitzner

LGTM

konsumlamm · 2021-12-06T13:58:52Z

-    | minKey `lt` x -> incrExtract' le t ex
-  _                 -> Extract x ts (Skip f)
-  where a `lt` b = not (b `le` a)
+extractBin le0 = start le0


Is it more efficient to pass around le0 instead of using it directly in the helper functions? At least the latter would be easier to read imo.

I have no idea why we don't just use Ord constraints to get specialization. Let's deal with that in a different PR.
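
For readers unfamiliar with the three styles being discussed, here is a hedged sketch on a plain list minimum rather than on `extractBin`. All names are hypothetical. It shows (1) passing the comparison to top-level helpers explicitly, (2) local helpers that close over it, and (3) an `Ord` constraint with a `SPECIALIZE` pragma. It does not measure or claim which one GHC compiles to faster code.

```haskell
{-# LANGUAGE BangPatterns #-}

-- (1) The comparison is threaded through top-level helpers as an argument,
-- in the spirit of `extractBin le0 = start le0`.
minPassing :: (a -> a -> Bool) -> [a] -> Maybe a
minPassing le0 = startP le0

startP :: (a -> a -> Bool) -> [a] -> Maybe a
startP _  []       = Nothing
startP le (x : xs) = Just (goP le x xs)

goP :: (a -> a -> Bool) -> a -> [a] -> a
goP _  !m []       = m
goP le !m (y : ys) = goP le (if m `le` y then m else y) ys

-- (2) Local helpers close over the comparison instead of receiving it.
minClosing :: (a -> a -> Bool) -> [a] -> Maybe a
minClosing le = start
  where
    start []       = Nothing
    start (x : xs) = Just (go x xs)
    go !m []       = m
    go !m (y : ys) = go (if m `le` y then m else y) ys

-- (3) An Ord constraint; the pragma asks GHC for a copy specialized to Int.
minOrd :: Ord a => [a] -> Maybe a
minOrd []       = Nothing
minOrd (x : xs) = Just (go x xs)
  where
    go !m []       = m
    go !m (y : ys) = go (if m <= y then m else y) ys
{-# SPECIALIZE minOrd :: [Int] -> Maybe Int #-}
```

All three compute the same result, for example `minPassing (<=) [3, 1, 2]`, `minClosing (<=) [3, 1, 2]`, and `minOrd [3, 1, 2]` each give `Just 1`.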

Previously, `minView` was amortized `O(log n)` but worst-case `O(n)`. Improve that to amortized *and* worst-case `O(log n)` (ignoring the impact of repeated applications of `mapMonotonic`). In informal testing, these changes lead to large performance improvements.

* Previously, lots of things were suspended that didn't need to be. Document the actual laziness requirements with a debit invariant and be more eager where allowed.
* Rework `extractBin` to calculate the minimum on the way down instead of on the way up. This avoids building a chain of thunks that (if forced) actually rebuilds the queue.

I chose to make the internal nodes of the binomial tree quite strict. For most purposes, this is good. The only downside is that `mapMonotonic` is now slower, since it cannot be "operationally fused" with surrounding operations. This doesn't seem like a huge deal, since I don't imagine mapping over priority queues is something that happens all that much.

* Force on cascade in `insertMin`.
* Expand the strictification to key-value queues. This should be good for performance of everything except `mapKeysMonotonic`, `mapWithKey`, and `fmap`.

Closes lspitzner#24

Improve list conversion

* Implement a strictly accumulating `fromAscList`.
* Use one less comparison per element in `fromList`.

Make min-replacement faster

Use `insertMin`/`incrMin` to avoid further comparisons when the new key replaces the minimum.

treeowl force-pushed the stricter branch 5 times, most recently from 94d6426 to b6c2b22 Compare November 21, 2020 20:49

treeowl mentioned this pull request Nov 22, 2020

Speed up fromList #31

Open

lspitzner reviewed Nov 27, 2020

konsumlamm mentioned this pull request Jun 27, 2021

Add real-time versions? #38

Open

konsumlamm mentioned this pull request Aug 14, 2021

Active maintainer #41

Closed

lspitzner approved these changes Dec 6, 2021

konsumlamm reviewed Dec 6, 2021

Use a strict fold for fromList

2cc2d27

treeowl force-pushed the stricter branch 2 times, most recently from 53af002 to 6d6c414 Compare December 7, 2021 00:32

treeowl force-pushed the stricter branch from 6d6c414 to dcac791 Compare December 7, 2021 00:57

treeowl merged commit 69bcb19 into lspitzner:master Dec 7, 2021

treeowl mentioned this pull request Dec 7, 2021

Speed up fromAscList #32

Closed

konsumlamm mentioned this pull request Dec 7, 2021

Foldable traversable #46

Merged

treeowl mentioned this pull request Dec 8, 2021

Remove all orphan instances #53

Merged

This was referenced Dec 16, 2021

Add k-way merge and heapsort benchmarks #65

Merged

Offer untopped queues #33

Open
