Faster random seq generation using rand_xoshiro #3

imartayan · 2025-03-17T19:15:18Z

The current PRNG is quite slow, which is frustrating when it becomes the slowest part of a benchmark.
Switching to rand_xoshiro (based on Blackman & Vigna's xoshiro generators) makes it an order of magnitude faster.

RagnarGrootKoerkamp · 2025-03-17T19:22:47Z

Ah nice!
Just to check: could you do a quick-and-dirty comparison against the following?

- collecting u64 instead of u8 (or even better, fill_bytes)
- using the SmallRng: https://docs.rs/rand/latest/rand/rngs/struct.SmallRng.html
- or using the fastrand crate: https://docs.rs/fastrand/latest/fastrand/

(It's not like this adds recursive dependencies so it's gonna be fine anyway, but I'm curious and just came across SmallRng earlier this week.)

Edit: Ah I see that this is maintained by the same people as rand itself 👍

RagnarGrootKoerkamp · 2025-03-17T19:29:11Z

Ah: SmallRng is already using xoshiro!

Additional context on Vigna's rng page: https://prng.di.unimi.it/

Should we just use that instead? It feels 'simpler'. (But feel free to argue that it's better to pin the algorithm for reproducibility, in which case yes, we should just use the crate directly as you did.)
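
To make the "pin the algorithm" option concrete, here is a minimal sketch of xoshiro256++ written out by hand, following the public-domain reference on Vigna's page. As far as I know this is the variant SmallRng uses on 64-bit targets in recent rand releases, but rand does not promise that it stays that way. The type name `Xoshiro256PlusPlus`, the `from_seed(u64)` constructor, and the SplitMix64 seeding are illustrative choices, not the rand_xoshiro API.

```rust
/// SplitMix64, used here only to expand a 64-bit seed into 256 bits of state.
fn splitmix64(state: &mut u64) -> u64 {
    *state = state.wrapping_add(0x9E37_79B9_7F4A_7C15);
    let mut z = *state;
    z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
    z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
    z ^ (z >> 31)
}

/// Hand-rolled xoshiro256++ (hypothetical type, not the rand_xoshiro API).
struct Xoshiro256PlusPlus {
    s: [u64; 4],
}

impl Xoshiro256PlusPlus {
    /// Illustrative seeding: four SplitMix64 outputs become the state.
    fn from_seed(seed: u64) -> Self {
        let mut sm = seed;
        let s = [
            splitmix64(&mut sm),
            splitmix64(&mut sm),
            splitmix64(&mut sm),
            splitmix64(&mut sm),
        ];
        Self { s }
    }

    /// One step of xoshiro256++.
    fn next_u64(&mut self) -> u64 {
        let s = &mut self.s;
        let result = s[0].wrapping_add(s[3]).rotate_left(23).wrapping_add(s[0]);
        let t = s[1] << 17;
        s[2] ^= s[0];
        s[3] ^= s[1];
        s[1] ^= s[2];
        s[0] ^= s[3];
        s[2] ^= t;
        s[3] = s[3].rotate_left(45);
        result
    }

    /// Fill a buffer 8 bytes at a time; a shorter tail takes a partial word.
    fn fill_bytes(&mut self, dest: &mut [u8]) {
        for chunk in dest.chunks_mut(8) {
            let bytes = self.next_u64().to_le_bytes();
            chunk.copy_from_slice(&bytes[..chunk.len()]);
        }
    }
}
```

With the algorithm fixed in the code like this, the same seed gives the same byte stream regardless of which rand version is installed. The trade-off is owning this code instead of relying on a maintained crate.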

imartayan · 2025-03-17T19:33:43Z

Oh I didn't know that! I agree, using SmallRng seems simpler.

imartayan · 2025-03-17T20:37:01Z

I did a small benchmark here: https://github.com/imartayan/prng-bench
Surprisingly, fastrand seems much faster than all other methods.
Can you reproduce these results?

imartayan · 2025-03-17T21:11:50Z

The difference seems less significant on x86 than ARM, but fastrand is still the fastest.
Given these results, I think we should use fastrand, are you okay with that?

RagnarGrootKoerkamp · 2025-03-17T22:31:17Z

Some more background reading:

- Better default RNG in the future? JuliaLang/julia#27614: using xoshiro256 in Julia
- https://prng.di.unimi.it/: Sebastiano's notes, including a remark on vectorization
- The 64-bit state space of fastrand is kinda small for general applications. I guess it would be good enough here, but I would prefer to stick with SmallRng anyway.
- Auto-vectorization smol-rs/fastrand#101: issue I created on fastrand
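
The state-space remark can be illustrated with a wyrand-style generator, which is the family fastrand is based on. This is a sketch, not fastrand's code: the struct name is made up, and the constants are the ones from the original wyrand reference, which may differ from what fastrand currently ships.

```rust
/// Wyrand-style generator (illustrative): the entire state is one u64.
struct WyRand(u64);

impl WyRand {
    fn next_u64(&mut self) -> u64 {
        // The state is a counter advanced by an odd constant, so it cycles
        // through all 2^64 values before repeating.
        self.0 = self.0.wrapping_add(0xA076_1D64_78BD_642F);
        // Mix the counter with a 64x64 -> 128-bit multiply and fold the halves.
        let t = u128::from(self.0) * u128::from(self.0 ^ 0xE703_7ED1_A0B4_28DB);
        (t as u64) ^ ((t >> 64) as u64)
    }
}
```

Since the state is a single 64-bit counter, there are only 2^64 possible streams and each repeats after 2^64 outputs, compared with 256 bits of state for xoshiro256. For generating benchmark inputs that is likely plenty, which matches the "good enough here" assessment.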

Also note: on my hardware, fastrand is vectorized by default, which makes it about as fast as SmallRng. When vectorization is prevented, it becomes twice as fast, but then we'd have to modify the library.

RagnarGrootKoerkamp · 2025-03-17T22:38:33Z

src/ascii_seq.rs

    fn random(n: usize) -> Self {
-        let mut rng = rand::rng();
+        let mut seq = vec![0; n];
+        rand::rngs::SmallRng::from_os_rng().fill_bytes(&mut seq);


Hmm, this generates 4x the randomness we need. We could instead generate 4x less and then use only 2 bits at a time in the loop below. But it's probably fine as is.
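
A minimal sketch of the "2 bits at a time" idea, assuming the loop maps each random value to one of four ASCII bases. The alphabet `b"ACGT"`, the function name, and the bit order (lowest two bits first) are assumptions for illustration; the actual loop is not shown in this thread.

```rust
/// Assumed alphabet; the real lookup array is not shown in the diff.
const BASES: [u8; 4] = *b"ACGT";

/// Turn random bytes into `n` bases, consuming 2 bits per base.
/// `random` must hold at least ceil(n / 4) bytes.
fn bases_from_random_bytes(random: &[u8], n: usize) -> Vec<u8> {
    assert!(random.len() * 4 >= n, "not enough random bytes");
    let mut seq = Vec::with_capacity(n);
    for i in 0..n {
        let byte = random[i / 4];
        // Base i uses bits 2*(i % 4) and 2*(i % 4) + 1 of its byte.
        let bits = (byte >> (2 * (i % 4))) & 0b11;
        seq.push(BASES[bits as usize]);
    }
    seq
}
```

With this, a sequence of length n needs only about n / 4 bytes from the RNG instead of n.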

If we really wanted to optimize this, we might be able to replace the array indexing below with SIMD shuffles, but I don't feel like doing that now.

Actually, what might be faster is to make a 256-entry array mapping each byte to a [u8; 4], and then append 4 values at a time. That's still relatively clean and should be fast.
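
The lookup-table variant could look roughly like this. It is a sketch under the same assumptions as before: a four-letter alphabet `b"ACGT"` and lowest-bits-first ordering, neither of which is confirmed by the thread, and the names `build_table`, `TABLE`, and `bases_from_table` are made up here.

```rust
/// Assumed alphabet; the real lookup array is not shown in the diff.
const BASES: [u8; 4] = *b"ACGT";

/// Build the 256-entry table at compile time: byte -> its 4 decoded bases.
const fn build_table() -> [[u8; 4]; 256] {
    let mut table = [[0u8; 4]; 256];
    let mut b = 0;
    while b < 256 {
        let mut i = 0;
        while i < 4 {
            table[b][i] = BASES[(b >> (2 * i)) & 0b11];
            i += 1;
        }
        b += 1;
    }
    table
}

static TABLE: [[u8; 4]; 256] = build_table();

/// Decode `n` bases from random bytes, appending 4 bases per input byte.
fn bases_from_table(random: &[u8], n: usize) -> Vec<u8> {
    assert!(random.len() * 4 >= n, "not enough random bytes");
    let needed = (n + 3) / 4; // number of input bytes, rounded up
    let mut seq = Vec::with_capacity(needed * 4);
    for &byte in &random[..needed] {
        seq.extend_from_slice(&TABLE[byte as usize]);
    }
    seq.truncate(n); // drop the up-to-3 surplus bases from the last byte
    seq
}
```

The table costs 1 KiB and replaces four shift-and-mask steps per byte with a single indexed copy. Whether that is actually faster would need measuring.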

(Merged for now. We can revisit this if it becomes a bottleneck later.)

src/ascii.rs

imartayan force-pushed the rand_xoshiro branch from f9b4e90 to 1479de0 Compare March 17, 2025 19:17

Faster random seq generation using rand_xoshiro

a1ad296

imartayan force-pushed the rand_xoshiro branch from 1479de0 to a1ad296 Compare March 17, 2025 19:17

Use smallrng instead of rand_xoshiro

887bc1f

RagnarGrootKoerkamp approved these changes Mar 17, 2025

RagnarGrootKoerkamp merged commit cf4a2b7 into rust-seq:master Mar 21, 2025
2 checks passed
