factor: Faster modular arithmetic with the Montgomery transform by nicoonoclaste · Pull Request #1529 · uutils/coreutils

nicoonoclaste · 2020-05-30T12:21:12Z

This can probably be optimised further, but as of commit 4851619 this is already ~2.43 times faster than previously, taking ~3.55s for all integers from 2 to 10⁶.

Add tests to factor::{factor,miller_rabin}.
Rework the Arithmetic trait, implement the Montgomery transform on 64b integers (requires 128b integers for some intermediate values)
Add consistency checks with debug_assert!
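
For readers unfamiliar with the technique, the following is a minimal sketch of Montgomery multiplication modulo an odd 64-bit `n`, with R = 2^64 and 128-bit intermediates. It is not the code from this PR: the type `MontgomerySketch`, its fields, and its method names are hypothetical and chosen for illustration only. The reduction uses the subtractive form of REDC, so no intermediate sum can overflow.

```rust
/// Illustrative only: Montgomery arithmetic modulo an odd `n`, with R = 2^64.
/// All names here are hypothetical, not the PR's actual API.
struct MontgomerySketch {
    n: u64,
    n_inv: u64, // n^{-1} mod 2^64
}

impl MontgomerySketch {
    fn new(n: u64) -> Self {
        assert!(n % 2 == 1, "the modulus must be odd");
        // Newton iteration for the inverse of n modulo 2^64.
        // x = n is correct to 3 bits; each step doubles the precision.
        let mut x = n;
        for _ in 0..5 {
            x = x.wrapping_mul(2u64.wrapping_sub(n.wrapping_mul(x)));
        }
        debug_assert_eq!(n.wrapping_mul(x), 1);
        MontgomerySketch { n, n_inv: x }
    }

    /// Montgomery reduction: returns t * R^{-1} mod n, for t < n * 2^64.
    fn reduce(&self, t: u128) -> u64 {
        // m * n agrees with t on the low 64 bits, so t - m*n is divisible by R.
        let m = (t as u64).wrapping_mul(self.n_inv);
        let mn_hi = ((m as u128 * self.n as u128) >> 64) as u64;
        let t_hi = (t >> 64) as u64;
        // Both halves are < n, so the difference lies in (-n, n).
        if t_hi >= mn_hi {
            t_hi - mn_hi
        } else {
            self.n - (mn_hi - t_hi)
        }
    }

    /// Maps a to a * R mod n (this step needs a 128-bit intermediate).
    fn to_mont(&self, a: u64) -> u64 {
        (((a as u128) << 64) % self.n as u128) as u64
    }

    /// Maps a * R mod n back to a mod n.
    fn from_mont(&self, a: u64) -> u64 {
        self.reduce(a as u128)
    }

    /// Multiplies two values in Montgomery form (both must be < n).
    fn mul(&self, a: u64, b: u64) -> u64 {
        self.reduce(a as u128 * b as u128)
    }
}
```

Values are converted once with `to_mont`, multiplied any number of times with `mul`, and converted back with `from_mont`; the gain comes from replacing each 128-bit division by two multiplications.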

A further optimisation would be to make the Montgomery implementation generic, and add a 32b variant. Moreover, a 32b variant could use a shorter basis in the Miller-Rabin primality test (either 3 witnesses, or a single witness depending on n).

This seems however out-of-scope for this PR. Moreover, such an optimisation would be currently premature, and its impact hard to measure: after this change, factor only spends ~10% of its time in the Miller-Rabin primality test, and another ~10% in Pollard's ρ algorithm, versus ~35% in table::factor and ~25% printing the factorisations (!)
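
To illustrate the shorter witness basis mentioned above: the bases 2, 7 and 61 are known to make the Miller-Rabin test deterministic for every n below 4,759,123,141, which covers all 32-bit integers. The sketch below assumes plain `u64` arithmetic rather than a Montgomery representation, and the function names are hypothetical.

```rust
/// Illustrative only: computes base^exp mod m for m < 2^32,
/// so every product fits in a u64.
fn pow_mod(mut base: u64, mut exp: u64, m: u64) -> u64 {
    let mut acc = 1 % m;
    base %= m;
    while exp > 0 {
        if exp & 1 == 1 {
            acc = acc * base % m;
        }
        base = base * base % m;
        exp >>= 1;
    }
    acc
}

/// Deterministic Miller-Rabin for 32-bit inputs, using the witnesses 2, 7, 61.
fn is_prime_u32(n: u32) -> bool {
    let n = n as u64;
    if n < 2 {
        return false;
    }
    // A multiple of a witness is prime only if it is the witness itself.
    for &a in [2u64, 7, 61].iter() {
        if n % a == 0 {
            return n == a;
        }
    }
    // Here n is odd and >= 3: write n - 1 = d * 2^s with d odd.
    let mut d = n - 1;
    let mut s = 0;
    while d % 2 == 0 {
        d /= 2;
        s += 1;
    }
    'witness: for &a in [2u64, 7, 61].iter() {
        let mut x = pow_mod(a, d, n);
        if x == 1 || x == n - 1 {
            continue;
        }
        for _ in 1..s {
            x = x * x % n;
            if x == n - 1 {
                continue 'witness;
            }
        }
        return false; // a proves that n is composite
    }
    true
}
```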

nicoonoclaste · 2020-05-30T12:22:25Z

Marked WIP as 2 of the tests I introduced are now failing, after switching to the Montgomery transform. I suspect there is an overflow-related issue somewhere.

Arcterus · 2020-06-08T21:26:15Z

Let me know when you finish this. It looks good to me other than the test failures.

nicoonoclaste · 2020-06-15T17:08:56Z

@Arcterus I fixed one of the test failures, will take a break before having a go at the next one.

nicoonoclaste · 2020-06-15T20:52:18Z

This should now be working, and ~45% faster than master.

This also unlocks a lot of further algorithmic work, as we now have fast, non-overflowing modular arithmetic.

nicoonoclaste · 2020-06-15T21:06:31Z

Update: about 59% faster than master.

sylvestre · 2020-06-16T13:09:46Z

We have integration tests here:
https://github.com/uutils/coreutils/blob/master/tests/by-util/test_factor.rs
Could you make sure that your changes are also covered here? thanks :)

nicoonoclaste · 2020-06-16T13:39:12Z

> We have integration tests here:
> https://github.com/uutils/coreutils/blob/master/tests/by-util/test_factor.rs
> Could you make sure that your changes are also covered here? thanks :)

Yes, the integration tests do exercise the new code too (essentially, anything that calls into miller_rabin::test or rho::factor does)

nicoonoclaste · 2020-06-16T13:45:56Z

@sylvestre Done :)

Arcterus · 2020-06-16T18:41:51Z

This fails test_random_big for me. It’s apparently overflowing somewhere.

nicoonoclaste · 2020-06-16T18:45:23Z

> This fails test_random_big for me. It’s apparently overflowing somewhere.

Odd; can you provide the log output (and preferably the backtrace too)?

Arcterus · 2020-06-16T18:52:50Z

Here’s the backtrace. There is no output on stdout, and this is all that is on stderr (when run with RUST_BACKTRACE=1). Sorry that the formatting is a little messed up.

thread 'main' panicked at 'attempt to add with overflow', src/uu/factor/src/numeric.rs:126:17
stack backtrace:
   0: backtrace::backtrace::libunwind::trace
             at /cargo/registry/src/github.com-1ecc6299db9ec823/backtrace-0.3.46/src/backtrace/libunwind.rs:86
   1: backtrace::backtrace::trace_unsynchronized
             at /cargo/registry/src/github.com-1ecc6299db9ec823/backtrace-0.3.46/src/backtrace/mod.rs:66
   2: std::sys_common::backtrace::_print_fmt
             at src/libstd/sys_common/backtrace.rs:78
   3: <std::sys_common::backtrace::_print::DisplayBacktrace as core::fmt::Display>::fmt
             at src/libstd/sys_common/backtrace.rs:59
   4: core::fmt::write
             at src/libcore/fmt/mod.rs:1069
   5: std::io::Write::write_fmt
             at src/libstd/io/mod.rs:1504
   6: std::sys_common::backtrace::_print
             at src/libstd/sys_common/backtrace.rs:62
   7: std::sys_common::backtrace::print
             at src/libstd/sys_common/backtrace.rs:49
   8: std::panicking::default_hook::{{closure}}
             at src/libstd/panicking.rs:198
   9: std::panicking::default_hook
             at src/libstd/panicking.rs:218
  10: <alloc::boxed::Box<F> as core::ops::function::Fn<A>>::call
             at /rustc/49cae55760da0a43428eba73abcb659bb70cf2e4/src/liballoc/boxed.rs:1022
  11: uucore::mods::panic::mute_sigpipe_panic::{{closure}}
             at /home/arcterus/.cargo/git/checkouts/uucore-cbba7adad0ea8524/47ad6ab/src/lib/mods/panic.rs:15
  12: std::panicking::rust_panic_with_hook
             at src/libstd/panicking.rs:515
  13: rust_begin_unwind
             at src/libstd/panicking.rs:419
  14: core::panicking::panic_fmt
             at src/libcore/panicking.rs:111
  15: core::panicking::panic
             at src/libcore/panicking.rs:54
  16: <uu_factor::numeric::Montgomery as uu_factor::numeric::Arithmetic>::add
             at src/uu/factor/src/numeric.rs:126
  17: uu_factor::rho::find_divisor::{{closure}}::{{closure}}
             at src/uu/factor/src/rho.rs:18
  18: uu_factor::rho::find_divisor
             at src/uu/factor/src/rho.rs:26
  19: uu_factor::rho::_factor
             at src/uu/factor/src/rho.rs:70
  20: uu_factor::rho::factor
             at src/uu/factor/src/rho.rs:77
  21: uu_factor::factor
             at src/uu/factor/src/factor.rs:105
  22: uu_factor::print_factors
             at src/uu/factor/src/factor.rs:112
  23: uu_factor::print_factors_str::{{closure}}
             at src/uu/factor/src/factor.rs:118
  24: core::result::Result<T,E>::and_then
             at /rustc/49cae55760da0a43428eba73abcb659bb70cf2e4/src/libcore/result.rs:729
  25: uu_factor::print_factors_str
             at src/uu/factor/src/factor.rs:117
  26: uu_factor::uumain
             at src/uu/factor/src/factor.rs:132
  27: coreutils::main
             at src/bin/coreutils.rs:80
  28: std::rt::lang_start::{{closure}}
             at /rustc/49cae55760da0a43428eba73abcb659bb70cf2e4/src/libstd/rt.rs:67
  29: std::rt::lang_start_internal::{{closure}}
             at src/libstd/rt.rs:52
  30: std::panicking::try::do_call
             at src/libstd/panicking.rs:331
  31: std::panicking::try
             at src/libstd/panicking.rs:274
  32: std::panic::catch_unwind
             at src/libstd/panic.rs:394
  33: std::rt::lang_start_internal
             at src/libstd/rt.rs:51
  34: std::rt::lang_start
             at /rustc/49cae55760da0a43428eba73abcb659bb70cf2e4/src/libstd/rt.rs:67
  35: main
  36: __libc_start_main
  37: _start
note: Some details are omitted, run with `RUST_BACKTRACE=full` for a verbose backtrace.
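
The panic above comes from a checked addition in a debug build: when the modulus n exceeds 2^63, the sum of two residues below n no longer fits in a `u64`. The sketch below shows one common way to add modulo n without overflowing. It is an assumption about the general technique, not the fix that was actually committed, and `add_mod` is a hypothetical name.

```rust
/// Illustrative only: (a + b) mod n for a, b < n, valid for any n up to u64::MAX.
fn add_mod(a: u64, b: u64, n: u64) -> u64 {
    debug_assert!(a < n && b < n);
    // overflowing_add returns the wrapped sum and whether 2^64 was exceeded.
    let (sum, overflowed) = a.overflowing_add(b);
    if overflowed || sum >= n {
        // The true sum is below 2n, so a single subtraction suffices;
        // wrapping_sub cancels the lost 2^64 in the overflow case.
        sum.wrapping_sub(n)
    } else {
        sum
    }
}
```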

nicoonoclaste · 2020-06-16T18:54:46Z

@Arcterus Thanks, I think I know what the issue might be then. Will have a deeper look after dinner.

Arcterus · 2020-06-16T18:56:05Z

Sounds good 👍

nicoonoclaste · 2020-06-18T12:29:38Z

@Arcterus Fixed ~~there's an unrelated CI failure (failed while running rustup) though~~

nicoonoclaste · 2020-06-18T17:40:10Z

🎉

Arcterus added the I - Performance (Speed) and U - factor labels Jun 8, 2020

rivy force-pushed the master branch 4 times, most recently from f07992e to 813e57d Compare June 15, 2020 04:39

nicoonoclaste added 9 commits June 15, 2020 19:10

factor::miller_rabin: Add tests

bada753

factor::factor: Add integration tests

e911555

factor::numeric: Implement Montgomery's transform

8a4d0d3

This is a faster way to perform arithmetic mod n, when n is an odd 64-bit number.

factor::numeric::Montgomery: Add debug assertions

33e18b4

In debug mode, checks that all arithmetic operations coincide with the plain-u64 versions, as long as the latter does not overflow.
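
A minimal sketch of this kind of consistency check, assuming a hypothetical `mul_mod` helper rather than the PR's actual `Arithmetic` trait: the fast path is compared against plain `u64` arithmetic, but only when the plain product does not overflow.

```rust
/// Illustrative only: modular multiplication with a debug-mode cross-check.
fn mul_mod(a: u64, b: u64, n: u64) -> u64 {
    // Overflow-free path through a 128-bit intermediate.
    let r = ((a as u128 * b as u128) % n as u128) as u64;
    // checked_mul yields None on overflow, in which case the check is skipped.
    debug_assert!(a.checked_mul(b).map_or(true, |p| p % n == r));
    r
}
```

Because `debug_assert!` compiles to nothing in release builds, the check costs nothing in the optimised binary.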

factor::Factors::add: Make the precondition check a debug_assert

f84d0f9

factor: Fix for old Rust

918035e

factor::numeric: Simplify inv_mod_u64

19a0645

Just call `u64::wrapping_{mul,sub}` instead of (de)constructing Wrapping<u64> values.
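
As an illustration of the wrapping style described here, the sketch below computes the inverse of an odd number modulo 2^64 by Newton iteration, using only `wrapping_mul` and `wrapping_sub`. The name `inv_mod_u64_sketch` and its `Option` return type are assumptions; the real `inv_mod_u64` may differ.

```rust
/// Illustrative only: inverse of `a` modulo 2^64, or None if `a` is even.
fn inv_mod_u64_sketch(a: u64) -> Option<u64> {
    if a % 2 == 0 {
        return None; // even numbers have no inverse modulo a power of two
    }
    // a * a = 1 (mod 8) for odd a, so x = a is correct to 3 bits.
    let mut x = a;
    // Each step doubles the number of correct bits: 3, 6, 12, 24, 48, 96.
    for _ in 0..5 {
        x = x.wrapping_mul(2u64.wrapping_sub(a.wrapping_mul(x)));
    }
    Some(x)
}
```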

factor::numeric: Simplify Montgomery (remove superfluous Wrapping)

2238065

factor::numeric::Montgomery: Fix overflow bug

cb6051c

nicoonoclaste changed the title ~~factor: Faster modular arithmetic with the Montgomery transform [WIP]~~ factor: Faster modular arithmetic with the Montgomery transform Jun 15, 2020

factor::miller_rabin: Avoid repeatedly transforming 1 and -1

4851619

Approx. 25% speedup

fixup! factor::numeric::Montgomery: Fix overflow bug

f1788d9

sylvestre reviewed Jun 16, 2020

Comment thread src/uu/factor/src/numeric.rs Outdated

sylvestre reviewed Jun 16, 2020

Comment thread src/uu/factor/src/numeric.rs

nicoonoclaste added 2 commits June 16, 2020 15:43

factor: Run cargo fmt

334e027

factor::numeric::gcd: Silence the (erroneous) dead code lint

d1470da

factor::numeric::Montgomery::add: Deal with rare overflow case

fb08d9f

Arcterus merged commit 6105cce into uutils:master Jun 18, 2020

nicoonoclaste deleted the factor/montgomery branch June 18, 2020 17:40

This was referenced Jun 18, 2020

factor: Add/update copyright notices as necessary #1546

Merged

test_random_big() fails (or hangs) randomly #1531

Closed

rivy added a commit to rivy/rs.coreutils that referenced this pull request Jun 21, 2020

tests/factor ~ re-enable factor tests (with additional detail for fai…

f3ee451

…lures) - probably fixes uutils#1531 (via uutils#1529) per @nbraud

rivy mentioned this pull request Jun 21, 2020

tests/factor ~ re-enable factor tests #1553

Merged
