Skip to content

Use create-pull-request v2#112

Merged
mergify[bot] merged 9 commits into
masterfrom
actions/trigger/pkg-update
Jun 28, 2020
Merged

Use create-pull-request v2#112
mergify[bot] merged 9 commits into
masterfrom
actions/trigger/pkg-update

Conversation

@tkf
Copy link
Copy Markdown
Owner

@tkf tkf commented Jun 28, 2020

Trying to fix #100 (comment)

Commit Message

Use create-pull-request v2 (#112)

Using SSH-based configuration (which quires create-pull-request v2) so
that (hopefully) commits from this workflow will trigger actions.
With ssh-key is passed to actions/checkout@v2, git commands cannot
be used inside docker anymore (tkf/julia-merge-except-manifests@v1).
So, just inlining the script for now.

@codecov
Copy link
Copy Markdown

codecov Bot commented Jun 28, 2020

Codecov Report

Merging #112 into master will decrease coverage by 1.46%.
The diff coverage is n/a.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #112      +/-   ##
==========================================
- Coverage   80.34%   78.88%   -1.47%     
==========================================
  Files           8        8              
  Lines         407      412       +5     
==========================================
- Hits          327      325       -2     
- Misses         80       87       +7     
Impacted Files Coverage Δ
src/foreach.jl 81.81% <0.00%> (-3.04%) ⬇️
src/utils.jl 73.21% <0.00%> (-1.34%) ⬇️
src/countingsort.jl 11.47% <0.00%> (-1.03%) ⬇️
src/map.jl 85.71% <0.00%> (-0.96%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a573a0b...a3450bb. Read the comment docs.

@github-actions
Copy link
Copy Markdown
Contributor

Benchmark result

Judge result

Benchmark Report for /home/runner/work/ThreadsX.jl/ThreadsX.jl

Job Properties

  • Time of benchmarks:
    • Target: 28 Jun 2020 - 08:04
    • Baseline: 28 Jun 2020 - 08:10
  • Package commits:
    • Target: 58a7c6
    • Baseline: a573a0
  • Julia commits:
    • Target: 44fa15
    • Baseline: 44fa15
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 2
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["findfirst", "0%", "base"] 1.07 (5%) ❌ 1.00 (1%)
["findfirst", "0%", "tx"] 1.12 (5%) ❌ 1.00 (1%)
["findfirst", "0%", "tx-noterm"] 1.11 (5%) ❌ 1.00 (1%)
["findfirst", "0%", "tx-seq"] 1.07 (5%) ❌ 1.00 (1%)
["findfirst", "10%", "tx-noterm"] 1.03 (5%) 1.08 (1%) ❌
["findfirst", "20%", "tx-noterm"] 0.92 (5%) ✅ 1.49 (1%) ❌
["findfirst", "30%", "base"] 1.06 (5%) ❌ 1.00 (1%)
["findfirst", "30%", "tx"] 1.06 (5%) ❌ 1.00 (1%)
["findfirst", "40%", "tx"] 1.06 (5%) ❌ 1.00 (1%)
["findfirst", "50%", "tx"] 1.06 (5%) ❌ 1.00 (1%)
["findfirst", "50%", "tx-noterm"] 1.05 (5%) ❌ 0.96 (1%) ✅
["foreach", "base", "A .= B .+ B'"] 0.90 (5%) ✅ 1.00 (1%)
["foreach_seq_sum_many", ":nvecs => 8", "man"] 1.53 (5%) ❌ 1.00 (1%)
["foreach_seq_sum_many", ":nvecs => 8", "tx", ":simd => :ivdep"] 1.33 (5%) ❌ 1.00 (1%)
["sort", "F64 (narrow)", "Base"] 1.06 (5%) ❌ 1.00 (1%)
["sort", "F64 (narrow)", "ThreadsX.MergeSort"] 1.10 (5%) ❌ 1.00 (1%)
["sort", "F64 (narrow)", "ThreadsX.QuickSort"] 1.10 (5%) ❌ 1.00 (1%)
["sort", "F64 (narrow)", "ThreadsX.StableQuickSort"] 1.08 (5%) ❌ 1.00 (1%)
["sort", "F64 (wide)", "Base"] 1.23 (5%) ❌ 1.00 (1%)
["sort", "F64 (wide)", "ThreadsX.MergeSort"] 1.11 (5%) ❌ 1.00 (1%)
["sort", "F64 (wide)", "ThreadsX.QuickSort"] 1.18 (5%) ❌ 1.00 (1%)
["sort", "F64 (wide)", "ThreadsX.StableQuickSort"] 1.15 (5%) ❌ 1.00 (1%)
["sort", "reversed", "Base"] 1.07 (5%) ❌ 1.00 (1%)
["unique", "rand(1:10, 1000000)", "base"] 1.21 (5%) ❌ 1.00 (1%)
["unique", "rand(1:10, 1000000)", "tx"] 1.15 (5%) ❌ 1.00 (1%)
["unique", "rand(1:1000, 1000000)", "base"] 1.13 (5%) ❌ 1.00 (1%)
["unique", "rand(1:1000, 1000000)", "tx"] 1.18 (5%) ❌ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["findfirst", "0%"]
  • ["findfirst", "10%"]
  • ["findfirst", "20%"]
  • ["findfirst", "30%"]
  • ["findfirst", "40%"]
  • ["findfirst", "50%"]
  • ["foreach", "base"]
  • ["foreach", "broadcast"]
  • ["foreach", "tx"]
  • ["foreach_seq", "base"]
  • ["foreach_seq", "tx"]
  • ["foreach_seq_double", "cartesian"]
  • ["foreach_seq_double", "cartesian", "tx"]
  • ["foreach_seq_double", "linear"]
  • ["foreach_seq_double", "linear", "tx"]
  • ["foreach_seq_sum_many", ":nvecs => 8"]
  • ["foreach_seq_sum_many", ":nvecs => 8", "tx"]
  • ["sort", "F64 (narrow)"]
  • ["sort", "F64 (wide)"]
  • ["sort", "I64 (narrow)"]
  • ["sort", "I64 (wide)"]
  • ["sort", "reversed"]
  • ["sort", "sorted"]
  • ["unique", "rand(1:10, 1000000)"]
  • ["unique", "rand(1:1000, 1000000)"]

Julia versioninfo

Target

Julia Version 1.4.2
Commit 44fa15b150* (2020-05-23 18:35 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.3.0-1028-azure #29~18.04.1-Ubuntu SMP Fri Jun 5 14:32:34 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      58676 s          0 s       2614 s      26791 s          0 s
       #2  2294 MHz      50792 s          0 s       3024 s      34352 s          0 s
       
  Memory: 6.764884948730469 GB (2064.45703125 MB free)
  Uptime: 903.0 sec
  Load Avg:  1.24267578125  1.36181640625  0.9755859375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.4.2
Commit 44fa15b150* (2020-05-23 18:35 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.3.0-1028-azure #29~18.04.1-Ubuntu SMP Fri Jun 5 14:32:34 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      74290 s          0 s       3229 s      45116 s          0 s
       #2  2294 MHz      81210 s          0 s       3950 s      37510 s          0 s
       
  Memory: 6.764884948730469 GB (2449.16015625 MB free)
  Uptime: 1249.0 sec
  Load Avg:  1.37353515625  1.38525390625  1.10302734375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/ThreadsX.jl/ThreadsX.jl

Job Properties

  • Time of benchmark: 28 Jun 2020 - 8:4
  • Package commit: 58a7c6
  • Julia commit: 44fa15
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["findfirst", "0%", "base"] 3.000 ns (5%)
["findfirst", "0%", "tx"] 25.100 μs (5%) 11.95 KiB (1%) 218
["findfirst", "0%", "tx-noterm"] 23.200 μs (5%) 11.97 KiB (1%) 218
["findfirst", "0%", "tx-seq"] 207.856 ns (5%) 544 bytes (1%) 14
["findfirst", "10%", "base"] 59.600 μs (5%)
["findfirst", "10%", "tx"] 65.900 μs (5%) 14.36 KiB (1%) 266
["findfirst", "10%", "tx-noterm"] 179.201 μs (5%) 32.94 KiB (1%) 608
["findfirst", "10%", "tx-seq"] 59.600 μs (5%) 560 bytes (1%) 15
["findfirst", "20%", "base"] 118.100 μs (5%)
["findfirst", "20%", "tx"] 116.301 μs (5%) 21.33 KiB (1%) 393
["findfirst", "20%", "tx-noterm"] 172.301 μs (5%) 35.33 KiB (1%) 652
["findfirst", "20%", "tx-seq"] 118.100 μs (5%) 560 bytes (1%) 15
["findfirst", "30%", "base"] 186.601 μs (5%)
["findfirst", "30%", "tx"] 170.801 μs (5%) 28.27 KiB (1%) 520
["findfirst", "30%", "tx-noterm"] 200.201 μs (5%) 28.31 KiB (1%) 522
["findfirst", "30%", "tx-seq"] 176.701 μs (5%) 560 bytes (1%) 15
["findfirst", "40%", "base"] 234.702 μs (5%)
["findfirst", "40%", "tx"] 247.702 μs (5%) 35.31 KiB (1%) 651
["findfirst", "40%", "tx-noterm"] 235.401 μs (5%) 35.31 KiB (1%) 650
["findfirst", "40%", "tx-seq"] 234.801 μs (5%) 560 bytes (1%) 15
["findfirst", "50%", "base"] 293.101 μs (5%)
["findfirst", "50%", "tx"] 269.702 μs (5%) 37.72 KiB (1%) 699
["findfirst", "50%", "tx-noterm"] 303.201 μs (5%) 51.56 KiB (1%) 950
["findfirst", "50%", "tx-seq"] 293.201 μs (5%) 560 bytes (1%) 15
["foreach", "base", "A .= B .+ B'"] 411.029 ms (5%) 44.591 ms 305.18 MiB (1%) 16000002
["foreach", "base", "A .= B .+ C"] 261.907 ms (5%) 29.679 ms 305.18 MiB (1%) 16000001
["foreach", "broadcast", "A .= B .+ B'"] 16.292 ms (5%)
["foreach", "broadcast", "A .= B .+ C"] 9.679 ms (5%)
["foreach", "tx", "A .= B .+ B'"] 8.420 ms (5%) 25.94 KiB (1%) 360
["foreach", "tx", "A .= B .+ C"] 5.045 ms (5%) 12.77 KiB (1%) 125
["foreach_seq", "base", "Matrix"] 561.002 μs (5%)
["foreach_seq", "base", "Transpose"] 2.049 ms (5%)
["foreach_seq", "base", "Vector"] 560.802 μs (5%)
["foreach_seq", "tx", "Matrix"] 564.401 μs (5%)
["foreach_seq", "tx", "Transpose"] 926.703 μs (5%) 16 bytes (1%) 1
["foreach_seq", "tx", "Vector"] 561.001 μs (5%)
["foreach_seq_double", "cartesian", "man"] 20.300 μs (5%)
["foreach_seq_double", "cartesian", "tx", ":simd => :ivdep"] 20.500 μs (5%)
["foreach_seq_double", "cartesian", "tx", ":simd => false"] 20.100 μs (5%)
["foreach_seq_double", "cartesian", "tx", ":simd => true"] 20.400 μs (5%)
["foreach_seq_double", "linear", "man"] 104.258 ns (5%)
["foreach_seq_double", "linear", "tx", ":simd => :ivdep"] 101.917 ns (5%)
["foreach_seq_double", "linear", "tx", ":simd => false"] 102.033 ns (5%)
["foreach_seq_double", "linear", "tx", ":simd => true"] 104.269 ns (5%)
["foreach_seq_sum_many", ":nvecs => 8", "man"] 2.289 μs (5%)
["foreach_seq_sum_many", ":nvecs => 8", "tx", ":simd => :ivdep"] 2.130 μs (5%)
["foreach_seq_sum_many", ":nvecs => 8", "tx", ":simd => false"] 2.611 μs (5%)
["foreach_seq_sum_many", ":nvecs => 8", "tx", ":simd => true"] 2.600 μs (5%)
["sort", "F64 (narrow)", "Base"] 2.373 ms (5%)
["sort", "F64 (narrow)", "ThreadsX.MergeSort"] 2.804 ms (5%) 1.19 MiB (1%) 535
["sort", "F64 (narrow)", "ThreadsX.QuickSort"] 1.704 ms (5%) 965.08 KiB (1%) 1224
["sort", "F64 (narrow)", "ThreadsX.StableQuickSort"] 1.676 ms (5%) 1.02 MiB (1%) 1246
["sort", "F64 (wide)", "Base"] 6.804 ms (5%)
["sort", "F64 (wide)", "ThreadsX.MergeSort"] 5.357 ms (5%) 1.19 MiB (1%) 564
["sort", "F64 (wide)", "ThreadsX.QuickSort"] 5.943 ms (5%) 1.01 MiB (1%) 2141
["sort", "F64 (wide)", "ThreadsX.StableQuickSort"] 6.506 ms (5%) 1.39 MiB (1%) 2194
["sort", "I64 (narrow)", "Base"] 129.000 μs (5%) 160 bytes (1%) 1
["sort", "I64 (narrow)", "ThreadsX.MergeSort"] 136.300 μs (5%) 864 bytes (1%) 17
["sort", "I64 (narrow)", "ThreadsX.QuickSort"] 137.201 μs (5%) 864 bytes (1%) 17
["sort", "I64 (narrow)", "ThreadsX.StableQuickSort"] 136.500 μs (5%) 864 bytes (1%) 17
["sort", "I64 (wide)", "Base"] 5.615 ms (5%)
["sort", "I64 (wide)", "ThreadsX.MergeSort"] 4.037 ms (5%) 1.19 MiB (1%) 555
["sort", "I64 (wide)", "ThreadsX.QuickSort"] 4.035 ms (5%) 1.01 MiB (1%) 2236
["sort", "I64 (wide)", "ThreadsX.StableQuickSort"] 4.607 ms (5%) 1.40 MiB (1%) 2270
["sort", "reversed", "Base"] 702.402 μs (5%)
["sort", "reversed", "ThreadsX.MergeSort"] 1.215 ms (5%) 1.18 MiB (1%) 434
["sort", "reversed", "ThreadsX.QuickSort"] 1.102 ms (5%) 998.75 KiB (1%) 1871
["sort", "reversed", "ThreadsX.StableQuickSort"] 1.536 ms (5%) 1.36 MiB (1%) 1903
["sort", "sorted", "Base"] 609.402 μs (5%)
["sort", "sorted", "ThreadsX.MergeSort"] 886.503 μs (5%) 1.18 MiB (1%) 431
["sort", "sorted", "ThreadsX.QuickSort"] 1.148 ms (5%) 998.75 KiB (1%) 1871
["sort", "sorted", "ThreadsX.StableQuickSort"] 1.199 ms (5%) 1.36 MiB (1%) 1903
["unique", "rand(1:10, 1000000)", "base"] 10.766 ms (5%) 832 bytes (1%) 8
["unique", "rand(1:10, 1000000)", "tx"] 5.408 ms (5%) 50.98 KiB (1%) 882
["unique", "rand(1:1000, 1000000)", "base"] 9.257 ms (5%) 65.95 KiB (1%) 27
["unique", "rand(1:1000, 1000000)", "tx"] 6.022 ms (5%) 1.07 MiB (1%) 1186

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["findfirst", "0%"]
  • ["findfirst", "10%"]
  • ["findfirst", "20%"]
  • ["findfirst", "30%"]
  • ["findfirst", "40%"]
  • ["findfirst", "50%"]
  • ["foreach", "base"]
  • ["foreach", "broadcast"]
  • ["foreach", "tx"]
  • ["foreach_seq", "base"]
  • ["foreach_seq", "tx"]
  • ["foreach_seq_double", "cartesian"]
  • ["foreach_seq_double", "cartesian", "tx"]
  • ["foreach_seq_double", "linear"]
  • ["foreach_seq_double", "linear", "tx"]
  • ["foreach_seq_sum_many", ":nvecs => 8"]
  • ["foreach_seq_sum_many", ":nvecs => 8", "tx"]
  • ["sort", "F64 (narrow)"]
  • ["sort", "F64 (wide)"]
  • ["sort", "I64 (narrow)"]
  • ["sort", "I64 (wide)"]
  • ["sort", "reversed"]
  • ["sort", "sorted"]
  • ["unique", "rand(1:10, 1000000)"]
  • ["unique", "rand(1:1000, 1000000)"]

Julia versioninfo

Julia Version 1.4.2
Commit 44fa15b150* (2020-05-23 18:35 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.3.0-1028-azure #29~18.04.1-Ubuntu SMP Fri Jun 5 14:32:34 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      58676 s          0 s       2614 s      26791 s          0 s
       #2  2294 MHz      50792 s          0 s       3024 s      34352 s          0 s
       
  Memory: 6.764884948730469 GB (2064.45703125 MB free)
  Uptime: 903.0 sec
  Load Avg:  1.24267578125  1.36181640625  0.9755859375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/ThreadsX.jl/ThreadsX.jl

Job Properties

  • Time of benchmark: 28 Jun 2020 - 8:10
  • Package commit: a573a0
  • Julia commit: 44fa15
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["findfirst", "0%", "base"] 2.800 ns (5%)
["findfirst", "0%", "tx"] 22.400 μs (5%) 11.95 KiB (1%) 218
["findfirst", "0%", "tx-noterm"] 20.900 μs (5%) 11.97 KiB (1%) 218
["findfirst", "0%", "tx-seq"] 193.781 ns (5%) 544 bytes (1%) 14
["findfirst", "10%", "base"] 59.100 μs (5%)
["findfirst", "10%", "tx"] 66.400 μs (5%) 14.36 KiB (1%) 266
["findfirst", "10%", "tx-noterm"] 173.400 μs (5%) 30.53 KiB (1%) 558
["findfirst", "10%", "tx-seq"] 59.700 μs (5%) 560 bytes (1%) 15
["findfirst", "20%", "base"] 117.601 μs (5%)
["findfirst", "20%", "tx"] 121.800 μs (5%) 21.34 KiB (1%) 394
["findfirst", "20%", "tx-noterm"] 187.400 μs (5%) 23.66 KiB (1%) 436
["findfirst", "20%", "tx-seq"] 118.000 μs (5%) 560 bytes (1%) 15
["findfirst", "30%", "base"] 176.500 μs (5%)
["findfirst", "30%", "tx"] 160.400 μs (5%) 28.27 KiB (1%) 520
["findfirst", "30%", "tx-noterm"] 194.601 μs (5%) 28.31 KiB (1%) 522
["findfirst", "30%", "tx-seq"] 176.501 μs (5%) 560 bytes (1%) 15
["findfirst", "40%", "base"] 234.700 μs (5%)
["findfirst", "40%", "tx"] 234.101 μs (5%) 35.31 KiB (1%) 651
["findfirst", "40%", "tx-noterm"] 235.501 μs (5%) 35.34 KiB (1%) 652
["findfirst", "40%", "tx-seq"] 234.900 μs (5%) 560 bytes (1%) 15
["findfirst", "50%", "base"] 292.301 μs (5%)
["findfirst", "50%", "tx"] 255.401 μs (5%) 37.70 KiB (1%) 698
["findfirst", "50%", "tx-noterm"] 288.501 μs (5%) 53.91 KiB (1%) 993
["findfirst", "50%", "tx-seq"] 293.301 μs (5%) 560 bytes (1%) 15
["foreach", "base", "A .= B .+ B'"] 458.028 ms (5%) 39.227 ms 305.18 MiB (1%) 16000002
["foreach", "base", "A .= B .+ C"] 257.444 ms (5%) 37.023 ms 305.18 MiB (1%) 16000001
["foreach", "broadcast", "A .= B .+ B'"] 16.879 ms (5%)
["foreach", "broadcast", "A .= B .+ C"] 9.517 ms (5%)
["foreach", "tx", "A .= B .+ B'"] 8.041 ms (5%) 25.92 KiB (1%) 359
["foreach", "tx", "A .= B .+ C"] 5.105 ms (5%) 12.75 KiB (1%) 124
["foreach_seq", "base", "Matrix"] 560.803 μs (5%)
["foreach_seq", "base", "Transpose"] 1.963 ms (5%)
["foreach_seq", "base", "Vector"] 560.803 μs (5%)
["foreach_seq", "tx", "Matrix"] 564.403 μs (5%)
["foreach_seq", "tx", "Transpose"] 904.205 μs (5%) 16 bytes (1%) 1
["foreach_seq", "tx", "Vector"] 561.003 μs (5%)
["foreach_seq_double", "cartesian", "man"] 19.800 μs (5%)
["foreach_seq_double", "cartesian", "tx", ":simd => :ivdep"] 20.300 μs (5%)
["foreach_seq_double", "cartesian", "tx", ":simd => false"] 20.400 μs (5%)
["foreach_seq_double", "cartesian", "tx", ":simd => true"] 20.500 μs (5%)
["foreach_seq_double", "linear", "man"] 104.258 ns (5%)
["foreach_seq_double", "linear", "tx", ":simd => :ivdep"] 100.000 ns (5%)
["foreach_seq_double", "linear", "tx", ":simd => false"] 100.000 ns (5%)
["foreach_seq_double", "linear", "tx", ":simd => true"] 100.000 ns (5%)
["foreach_seq_sum_many", ":nvecs => 8", "man"] 1.500 μs (5%)
["foreach_seq_sum_many", ":nvecs => 8", "tx", ":simd => :ivdep"] 1.600 μs (5%)
["foreach_seq_sum_many", ":nvecs => 8", "tx", ":simd => false"] 2.500 μs (5%)
["foreach_seq_sum_many", ":nvecs => 8", "tx", ":simd => true"] 2.500 μs (5%)
["sort", "F64 (narrow)", "Base"] 2.241 ms (5%)
["sort", "F64 (narrow)", "ThreadsX.MergeSort"] 2.552 ms (5%) 1.19 MiB (1%) 535
["sort", "F64 (narrow)", "ThreadsX.QuickSort"] 1.549 ms (5%) 965.11 KiB (1%) 1226
["sort", "F64 (narrow)", "ThreadsX.StableQuickSort"] 1.553 ms (5%) 1.02 MiB (1%) 1246
["sort", "F64 (wide)", "Base"] 5.549 ms (5%)
["sort", "F64 (wide)", "ThreadsX.MergeSort"] 4.829 ms (5%) 1.19 MiB (1%) 564
["sort", "F64 (wide)", "ThreadsX.QuickSort"] 5.033 ms (5%) 1.01 MiB (1%) 2147
["sort", "F64 (wide)", "ThreadsX.StableQuickSort"] 5.647 ms (5%) 1.39 MiB (1%) 2193
["sort", "I64 (narrow)", "Base"] 128.700 μs (5%) 160 bytes (1%) 1
["sort", "I64 (narrow)", "ThreadsX.MergeSort"] 131.701 μs (5%) 864 bytes (1%) 17
["sort", "I64 (narrow)", "ThreadsX.QuickSort"] 131.601 μs (5%) 864 bytes (1%) 17
["sort", "I64 (narrow)", "ThreadsX.StableQuickSort"] 131.500 μs (5%) 864 bytes (1%) 17
["sort", "I64 (wide)", "Base"] 5.473 ms (5%)
["sort", "I64 (wide)", "ThreadsX.MergeSort"] 4.112 ms (5%) 1.19 MiB (1%) 554
["sort", "I64 (wide)", "ThreadsX.QuickSort"] 4.196 ms (5%) 1.01 MiB (1%) 2236
["sort", "I64 (wide)", "ThreadsX.StableQuickSort"] 4.772 ms (5%) 1.40 MiB (1%) 2272
["sort", "reversed", "Base"] 653.704 μs (5%)
["sort", "reversed", "ThreadsX.MergeSort"] 1.214 ms (5%) 1.18 MiB (1%) 434
["sort", "reversed", "ThreadsX.QuickSort"] 1.135 ms (5%) 998.77 KiB (1%) 1872
["sort", "reversed", "ThreadsX.StableQuickSort"] 1.547 ms (5%) 1.36 MiB (1%) 1902
["sort", "sorted", "Base"] 610.203 μs (5%)
["sort", "sorted", "ThreadsX.MergeSort"] 863.905 μs (5%) 1.18 MiB (1%) 431
["sort", "sorted", "ThreadsX.QuickSort"] 1.188 ms (5%) 998.75 KiB (1%) 1871
["sort", "sorted", "ThreadsX.StableQuickSort"] 1.253 ms (5%) 1.36 MiB (1%) 1903
["unique", "rand(1:10, 1000000)", "base"] 8.917 ms (5%) 832 bytes (1%) 8
["unique", "rand(1:10, 1000000)", "tx"] 4.687 ms (5%) 50.97 KiB (1%) 881
["unique", "rand(1:1000, 1000000)", "base"] 8.222 ms (5%) 65.95 KiB (1%) 27
["unique", "rand(1:1000, 1000000)", "tx"] 5.122 ms (5%) 1.07 MiB (1%) 1186

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["findfirst", "0%"]
  • ["findfirst", "10%"]
  • ["findfirst", "20%"]
  • ["findfirst", "30%"]
  • ["findfirst", "40%"]
  • ["findfirst", "50%"]
  • ["foreach", "base"]
  • ["foreach", "broadcast"]
  • ["foreach", "tx"]
  • ["foreach_seq", "base"]
  • ["foreach_seq", "tx"]
  • ["foreach_seq_double", "cartesian"]
  • ["foreach_seq_double", "cartesian", "tx"]
  • ["foreach_seq_double", "linear"]
  • ["foreach_seq_double", "linear", "tx"]
  • ["foreach_seq_sum_many", ":nvecs => 8"]
  • ["foreach_seq_sum_many", ":nvecs => 8", "tx"]
  • ["sort", "F64 (narrow)"]
  • ["sort", "F64 (wide)"]
  • ["sort", "I64 (narrow)"]
  • ["sort", "I64 (wide)"]
  • ["sort", "reversed"]
  • ["sort", "sorted"]
  • ["unique", "rand(1:10, 1000000)"]
  • ["unique", "rand(1:1000, 1000000)"]

Julia versioninfo

Julia Version 1.4.2
Commit 44fa15b150* (2020-05-23 18:35 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.3.0-1028-azure #29~18.04.1-Ubuntu SMP Fri Jun 5 14:32:34 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      74290 s          0 s       3229 s      45116 s          0 s
       #2  2294 MHz      81210 s          0 s       3950 s      37510 s          0 s
       
  Memory: 6.764884948730469 GB (2449.16015625 MB free)
  Uptime: 1249.0 sec
  Load Avg:  1.37353515625  1.38525390625  1.10302734375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:            1
CPU MHz:             2294.684
BogoMIPS:            4589.36
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt md_clear
Cpu Property Value
Brand Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Vendor :Intel
Architecture :Broadwell
Model Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 256, 51200) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@mergify mergify Bot merged commit 7362ea8 into master Jun 28, 2020
@mergify mergify Bot deleted the actions/trigger/pkg-update branch June 28, 2020 08:11
mergify Bot pushed a commit to JuliaFolds/Transducers.jl that referenced this pull request Jul 1, 2020
Using SSH-based configuration (which quires create-pull-request v2) so
that commits from this workflow will trigger actions.  With `ssh-key`
is passed to actions/checkout@v2, git commands cannot be used inside
docker anymore (tkf/julia-merge-except-manifests@v1).  So, just
inlining the script for now.

This is based on:
tkf/ThreadsX.jl#112
tkf/ThreadsX.jl#114
tkf/ThreadsX.jl#115
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant