feat: split Parquet files into row-group-sized morsels #10
adriangb wants to merge 2 commits into adriangb:main from row-group-morsel-split
Conversation
|
run benchmarks
baseline:
  ref: main
  env:
    DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: false
    DATAFUSION_EXECUTION_PARQUET_REORDER_FILTERS: false
changed:
  ref: HEAD
  env:
    DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: false |
|
run benchmarks env:
DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: true
DATAFUSION_EXECUTION_PARQUET_REORDER_FILTERS: true |
|
run benchmarks env:
DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: false
DATAFUSION_EXECUTION_PARQUET_REORDER_FILTERS: false |
|
cancel benchmark #10 (comment) |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (311a854) to afc0784 (merge-base), diff using: clickbench_partitioned | File an issue against this benchmark runner |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing HEAD (311a854) to main, diff using: tpcds |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (311a854) to afc0784 (merge-base), diff using: tpcds |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (311a854) to afc0784 (merge-base), diff using: tpch |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing HEAD (311a854) to main, diff using: tpch |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing HEAD (311a854) to main, diff using: clickbench_partitioned |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: clickbench_partitioned — base (merge-base); clickbench_partitioned — branch |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: clickbench_partitioned — base (merge-base); clickbench_partitioned — branch |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: tpcds — base (merge-base); tpcds — branch |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (311a854) to afc0784 (merge-base), diff using: tpcds |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (311a854) to afc0784 (merge-base), diff using: clickbench_partitioned |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (311a854) to afc0784 (merge-base), diff using: tpch |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: tpcds — base (merge-base); tpcds — branch |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: tpcds — base (merge-base); tpcds — branch |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: clickbench_partitioned — base (merge-base); clickbench_partitioned — branch |
Root cause: stale base, not a morsel-split regression
The apparent regression is a comparison artifact — this branch is based on an older main.
Local reproduction (5 iterations, M-series)
The branch's parent is already ~82ms on Q24; apache#21351 alone takes it to 39ms. This PR neither helps nor hurts relative to its (pre-apache#21351) base — it is simply missing the speedup.
Investigation path
Fix
Rebase onto current main.
🤖 Generated with Claude Code |
|
Benchmark for this request hit the 7200s job deadline before finishing. Benchmarks requested: | Kubernetes message |
|
Benchmark for this request hit the 7200s job deadline before finishing. Benchmarks requested: | Kubernetes message |
|
Benchmark for this request hit the 7200s job deadline before finishing. Benchmarks requested: | Kubernetes message |
Force-pushed 311a854 to 5b0a69a (Compare)
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (5b0a69a) to 7acbe03 (merge-base), diff using: tpcds |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: clickbench_partitioned — base (merge-base); clickbench_partitioned — branch |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: clickbench_partitioned — base (merge-base); clickbench_partitioned — branch |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: tpcds — base (merge-base); tpcds — branch |
|
🤖 Benchmark completed (GKE) | trigger | Instance: CPU Details (lscpu) | Details | Resource Usage: tpcds — base (merge-base); tpcds — branch |
Each Parquet file previously produced a single morsel containing one `ParquetPushDecoder` over the full pruned `ParquetAccessPlan`. Morselize at row-group granularity instead: after all pruning work is done, pack surviving row groups into chunks bounded by a per-morsel row budget and compressed-byte budget (defaults: 100k rows, 64 MiB). Each chunk becomes its own stream so the executor can interleave row-group decode work with other operators and — in a follow-up — let sibling `FileStream`s steal row-group-sized units of work across partitions.

A single oversized row group still becomes its own morsel; no sub-row-group splitting is introduced.

`EarlyStoppingStream` (which is driven by the non-Clone `FilePruner`) is attached only to the first morsel's stream so the whole file can still short-circuit on dynamic-filter narrowing. Row-group reversal is applied per-chunk on the `PreparedAccessPlan` and the chunk list is reversed so reverse output order is preserved.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
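The chunk-packing step described above can be sketched as a greedy pass over the surviving row groups. This is a minimal illustration, not the PR's actual code: `RowGroup` and `pack_into_morsels` are invented names, and the real implementation works on a `ParquetAccessPlan` rather than a plain slice.

```rust
/// Hypothetical stand-in for a surviving row group's stats.
#[derive(Debug, Clone, Copy)]
struct RowGroup {
    index: usize,
    num_rows: u64,
    compressed_bytes: u64,
}

/// Greedily pack row groups into chunks bounded by a row budget and a
/// compressed-byte budget. A chunk is closed as soon as adding the next
/// row group would exceed either budget; a single oversized row group
/// still forms its own one-element chunk (no sub-row-group splitting).
fn pack_into_morsels(
    row_groups: &[RowGroup],
    row_budget: u64,
    byte_budget: u64,
) -> Vec<Vec<usize>> {
    let mut chunks: Vec<Vec<usize>> = Vec::new();
    let mut current: Vec<usize> = Vec::new();
    let (mut rows, mut bytes) = (0u64, 0u64);
    for rg in row_groups {
        // Only close a non-empty chunk; an oversized group must still land somewhere.
        let would_overflow = !current.is_empty()
            && (rows + rg.num_rows > row_budget
                || bytes + rg.compressed_bytes > byte_budget);
        if would_overflow {
            chunks.push(std::mem::take(&mut current));
            rows = 0;
            bytes = 0;
        }
        current.push(rg.index);
        rows += rg.num_rows;
        bytes += rg.compressed_bytes;
    }
    if !current.is_empty() {
        chunks.push(current);
    }
    chunks
}
```

With the PR's defaults this would be called with `row_budget = 100_000` and `byte_budget = 64 * 1024 * 1024`; reverse output order would then be handled by reversing both each chunk's row groups and the chunk list itself, as the commit message notes.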
Force-pushed 5b0a69a to ff805cf (Compare)
|
Benchmark for this request hit the 7200s job deadline before finishing. Benchmarks requested: | Kubernetes message |
|
Benchmark for this request hit the 7200s job deadline before finishing. Benchmarks requested: | Kubernetes message |
The previous `build_stream` built every morsel's `RowFilter`, `ParquetPushDecoder`, `AsyncFileReader`, and `Projector` eagerly in a single loop inside the file planner — before any morsel was scheduled. That loop ran on the scheduler thread and was visible as a 10–15% regression vs. main on ClickBench-partitioned queries that have many row-group morsels per file (e.g. Q15, Q16 at pushdown=off).

Replace `ParquetStreamMorsel` (which held a pre-built `BoxStream`) with `ParquetLazyMorsel`, which holds only the per-chunk `ParquetAccessPlan` plus an `Arc<LazyMorselShared>` of the file-level state. The decoder and reader are constructed inside `Morsel::into_stream`, so each morsel pays its setup cost only when the scheduler actually picks it up, and the work is distributed across worker threads instead of serialised on the planner. `FilePruner` is `!Clone` and drives whole-file early-stop via `EarlyStoppingStream`, so it still lives on chunk 0's morsel only.

The warm `async_file_reader` from metadata / page-index / bloom-filter load is dropped at the end of `build_stream` — every morsel mints a fresh reader via the factory at `into_stream` time. For both built-in factories (`DefaultParquetFileReaderFactory`, `CachedParquetFileReaderFactory`) the "warm cache" benefit of reusing a reader is negligible because the underlying `Arc<dyn ObjectStore>` / `Arc<dyn FileMetadataCache>` is already shared across readers, so the simplification is free.

Local ClickBench-partitioned, 10 iterations, pushdown=off (M-series), times in ms:

| Query | main | eager (before) | lazy (this commit) |
|-------|-----:|---------------:|-------------------:|
| Q14   |  325 |            335 |                313 |
| Q15   |  309 |            358 |                302 |
| Q16   |  911 |           1049 |                786 |
| Q24   |   48 |             55 |                 56 |
| Q26   |   41 |             45 |                 45 |

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
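The eager-vs-lazy distinction in this commit can be illustrated with a small sketch. All names here (`SharedFileState`, `LazyMorsel`, `Decoder`, `plan_file`) are invented stand-ins for `LazyMorselShared`, `ParquetLazyMorsel`, and the decoder/reader machinery; the point is only the shape: planning clones an `Arc` of file-level state, and the expensive per-morsel construction happens inside `into_stream`.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;

/// File-level state shared by all morsels of one file (stand-in for
/// metadata, projections, the reader factory, etc.).
struct SharedFileState {
    // Counts decoder constructions so the deferral is observable.
    decoders_built: AtomicUsize,
}

/// One morsel: a chunk of row-group indexes plus a handle to the shared
/// state. Crucially, no decoder or reader is held here.
struct LazyMorsel {
    row_groups: Vec<usize>,
    shared: Arc<SharedFileState>,
}

/// Stand-in for the expensive per-morsel machinery (decoder + reader).
struct Decoder {
    row_groups: Vec<usize>,
}

impl LazyMorsel {
    /// All expensive setup happens here, on the worker thread that the
    /// scheduler hands the morsel to — not on the planner.
    fn into_stream(self) -> Decoder {
        self.shared.decoders_built.fetch_add(1, Ordering::Relaxed);
        Decoder { row_groups: self.row_groups }
    }
}

/// Planning only packs chunks and clones the Arc; it builds zero decoders.
fn plan_file(chunks: Vec<Vec<usize>>) -> (Arc<SharedFileState>, Vec<LazyMorsel>) {
    let shared = Arc::new(SharedFileState {
        decoders_built: AtomicUsize::new(0),
    });
    let morsels = chunks
        .into_iter()
        .map(|row_groups| LazyMorsel {
            row_groups,
            shared: Arc::clone(&shared),
        })
        .collect();
    (shared, morsels)
}
```

In the real code the `!Clone` `FilePruner` would additionally be moved into chunk 0's morsel only, so the first stream can still wrap itself in `EarlyStoppingStream` and short-circuit the whole file.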
|
Updated to the lazy-morsel variant. The new commit replaces the eager per-morsel setup with construction deferred to `into_stream`. Local 10-iter ClickBench-partitioned, pushdown=off (M-series):
Full analysis + pushdown=on numbers in the isolation comment on PR #9. 🤖 Generated with Claude Code |
|
run benchmarks env:
DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: true
DATAFUSION_EXECUTION_PARQUET_REORDER_FILTERS: true |
|
run benchmarks env:
DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: false
DATAFUSION_EXECUTION_PARQUET_REORDER_FILTERS: false |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (8ef53b7) to 9a1ed57 (merge-base), diff using: clickbench_partitioned |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (8ef53b7) to 9a1ed57 (merge-base), diff using: tpcds |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (8ef53b7) to 9a1ed57 (merge-base), diff using: tpch |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (8ef53b7) to 9a1ed57 (merge-base), diff using: tpch |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (8ef53b7) to 9a1ed57 (merge-base), diff using: tpcds |
|
🤖 Benchmark running (GKE) | trigger | CPU Details (lscpu) | Comparing row-group-morsel-split (8ef53b7) to 9a1ed57 (merge-base), diff using: clickbench_partitioned |
|
Benchmark for this request hit the 7200s job deadline before finishing. Benchmarks requested: | Kubernetes message |
|
Benchmark for this request hit the 7200s job deadline before finishing. Benchmarks requested: | Kubernetes message |
Which issue does this PR close?
Rationale for this change
What changes are included in this PR?
Are these changes tested?
Are there any user-facing changes?