Skip to content

[VL] Spark 4.x: Enable Structured Streaming test suites #11911

@baibaichen

Description

@baibaichen

Backend

VL (Velox)

Bug description

20 Structured Streaming test suites are disabled (TODO) for both Spark 4.0 and 4.1. Additionally, 3 already-enabled streaming suites have 11 excludes related to streaming API changes.

Goal: Enable SS test suites to run with GlutenPlugin loaded, allowing fallback to vanilla Spark where needed.

Parent issue: #11550

Disabled suites (20)

19 suites use GlutenSQLTestsTrait, 1 uses GlutenTestsCommonTrait.

# Suite Priority
1 GlutenFileStreamSinkV2Suite Simple (1 failure)
2 GlutenMultiStatefulOperatorsSuite Simple (2 failures / 10 tests)
3 GlutenStreamingQueryHashPartitionVerifySuite Simple (1 test, needs SPARK_HOME)
4 GlutenEventTimeWatermarkSuite Medium
5 GlutenFileStreamSourceSuite Medium
6 GlutenStreamSuite Medium (~66 tests)
7 GlutenStreamingAggregationSuite Medium
8 GlutenStreamingAggregationDistributionSuite Medium
9 GlutenStreamingDeduplicationSuite Medium
10 GlutenStreamingDeduplicationDistributionSuite Medium
11 GlutenStreamingInnerJoinSuite Medium
12 GlutenStreamingOuterJoinSuite Medium
13 GlutenStreamingSessionWindowDistributionSuite Medium
14 GlutenStreamingStateStoreFormatCompatibilitySuite Medium
15 GlutenFlatMapGroupsWithStateSuite Medium
16 GlutenFlatMapGroupsWithStateDistributionSuite Medium
17 GlutenFlatMapGroupsInPandasWithStateDistributionSuite Complex (Python/Pandas)
18 GlutenRocksDBStateStoreFlatMapGroupsWithStateSuite Follows #15
19 GlutenRocksDBStateStoreStreamingAggregationSuite Follows #7
20 GlutenRocksDBStateStoreStreamingDeduplicationSuite Follows #9

Excludes in already-enabled suites (from #11400)

  • GlutenStreamRealTimeModeAllowlistSuite (3 excludes)
  • GlutenStreamRealTimeModeE2ESuite (7 excludes)
  • GlutenStreamRealTimeModeSuite (1 exclude)
  • SPARK-53942 stateful shuffle partitions (2 excludes)

Root causes

  1. Plan assertion failures — GlutenPlugin replaces ShuffleExchangeExec with ColumnarShuffleExchangeExec, etc. Tests asserting specific plan nodes fail.
  2. Checkpoint resource loading — Golden files contain plan structures that don't match Gluten-transformed plans.
  3. Spark 4.1 streaming API changes — SPARK-53941 (AQE), SPARK-53233 (package refactor), etc.

Notes

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type
    No fields configured for issues without a type.

    Projects

    Status

    No status

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions