Spark 3.4: Handle skew in writes #7520
Conversation
```java
}

@Test
public void testSkewDelete() throws Exception {
```
Tests for CoW row-level operations already cover SparkWrite, which is used in normal writes. There is not much logic on the Iceberg side; the rest is covered by Spark tests.
This is just for 3.4 because of the new rebalance code for writes, right?

@RussellSpitzer, correct. This API does not exist in 3.3.
```java
@Override
public boolean distributionStrictlyRequired() {
  return false;
}
```
I may actually need to move it to SparkWriteBuilder, as SparkWrite is used for compaction. We explicitly disable table distribution/ordering and AQE in shuffling rewriters, but not in bin-pack when the output spec does not match.
Thoughts, @RussellSpitzer?
I was going to say I don't really mind our compaction solution at the moment. I think disabling AQE is our best bet there.
Let's stick to that then, I agree.
```java
// that means there are 4 shuffle blocks, all assigned to the same reducer
// AQE detects that all 4 shuffle blocks are big and processes them in 4 separate tasks
// otherwise, there would be 1 task processing 4 shuffle blocks
int addedFiles = Integer.parseInt(summary.get(SnapshotSummary.ADDED_DELETE_FILES_PROP));
```
[minor] can use PropertyUtil here
I did not use PropertyUtil as it requires a default value and won't fit on one line. I can switch, though.
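For reference, Iceberg's PropertyUtil.propertyAsInt takes the properties map, the key, and a mandatory default, which is what makes the call longer than a bare Integer.parseInt. A minimal stdlib sketch of the same shape (PropertyUtilSketch and the summary keys here are illustrative stand-ins, not the real Iceberg class):

```java
import java.util.Map;

public class PropertyUtilSketch {

  // Same shape as Iceberg's PropertyUtil.propertyAsInt(Map, String, int):
  // the default value is a required third argument.
  public static int propertyAsInt(Map<String, String> properties, String property, int defaultValue) {
    String value = properties.get(property);
    return value != null ? Integer.parseInt(value) : defaultValue;
  }

  public static void main(String[] args) {
    Map<String, String> summary = Map.of("added-data-files", "4");

    // Integer.parseInt(summary.get(key)) would throw a NumberFormatException
    // on a missing key; the helper falls back to the default instead.
    System.out.println(propertyAsInt(summary, "added-data-files", 0));   // prints 4
    System.out.println(propertyAsInt(summary, "added-delete-files", 0)); // prints 0
  }
}
```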
```java
int addedFiles = Integer.parseInt(summary.get(SnapshotSummary.ADDED_FILES_PROP));
Assert.assertEquals("Must produce 4 files", 4, addedFiles);
```
[minor] Can this be moved to a private function, e.g. assertAddedFiles, to use in both MOR / COW?
Let me add something.
There was an existing validateProperty, which I forgot about. I switched to that.
```java
@Override
public boolean distributionStrictlyRequired() {
  return false;
}
```
Should we also check that ADAPTIVE_OPTIMIZE_SKEWS_IN_REBALANCE_PARTITIONS_ENABLED is true before disabling this requirement? Otherwise it will be a no-op for OptimizeSkewInRebalancePartitions.
Is there ever a good reason to return true from this method? We don't require distributions to be strict and it is up to Spark to either handle the skew or not.
Agree. I was mostly coming from the point that we are overriding this and setting it to false in the hope that Spark will optimize the skew, whereas if the above conf is disabled, Spark will never do so. I am fine with keeping it as it is.
I feel it is better to always return false and leave it up to Spark. It seems the safest way as Spark may add new configs or logic on when to do that in the future.
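For context, the discussion above hinges on Spark's AQE settings. A minimal sketch of the configs involved (the names are standard Spark SQL configs; the values shown match the defaults in recent Spark releases, so treat the exact defaults as an assumption):

```properties
# AQE must be enabled for any skew handling to apply
spark.sql.adaptive.enabled=true
# Allows AQE to split large shuffle partitions produced by a rebalance
# (what the OptimizeSkewInRebalancePartitions rule does)
spark.sql.adaptive.optimizeSkewsInRebalancePartitions.enabled=true
# Target partition size AQE aims for when splitting or coalescing
spark.sql.adaptive.advisoryPartitionSizeInBytes=64m
```

If either of the first two settings is off, a non-strict distribution is simply a best-effort request, which matches the "leave it up to Spark" conclusion above.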
```java
// AQE detects that all shuffle blocks are big and processes them in 4 independent tasks
// otherwise, there would be 2 tasks processing 2 shuffle blocks each
```
[doubt] Should we also add a UT where coalescing is happening?
I was planning to do so in a separate PR. This change focuses on skew.
singhpk234 left a comment:
LGTM as well. Thanks, @aokolnychyi!
Thanks for reviewing, @singhpk234 @RussellSpitzer!
This PR enables AQE to handle skew in writes.
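As a hedged illustration of the end-to-end effect (the catalog, table, and column names here are hypothetical), a row-level operation against a skewed Iceberg table can now be rebalanced by AQE instead of being funneled through one reducer per output partition:

```sql
-- Hypothetical session: with the non-strict distribution requested by this
-- change, AQE may split a single hot shuffle partition into several write
-- tasks rather than processing it in one task.
SET spark.sql.adaptive.enabled=true;
SET spark.sql.adaptive.optimizeSkewsInRebalancePartitions.enabled=true;

DELETE FROM catalog.db.events WHERE event_type = 'click';
```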