Make intermediate store for shuffle tasks an extension point by maytasm · Pull Request #11492 · apache/druid

maytasm · 2021-07-23T15:42:20Z

Make intermediate store for shuffle tasks an extension point

Description

This PR makes IntermediaryDataManager and ShuffleClient an interface with method that can be implements by extension to customize intermediate storage location for shuffle tasks. For example, implementation can be added via extensions to support different cloud storages to store intermediate data for shuffle tasks.

More details: #11297

Note that IntermediaryDataManager has been renamed to LocalIntermediaryDataManager with a few additional changes:

Update Javadoc
Add @Override for methods that are now interfaced by (new) IntermediaryDataManager interface class.
findPartitionFile changed to return Optional instead of File

This PR has:

been self-reviewed.
- using the concurrency checklist (Remove this item if the PR doesn't have any relation to concurrency.)
added documentation for new or modified features or behaviors.
added Javadocs for most classes and all non-trivial methods. Linked related entities via Javadoc links.
added or updated version, license, or notice information in licenses.yaml
added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
added unit tests or modified existing tests to cover new code paths, ensuring the threshold for code coverage is met.
added integration tests.
been tested in a test Druid cluster.

lgtm-com · 2021-07-23T18:00:58Z

This pull request introduces 1 alert when merging caabe48 into c98e7c3 - view on LGTM.com

new alerts:

1 for Cross-site scripting

lgtm-com · 2021-07-23T21:08:29Z

This pull request introduces 1 alert when merging 3a7a2d6 into c98e7c3 - view on LGTM.com

new alerts:

1 for Cross-site scripting

lgtm-com · 2021-07-26T08:39:07Z

This pull request introduces 1 alert when merging 6086f65 into fcb908d - view on LGTM.com

new alerts:

1 for Cross-site scripting

suneet-s · 2021-07-26T18:38:23Z

-          }
-      ).build();
+      try {
+        long size = partitionFile.get().size();


This will potentially read the entire file to get the size. I believe partitionFile.length() is just a metadata lookup on the file to get the length.

partitionFile is of type Optional<ByteSource>. The get is to get the ByteSource from the wrapping Optional class.

I was looking at the default implementation of ByteSource#size() which opens the stream, reads it and counts the bytes. It looks like FileByteSource has overridden that implementation to return the length of the File without reading the bytes.

At this point, I'm mostly concerned about whether someone can trip over this by changing the implementation and not realizing that this operation should be fast. Anyways, that's a problem for another time.

The LocalIntermediaryDataManager uses Files.asByteSource() which returns FileByteSource which do file.length() which is the same as current behavior. The extension (that implements IntermediaryDataManager) can extends ByteSource and provide a method of returning size efficiently. I added javadoc to indicate this

suneet-s · 2021-07-26T18:43:20Z

            JsonConfigProvider.bind(binder, "druid", DruidNode.class, Parent.class);
            JsonConfigProvider.bind(binder, "druid.worker", WorkerConfig.class);

+            CliPeon.configureIntermediaryData(binder);


Are there any integration tests that verify shuffle with indexers continue to work after this change? I haven't looked at the existing integration tests closely

There are many ITs that run indexers with the new config unset (which basically fallback to using "local" storage for storing intermediary segments via LocalIntermediaryDataManager). Specifically, input source integration test with Indexer runs ingestion with Hashed partitioning and maxNumConcurrentSubTasks=10, which would run the ingestion in two phases (first phase which persist to local using LocalIntermediaryDataManager and second phase which reads segments from first phase). Similarly, there are also some other ITs in compaction/auto compaction that uses Hashed partitioning.

suneet-s

LGTM

maytasm · 2021-07-26T18:55:33Z

The LGTM is a false positive. We would only return the supervisorTaskId (from the request) in the response if the request succeeded. For the request to succeeded, we would have verified that the supervisorTaskId is valid via IdUtils.validateId("supervisorTaskId", supervisorTaskId);

lgtm-com · 2021-07-26T20:18:21Z

This pull request introduces 1 alert when merging aa2a2c2 into fcb908d - view on LGTM.com

new alerts:

1 for Cross-site scripting

maytasm · 2021-07-27T04:29:39Z

Merging this in as the design proposal (#11297), proposed by @pjain1, also have two +1s by myself and @suneet-s

pjain1 · 2021-07-27T04:53:35Z

missed this, @maytasm are you already working on support for deep storage as intermediate data store as I am also in middle of implementing it.

pjain1 · 2021-07-28T08:12:22Z

Deep store support feature - #11507

maytasm added 2 commits July 23, 2021 22:17

add interface

45db2f3

add docs

caabe48

fix errors

3a7a2d6

asdf2014 added the Area - Batch Ingestion label Jul 24, 2021

maytasm added 2 commits July 26, 2021 14:18

fix injection

d5e622b

fix injection

6086f65

suneet-s reviewed Jul 26, 2021

View reviewed changes

suneet-s approved these changes Jul 26, 2021

View reviewed changes

update javadoc

aa2a2c2

maytasm mentioned this pull request Jul 26, 2021

Using deep storage as intermediate store for shuffle tasks #11297

Closed

maytasm merged commit c068906 into apache:master Jul 27, 2021

maytasm deleted the IMPLY-8622 branch July 27, 2021 04:29

clintropolis added this to the 0.22.0 milestone Aug 12, 2021

Conversation

maytasm commented Jul 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

lgtm-com Bot commented Jul 23, 2021

Uh oh!

lgtm-com Bot commented Jul 23, 2021

Uh oh!

lgtm-com Bot commented Jul 26, 2021

Uh oh!

suneet-s Jul 26, 2021

Choose a reason for hiding this comment

Uh oh!

maytasm Jul 26, 2021

Choose a reason for hiding this comment

Uh oh!

suneet-s Jul 26, 2021

Choose a reason for hiding this comment

Uh oh!

maytasm Jul 26, 2021

Choose a reason for hiding this comment

Uh oh!

suneet-s Jul 26, 2021

Choose a reason for hiding this comment

Uh oh!

maytasm Jul 26, 2021

Choose a reason for hiding this comment

Uh oh!

suneet-s left a comment

Choose a reason for hiding this comment

Uh oh!

maytasm commented Jul 26, 2021

Uh oh!

lgtm-com Bot commented Jul 26, 2021

Uh oh!

maytasm commented Jul 27, 2021

Uh oh!

pjain1 commented Jul 27, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pjain1 commented Jul 28, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

maytasm commented Jul 23, 2021 •

edited

Loading

pjain1 commented Jul 27, 2021 •

edited

Loading

pjain1 commented Jul 28, 2021 •

edited

Loading