-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-45393][BUILD] Upgrade Hadoop to 3.4.0 #45583
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Hi, @LuciferYang . I tried to search your previous PR , but couldn't find it. So, I recreated with your coauthor ship here. If you want to re-open your PR, please let me know, @LuciferYang . |
|
BTW, I'll add the following based on the failed module, if needed. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
.... surprising
Thanks @dongjoon-hyun ~ Let's use this one ~ |
|
Got it. Thank you, @LuciferYang . |
5c301dc to
d960d98
Compare
fbe5d40 to
179848f
Compare
179848f to
2358f85
Compare
LuciferYang
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM (pending test)
|
Thank you, @LuciferYang . For this PR, I fixed the following three so far, but I guess there is one more to go. Let's see the CI result. |
ab86d62 to
798c5a1
Compare
|
|
||
| test("running Spark in yarn-cluster mode displays driver log links") { | ||
| // TODO(SPARK-47491): Re-enable `driver log links` test in YarnClusterSuite | ||
| ignore("running Spark in yarn-cluster mode displays driver log links") { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll handle this later in SPARK-47491 because YARN-only test PR doesn't cause a full GitHub Action. It's more easier to fix it. Currently, this PR causes the full CI builds and it makes the investigation difficult.
|
At the previous commit (https://github.com/dongjoon-hyun/spark/runs/22865082407), we already passed all tests except one YARN failure. And, the failed test case is ignored here. Let me merge this to move forward. Thank you, @LuciferYang and @yaooqinn . |
### What changes were proposed in this pull request? This PR aims to ban `AWS SKD for Java v1`. We migrated to v2 via the following. - #45583 - #43510 ### Why are the changes needed? To ensure the migration to AWS SDK for Java v2 because of the following the end of support schedule. `v2` is strongly recommended since July. - https://aws.amazon.com/blogs/developer/announcing-end-of-support-for-aws-sdk-for-java-v1-x-on-december-31-2025/ > AWS SDK for Java v1.x will enter maintenance mode on July 31, 2024, and reach end-of-support on December 31, 2025. ### Does this PR introduce _any_ user-facing change? No, this PR only prevents mixing this old dependency in the future. ### How was this patch tested? Pass the CIs. ### Was this patch authored or co-authored using generative AI tooling? No. Closes #45759 from dongjoon-hyun/SPARK-47632. Authored-by: Dongjoon Hyun <dhyun@apple.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>
|
Sorry to disturb everyone, but when I execute
When using Hadoop 3.4.0, although there were no test failures, an |
|
To @LuciferYang , have you try other combination like
In other words, I'm wondering if you are reporting that the result before this commit and after this commit, @LuciferYang . For your claim, could you share us that result? |
|
@dongjoon-hyun Test on master: before this commit: result: after this commit: result: screenshot of git log: |
|
Ack. Thank you for sharing. Let me take a look at that as a independent JIRA issue because it's Mac only issue, @LuciferYang . |
|
SPARK-49055 is filed, @LuciferYang . |
@dongjoon-hyun I apologize for providing misleading information. I just reviewed the recent GA test logs and I found that this is not a Mac Only issue: |
|
To @LuciferYang , according to the Hadoop code, HADOOP-17982 seems to change the log level only at Hadoop 3.4.0. |
|
There is no functional change in Hadoop code since last 10 year except the above log level change. Given that, we can ignore the warning message. WDYT, @LuciferYang ? |
|
@dongjoon-hyun Thank you for your explanation, agree with you |



What changes were proposed in this pull request?
This PR aims to upgrade to Apache Hadoop 3.4.0 for Apache Spark 4.0.0.
Why are the changes needed?
To bring the new features like the following
S3 Express One ZoneS3 Express One ZoneS3 on OutpostsDoes this PR introduce any user-facing change?
No.
How was this patch tested?
Pass the CIs.
Was this patch authored or co-authored using generative AI tooling?
No.