Skip to content

Add authentication call before cleaning up intermediate files in hadoop ingestions#12030

Merged
capistrant merged 4 commits intoapache:masterfrom
capistrant:index-hadoop-authenticate-before-cleanup
May 2, 2022
Merged

Add authentication call before cleaning up intermediate files in hadoop ingestions#12030
capistrant merged 4 commits intoapache:masterfrom
capistrant:index-hadoop-authenticate-before-cleanup

Conversation

@capistrant
Copy link
Copy Markdown
Contributor

@capistrant capistrant commented Dec 6, 2021

Description

Druid Version: 0.22.0
Hadoop Version: 3.3.0

Fixed issue observed in hadoop ingestion intermediate file cleanup

I observed my hadoop tasks running against my kerberized hadoop cluster having authentication failures when trying to cleanup intermediate files during hadoop ingestion after upgrading to 0.22.X (from 0.20.X). This is a similar issue to what I had seen in the past when the job went into a graceful shutdown of a hadoop ingestion job (#9785). I'm suspicious that moving the file deletes into their own static JobHelper method introduced this experience I am seeing (#11194). The call to authenticate is idempotent. If the authentication is already done, no action will be taken. I added a debug log to help people trouble shooting potential issues in the future.


Key changed/added classes in this PR
  • JobHelper

This PR has:

  • been self-reviewed.
  • added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
  • been tested in a test Druid cluster.

@capistrant capistrant requested a review from zachjsh December 29, 2021 19:50
@capistrant
Copy link
Copy Markdown
Contributor Author

capistrant commented Dec 29, 2021

@zachjsh could you take a peek at this when you have a chance? I'm not positive but am thinking it is related to a change you made in the past

@stale
Copy link
Copy Markdown

stale Bot commented Apr 17, 2022

This pull request has been marked as stale due to 60 days of inactivity. It will be closed in 4 weeks if no further activity occurs. If you think that's incorrect or this pull request should instead be reviewed, please simply write any comment. Even if closed, you can still revive the PR at any time or discuss it on the dev@druid.apache.org list. Thank you for your contributions.

@stale stale Bot added the stale label Apr 17, 2022
@capistrant
Copy link
Copy Markdown
Contributor Author

don't close

@stale
Copy link
Copy Markdown

stale Bot commented Apr 19, 2022

This issue is no longer marked as stale.

@stale stale Bot removed the stale label Apr 19, 2022
Copy link
Copy Markdown
Contributor

@a2l007 a2l007 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fix looks reasonable, Thanks for your patience!

@capistrant
Copy link
Copy Markdown
Contributor Author

Thanks @a2l007!

@capistrant capistrant merged commit 39e7191 into apache:master May 2, 2022
@abhishekagarwal87 abhishekagarwal87 added this to the 24.0.0 milestone Aug 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants