k8s-based-ingestion: Wait for task lifecycles to enter RUNNING state before returning from KubernetesTaskRunner.start#17446

Merged
georgew5656 merged 5 commits into apache:master from georgew5656:waitForStart
Nov 8, 2024
Conversation

@georgew5656
Contributor

Description

It's possible for the KubernetesTaskRunner.start method to return before the threads in the exec pool (running KubernetesPeonLifecycle.join) have finished gathering information about Kubernetes jobs. This is a problem because other services on the overlord (such as supervisors) expect the KubernetesTaskRunner to have information about tasks once start has returned (for example, each task's location).

This diff adds a wait in the start() method for all discovered tasks to enter the RUNNING state. This state indicates that the KubernetesTaskRunner knows everything it needs to know about a task.
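A minimal sketch of that waiting behavior, using java.util.concurrent.CompletableFuture as a stand-in for the Guava SettableFuture the runner actually uses (the method and variable names here are illustrative, not the PR's exact code):

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class StartWaitSketch
{
  // One "started" future per discovered task; each completes once the
  // corresponding lifecycle has reached RUNNING (or failed to get there).
  static void waitForTasksToStart(List<CompletableFuture<Boolean>> startedFutures, long timeoutMs)
  {
    CompletableFuture<Void> all =
        CompletableFuture.allOf(startedFutures.toArray(new CompletableFuture[0]));
    try {
      all.get(timeoutMs, TimeUnit.MILLISECONDS);
    }
    catch (TimeoutException e) {
      // Give up waiting rather than block the overlord's start() forever.
    }
    catch (Exception e) {
      Thread.currentThread().interrupt();
    }
  }

  public static void main(String[] args)
  {
    CompletableFuture<Boolean> started = CompletableFuture.completedFuture(true);
    waitForTasksToStart(List.of(started), 100L);
    System.out.println("started=" + started.getNow(false));
  }
}
```

The key property is that start() blocks only up to a timeout, so a task stuck short of RUNNING cannot wedge the overlord's startup.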

Release note

Bugfix that helps mitigate unexpected behavior when running k8s based ingestion during overlord restarts.

Key changed/added classes in this PR
  • KubernetesTaskRunner
  • KubernetesWorkItem
  • KubernetesPeonLifecycle

This PR has:

  • been self-reviewed.
  • added documentation for new or modified features or behaviors.
  • a release note entry in the PR description.
  • added Javadocs for most classes and all non-trivial methods. Linked related entities via Javadoc links.
  • added or updated version, license, or notice information in licenses.yaml
  • added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
  • added unit tests or modified existing tests to cover new code paths, ensuring the threshold for code coverage is met.
  • added integration tests.
  • been tested in a test Druid cluster.

* @throws IllegalStateException
*/
protected synchronized TaskStatus join(long timeout) throws IllegalStateException
protected synchronized TaskStatus join(long timeout, SettableFuture<Boolean> taskActiveStatusFuture) throws IllegalStateException
Contributor


This is an odd pattern; it's more typical for a method to return ListenableFuture<T>. If the caller needs to chain that onto a SettableFuture<T>, the caller should attach a callback that sets the settable future.

Please also add javadoc explaining what the returned future means.
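The pattern suggested here, sketched with java.util.concurrent.CompletableFuture standing in for Guava's ListenableFuture/SettableFuture (the actual code uses Guava; these names are illustrative): the method returns a future it owns, and the caller bridges the result onto its own settable future with a callback instead of passing the settable future in as an argument.

```java
import java.util.concurrent.CompletableFuture;

public class FutureBridgeSketch
{
  // join() owns and returns the future instead of mutating one it was handed.
  static CompletableFuture<Boolean> join(long timeout)
  {
    // Lifecycle work elided; completes true once the task is joined.
    return CompletableFuture.completedFuture(true);
  }

  public static void main(String[] args)
  {
    // The caller keeps its own settable future and bridges onto it.
    CompletableFuture<Boolean> taskActiveStatus = new CompletableFuture<>();
    join(5000L).whenComplete((ok, err) -> {
      if (err != null) {
        taskActiveStatus.completeExceptionally(err);
      } else {
        taskActiveStatus.complete(ok);
      }
    });
    System.out.println("active=" + taskActiveStatus.join());
  }
}
```

This keeps ownership one-directional: the callee never needs to know who is listening for the result.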

Contributor Author


i guess it would make more sense to maybe split this up into a joinUntilStart method that calls back into the wait method, i can rewrite it a bit

Contributor Author


hmm doing it this way kinda requires the entire lifecycle logic to be rewritten, i think i have a simpler semi-hacky way of accomplishing the same thing.

Contributor Author


i added a function that just returns a future containing a boolean with info on whether the task started running successfully. in order to get this to work i made KubernetesPeonLifecycle a regular constructor argument for KubernetesWorkItem (instead of setting it with setKubernetesPeonLifecycle afterwards). this seems fine to me because KubernetesPeonLifecycle is just a simple POJO until you call run or join on it; i don't really see why it needs to be instantiated in doTask instead of run/joinAsync.

@georgew5656 georgew5656 requested a review from gianm November 7, 2024 00:16
throw e;
}
finally {
if (!taskStartedSuccessfullyFuture.isDone()) {
Contributor


taskStartedSuccessfullyFuture is not set to true in run as far as I can tell. is that intentional?

Contributor Author


run always calls into join (line 140) so it'll get set there

Contributor


ah, I see, OK.

}
catch (Exception e) {
final long numInitialized =
tasks.values().stream().filter(item -> item.getPeonLifeycle().getTaskStartedSuccessfullyFuture().isDone()).count();
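A sketch of the stricter count discussed below — filtering for futures that are both done and evaluated to true — with CompletableFuture standing in for the Guava future (the stream shape mirrors the excerpt above; names are illustrative):

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;

public class StartedCountSketch
{
  static long countStartedSuccessfully(List<CompletableFuture<Boolean>> futures)
  {
    return futures.stream()
                  // done, not failed, and resolved to true
                  .filter(f -> f.isDone() && !f.isCompletedExceptionally() && f.getNow(false))
                  .count();
  }

  public static void main(String[] args)
  {
    List<CompletableFuture<Boolean>> futures = List.of(
        CompletableFuture.completedFuture(true),   // joined successfully
        CompletableFuture.completedFuture(false),  // failed to join
        new CompletableFuture<>()                  // still joining
    );
    System.out.println("started=" + countStartedSuccessfully(futures));
  }
}
```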
Contributor


This is just looking for how many of the futures finished. Should it be looking for ones that finish and eval to true? I am wondering what the boolean is for.

Contributor


I don't see anything that recovers from failed join, so I suppose there's three cases?

  • future is done and true — the task was joined
  • future is done and false — error joining the task, i suppose it's never going to be joined? at this point should we mark it failed?
  • future is not yet done — the task is still being joined async

Contributor Author

@georgew5656 georgew5656 Nov 7, 2024


yeah i think looking for ones that eval to true makes sense, updated this.

on the second comment, the main logic of the task lifecycle is performed by the exec thread.
return tasks.computeIfAbsent(
    task.getId(),
    k -> new KubernetesWorkItem(
        task,
        exec.submit(() -> joinTask(task)),
        peonLifecycleFactory.build(task, this::emitTaskStateMetrics)
    )
);

so if the task fails to join the KubernetesWorkItem.getResult() future will complete as failed and the taskQueue will mark the task as failed and shut it down, i don't believe we need to do this in the start() method too.

for the new "did task start successfully" future, i updated the logic so the catch blocks in run/join catch any errors during run/join and mark the future as failed if the task didn't start properly. also added a test for that scenario.
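The error handling described here might look roughly like the following (a sketch only: CompletableFuture stands in for the Guava SettableFuture, and the watchJob hook is a hypothetical stand-in for the real k8s job-watching logic):

```java
import java.util.concurrent.CompletableFuture;

public class LifecycleJoinSketch
{
  final CompletableFuture<Boolean> taskStartedSuccessfully = new CompletableFuture<>();

  void join(Runnable watchJob)
  {
    try {
      watchJob.run();                        // wait for the peon pod to come up
      taskStartedSuccessfully.complete(true);
    }
    catch (RuntimeException e) {
      // Mark the future failed so start() stops waiting on this task.
      taskStartedSuccessfully.complete(false);
      throw e;
    }
    finally {
      // Safety net: never leave callers blocked on an unset future.
      if (!taskStartedSuccessfully.isDone()) {
        taskStartedSuccessfully.complete(false);
      }
    }
  }

  public static void main(String[] args)
  {
    LifecycleJoinSketch ok = new LifecycleJoinSketch();
    ok.join(() -> {});
    System.out.println("ok=" + ok.taskStartedSuccessfully.join());

    LifecycleJoinSketch bad = new LifecycleJoinSketch();
    try {
      bad.join(() -> { throw new RuntimeException("pod failed"); });
    }
    catch (RuntimeException ignored) {
    }
    System.out.println("bad=" + bad.taskStartedSuccessfully.join());
  }
}
```

Either way — normal completion, exception, or the finally-block safety net — the future is guaranteed to resolve, so start() never waits indefinitely on it.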

Contributor


Ok, thanks. I missed that there was a different future that did have a callback for failure.

@georgew5656 georgew5656 requested a review from gianm November 7, 2024 17:40
@georgew5656
Contributor Author

i don't think the failing unit test is related so i'm going to go ahead and merge this

@georgew5656 georgew5656 merged commit 5764183 into apache:master Nov 8, 2024
jtuglu1 pushed a commit to jtuglu1/druid that referenced this pull request Nov 20, 2024
…before returning from KubernetesTaskRunner.start (apache#17446)

* Add a wait on start() for task lifecycle to go into running

* handle exceptions

* Fix logging messages

* Don't pass in the settable future as a arg

* add some unit tests
@adarshsanjeev adarshsanjeev added this to the 32.0.0 milestone Jan 16, 2025