KAFKA-12380: Executor in Connect's Worker is not shut down when the worker is by sridhav · Pull Request #10337 · apache/kafka

sridhav · 2021-03-17T02:32:35Z

More detailed description of your change,
if necessary. The PR title and PR message become
the squashed commit message, so use a separate
comment to ping reviewers.

When the worker is stopped, it does not shutdown this executor.

Summary of testing strategy (including rationale)
for the feature or bug fix. Unit and/or integration
tests are expected for any behaviour change and
system tests should be considered for larger changes.

The following tests are run:

./gradlew connect:test
./gradlew connect:unitTest
./gradlew connect:integrationTest

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

sridhav · 2021-03-17T02:36:05Z

@rhauch can you please review?

…pping the execution of all in-progress and queued tasks

rhauch

Thanks for this simple PR, @sridhav. A pretty minor question/suggestion below.

Also, it's probably worth pointing out a potential edge case that comes with explicitly closing this executor. Recall that this worker is only called by the herder via the herder's enqueued requests. When the herder is stopped/halted, the herder first cleans out this request queue (failing any outstanding requests) and then stops its services, which includes the worker. That means that when the worker is finally stopped, there should be no more requests to start connectors or tasks. And since those are the only methods that potentially submit work to this executor, we should never submit work to the executor once it is shutdown, and thus those executor.submit(...) calls don't need to handle the RejectedExecutionHandler exceptions.

When the herder stops, it also stops its assigned connectors and tasks, and those requests wait until the connectors and tasks are actually stopped. This means that the Worker's executor should have no running tasks when the worker is stopped and it calls executor.shutdownNow(). However, if the executor were still running tasks when the shutdownNow() method is called, that method may attempt to interrupt those running threads, which would cause exceptions in those runnables of the connector and task, though WorkerTask.run() catches all exceptions and only propagates Errors. But none of that should happen, though, since the herder stops all running connectors and tasks before stopping the worker.

While these should always be the case, is there any way of asserting that with a unit test?

rhauch · 2021-03-17T19:49:02Z


        offsetBackingStore.stop();
        metrics.stop();
+        stopExecutor();


Are there advantages of putting this simple if-check in a separate methods? Would it be simpler and more straightforward to just do the check here:

Suggested change

stopExecutor();

if (executor != null) {

executor.shutdownNow();

}

and then remove the stopExecutor() method?

There is already precedence for an if-check a few lines above.

rhauch · 2021-03-17T20:06:38Z

Also, there are 24 failed unit tests on the different jobs, of the form:

java.lang.AssertionError: 
  Unexpected method call ExecutorService.shutdownNow():

Please make sure you run the tests locally. The WorkerTest mocks the executor service, and thus it is checking that only the methods specified in the tests are called, so shutdownNow() is unexpected. You'll need to add that call as expected.

showuon · 2022-04-01T03:22:44Z

@sridhav , are you still interested in completing this PR? There's a duplicated PR (#11955 ) that is fixing the same issue. I'd like to make sure if you want to continue this PR or not. Thanks.

showuon · 2022-04-29T05:36:35Z

Close this PR because it's pending for a long time, and another PR already addressed this issue.

sridhav and others added 2 commits March 16, 2021 19:18

updated code to stop executor when worker stop is called

57cc618

Merge branch 'trunk' into KAFKA-12380

70e8eb0

sridhav added 2 commits March 16, 2021 19:39

remove redundant code

b691127

hard signal to destroy the ExecutorService immediately along with sto…

2a8836f

…pping the execution of all in-progress and queued tasks

rhauch reviewed Mar 17, 2021

View reviewed changes

kkonstantine added the connect label Mar 18, 2021

showuon mentioned this pull request Apr 1, 2022

KAFKA-12380 Executor in Connect's Worker is not shut down when the worker is #11955

Merged

3 tasks

showuon closed this Apr 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KAFKA-12380: Executor in Connect's Worker is not shut down when the worker is#10337

KAFKA-12380: Executor in Connect's Worker is not shut down when the worker is#10337
sridhav wants to merge 4 commits intoapache:trunkfrom
sridhav:KAFKA-12380

sridhav commented Mar 17, 2021

Uh oh!

sridhav commented Mar 17, 2021

Uh oh!

rhauch left a comment •

edited

Loading

Uh oh!

rhauch Mar 17, 2021

Uh oh!

rhauch commented Mar 17, 2021

Uh oh!

showuon commented Apr 1, 2022

Uh oh!

showuon commented Apr 29, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

-        stopExecutor();
+        if (executor != null) {
+            executor.shutdownNow();
+        }

Conversation

sridhav commented Mar 17, 2021

Committer Checklist (excluded from commit message)

Uh oh!

sridhav commented Mar 17, 2021

Uh oh!

rhauch left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rhauch Mar 17, 2021

Choose a reason for hiding this comment

Uh oh!

rhauch commented Mar 17, 2021

Uh oh!

showuon commented Apr 1, 2022

Uh oh!

showuon commented Apr 29, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rhauch left a comment •

edited

Loading