Remove query status report from BE when query is cancelled normally #1489

morningman · 2019-07-17T01:33:10Z

When a query result reaches its limit, the Coordinator in FE sends a cancel
request to BE to cancel the query. On cancellation, BE reports the
query status to FE for debugging purposes. In this case the report is unnecessary
and generates too many logs.

So I added a CancelReason to distinguish 'normal' cancellation from
'internal error' cancellation. If a query is cancelled 'normally',
no status is reported to FE.

When a query reaches its limit, or the user actively cancels it, it is cancelled 'normally'.
Otherwise, the query is cancelled due to an internal error, which requires
a report from BE.
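
The rule described above can be sketched as a small predicate. This is only an illustration, not the code in this PR: `LIMIT_REACH` and `INTERNAL_ERROR` appear in the diff, while `USER_CANCEL`, the numeric values, and the function names `is_normal_cancel`, `should_report_on_cancel`, and `check_examples` are assumptions made for the example.

```cpp
#include <cassert>

// Hypothetical enum: LIMIT_REACH and INTERNAL_ERROR are taken from the diff;
// USER_CANCEL and the numeric values are assumptions for illustration.
enum class CancelReason {
    LIMIT_REACH = 1,
    USER_CANCEL = 2,
    INTERNAL_ERROR = 3,
};

// A cancellation is 'normal' when the query hit its limit or the user
// cancelled it actively.
inline bool is_normal_cancel(CancelReason reason) {
    return reason == CancelReason::LIMIT_REACH ||
           reason == CancelReason::USER_CANCEL;
}

// BE sends a status report to FE only when the cancellation is not normal.
inline bool should_report_on_cancel(CancelReason reason) {
    return !is_normal_cancel(reason);
}

// Usage: the two normal reasons suppress the report, an internal error keeps it.
inline void check_examples() {
    assert(!should_report_on_cancel(CancelReason::LIMIT_REACH));
    assert(!should_report_on_cancel(CancelReason::USER_CANCEL));
    assert(should_report_on_cancel(CancelReason::INTERNAL_ERROR));
}
```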

imay · 2019-07-17T01:40:39Z

gensrc/proto/internal_service.proto

 message PCancelPlanFragmentRequest {
    required PUniqueId finst_id = 1;
+    enum CancelReason {
+        LIMIT_REACH = 0;


Reserve 0; start from 1.

imay · 2019-07-17T01:44:31Z

gensrc/proto/internal_service.proto


 message PCancelPlanFragmentRequest {
    required PUniqueId finst_id = 1;
+    enum CancelReason {


I think it's better to move CancelReason out of PCancelPlanFragmentRequest, because a fragment can be cancelled in other ways. For example, when a fragment times out, the cancel thread will cancel that instance.

imay · 2019-07-17T01:47:15Z

fe/src/main/java/org/apache/doris/qe/Coordinator.java

+
+    // The reason of cancellation of this query.
+    // This is used for telling BE whether it need to report query status when being cancelled.
+    public enum CancelReason {


Since you have already defined this in protobuf, we should use that definition to avoid defining the same thing in different places.

imay · 2019-07-17T01:51:39Z

be/src/runtime/fragment_mgr.cpp

+Status FragmentExecState::cancel(const PCancelPlanFragmentRequest::CancelReason& reason) {
    std::lock_guard<std::mutex> l(_status_lock);
    RETURN_IF_ERROR(_exec_status);
+    if (reason != PCancelPlanFragmentRequest::INTERNAL_ERROR) {


I think it's better to use an equality check rather than an inequality check here, because when we add more error types later, they should be reported by default.
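
A minimal sketch of the difference, assuming the truncated branch body in the diff disables the status report. The reason `TIMEOUT` and both function names are hypothetical; they stand in for any reason added after this PR.

```cpp
#include <cassert>

// Hypothetical enum; TIMEOUT represents a reason added later.
enum class CancelReason {
    LIMIT_REACH = 1,
    INTERNAL_ERROR = 2,
    TIMEOUT = 3,
};

// Inequality form: every reason other than INTERNAL_ERROR suppresses the
// report, so a newly added reason is silenced without anyone deciding so.
inline bool should_report_not_equal(CancelReason reason) {
    if (reason != CancelReason::INTERNAL_ERROR) {
        return false;
    }
    return true;
}

// Equality form: only the explicitly listed normal reason suppresses the
// report, so a newly added reason is reported by default.
inline bool should_report_equal(CancelReason reason) {
    if (reason == CancelReason::LIMIT_REACH) {
        return false;
    }
    return true;
}

// Usage: the two forms agree on existing reasons and differ on the new one.
inline void compare_forms() {
    assert(should_report_not_equal(CancelReason::LIMIT_REACH) ==
           should_report_equal(CancelReason::LIMIT_REACH));
    assert(!should_report_not_equal(CancelReason::TIMEOUT));
    assert(should_report_equal(CancelReason::TIMEOUT));
}
```

With the equality form, silencing a new reason requires an explicit code change, which is the safer default for debugging.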

imay · 2019-07-17T01:54:17Z

fe/src/main/java/org/apache/doris/rpc/BackendServiceProxy.java


    public Future<PCancelPlanFragmentResult> cancelPlanFragmentAsync(
-            TNetworkAddress address, TUniqueId finstId) throws RpcException {
+            TNetworkAddress address, TUniqueId finstId, CancelReason cancelReason) throws RpcException {


Is this cancelReason unused?

imay · 2019-07-17T01:55:35Z

fe/src/main/java/org/apache/doris/qe/Coordinator.java

        for (BackendExecState backendExecState : backendExecStates) {
            TNetworkAddress address = backendExecState.getBackendAddress();
-            LOG.info("cancelRemoteFragments initiated={} done={} hasCanceled={} ip={} port={} fragment instance id={}",
+            LOG.info("cancelRemoteFragments initiated={} done={} hasCanceled={} ip={} port={} fragment instance id={}, reason: {}",


I think this log can be debug level

imay · 2019-07-17T06:03:26Z

gensrc/proto/internal_service.proto

    required PStatus status = 1;
 };

+enum PCancelReason {


PFragmentCancelReason?

imay

LGTM

morningman added 2 commits July 16, 2019 23:06

fix bug

6362aa8

imay requested changes Jul 17, 2019

morningman added 2 commits July 17, 2019 10:25

move CancelReason to PCancelReason

e6c5c9b

fix fe code

73ffba5

imay reviewed Jul 17, 2019

rename PCancelReason to PPlanFragmentCancelReason

6641e67

imay approved these changes Jul 17, 2019

morningman closed this Jul 19, 2019

morningman reopened this Jul 19, 2019

morningman merged commit 556299a into apache:master Jul 19, 2019

imay mentioned this pull request Sep 26, 2019

Release Notes 0.11.0 #1891

Closed
