[WIP] Use Announcer for internal announcements + ClusterInfoResource #2242
guobingkun wants to merge 1 commit into apache:master
Conversation
Hi @guobingkun, is there a proposal related to this? At first glance this seems to be trying to make the cluster have some level of self-awareness. This is a non-trivial thing because there isn't really a solid concept of, or delineation between, "service" and "capacities". To me it would make more sense to have a host:port combination announce itself, maybe with some predefined set of capacities, but then have any state information be accessible through endpoints exposed at that host:port, with the API guaranteed by what type of capability the service has announced. I'm specifically cautious about having the Announcer store state.
@guobingkun I had put together some suggestions here in #2040; it doesn't look like any of those have been taken into account.
Had a chat about this PR between @nishantmonu51, @guobingkun, and myself. Regarding ONLY the conflicts between #2286 and this PR, the key thing #2286 needs is a way to accomplish this: https://github.com/druid-io/druid/pull/2286/files#diff-610c15be877b04e26b818259a34300e4R63 where something can be announced under some "key" and all things which have announced themselves under that "key" can be found by a discovery service of some kind.
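To make the "key" mechanism concrete, here is a minimal sketch of the announce/discover contract described above. All names (`KeyedDiscovery`, `InMemoryKeyedDiscovery`) are hypothetical and not from the Druid codebase; a real implementation would back this with ZooKeeper rather than an in-memory map.

```java
import java.util.Collections;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch of key-based discovery: anything can announce itself
// under a "key", and everything announced under that key can be looked up.
interface KeyedDiscovery {
    void announce(String key, String hostAndPort);
    Set<String> discover(String key);
}

// In-memory stand-in for what would really be a ZooKeeper-backed service.
class InMemoryKeyedDiscovery implements KeyedDiscovery {
    private final Map<String, Set<String>> announcements = new ConcurrentHashMap<>();

    @Override
    public void announce(String key, String hostAndPort) {
        announcements.computeIfAbsent(key, k -> ConcurrentHashMap.newKeySet()).add(hostAndPort);
    }

    @Override
    public Set<String> discover(String key) {
        return announcements.getOrDefault(key, Collections.emptySet());
    }
}
```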
I've spoken with @guobingkun about this in private a few times and I wanted to make this part of the conversation more open. One of my concerns with this PR is uncertainty about how useful it is to look at a group of nodes defined by a label set in their runtime.properties. For example: it is useful for a human operator to be able to discover the known nodes by whatever service name is defined in the runtime.properties... but those kinds of settings are not terribly helpful to a machine in the cluster. Likewise, knowing which Cli was used to launch a node is also only moderately helpful, since there could be any number of modules which extend the basic functionality. As such, there are two "classes" of service discovery which seem helpful: discovery oriented toward human operators (e.g. by configured service name), and discovery oriented toward machines in the cluster (by the purpose a node serves).

An example of this is an internal use case where we have one group of brokers which handles fast queries with prompt expectations on query time, and another group of brokers which handles very long-running queries with longer query time expectations. They are both run via CliBroker, but have different service names, different sets of properties, and serve very different purposes. In such a case the answer to the question "What are the brokers in my cluster?" becomes unclear, and you have to look at WHY you need to know that information. I caution against implementation questions like "What are the brokers in my cluster?", instead favoring questions like "What nodes in my cluster serve XXX purpose?".

The difference I see between this PR and #2286 is that #2286 attempts to solve the following: given that there exist cluster-wide resources identified by a unique ID, how do you provide a general framework around being able to post / get / delete those resources across the nodes, with the intended use case being something which is managing cluster state doing the post / get / delete requests?

The overlap is that #2286 relies on "service discovery" to answer "What nodes in my cluster serve XXX purpose?" So as long as this PR has a mechanism in the DruidServiceDiscovery to answer that question, a drop-in replacement for the Curator ServiceDiscovery used in #2286 should be trivial.
Can you clarify the difference between hostText, host, type, tier, name, and service?
I think the confusion comes from the fact that we currently use CuratorServiceAnnouncer for both internal and external announcement, and it writes DruidNode into ZooKeeper instead of DruidServerMetadata, even though it should've written DruidServerMetadata, since the latter is intended to contain all the necessary information.
The reason I have to add hostText, port, and service is that this information is in DruidNode but not in DruidServerMetadata (though it should've been). I think in the future we could deprecate host, since it's basically the concatenation of hostText + port.
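A tiny illustration of the field relationship described above: `host` is just `hostText` and `port` joined with a colon, so it could be derived rather than stored. The `NodeAddress` class here is hypothetical, not actual Druid code.

```java
// Hypothetical illustration: the legacy "host" field is derivable from
// hostText + port, which is why it is a candidate for deprecation.
class NodeAddress {
    final String hostText;
    final int port;

    NodeAddress(String hostText, int port) {
        this.hostText = hostText;
        this.port = port;
    }

    // "host" as the concatenation of hostText + ":" + port
    String host() {
        return hostText + ":" + port;
    }
}
```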
@himanshug @cheddar have you guys reviewed this?
@fjy i haven't reviewed this and wasn't really plugged in. but, from the conversation in this PR and similar PRs around this, I believe @drcrallen @nishantmonu51 @xvrl should possibly review this. I can take a look at it from a code perspective but need to get an understanding of the discussions that happened and the conclusions reached.
@drcrallen I introduced the "capability" concept you mentioned, with realtime/historical nodes announcing their capability of serving a segment; please have a look.
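As a sketch of the "capability" idea being discussed: nodes announce a set of capabilities (e.g. a historical node announcing it can serve segments), and discovery can then filter nodes by capability rather than by node type. The `CapabilityRegistry` name and the `"serveSegments"` capability string are illustrative assumptions, not identifiers from this PR.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch of capability-based discovery: announce per-node
// capability sets, then look nodes up by the capability they serve.
class CapabilityRegistry {
    private final Map<String, Set<String>> byCapability = new HashMap<>();

    // e.g. a historical node announcing that it can serve segments
    void announce(String hostAndPort, Set<String> capabilities) {
        for (String capability : capabilities) {
            byCapability.computeIfAbsent(capability, c -> new TreeSet<>()).add(hostAndPort);
        }
    }

    // answers "what nodes in my cluster serve XXX purpose?"
    Set<String> nodesWith(String capability) {
        return byCapability.getOrDefault(capability, Collections.emptySet());
    }
}
```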
@guobingkun merge conflicts
@guobingkun is this still WIP? also merge conflicts
@fjy Yes, I will finish this up soon.
This PR fixes #2040.

(1) Makes Druid nodes (Historical, Realtime, Overlord, Coordinator, MiddleManager, Broker) use Announcer for internal announcements. In ZooKeeper you can find all Druid nodes' announcements under `/druid/announcement` (assuming the announcements path base is `/druid`).
(2) Under `/druid/announcement`, each node now announces itself under `/druid/announcement/{type}`.
(3) Introduces a "capability" concept inside Druid. For example, historical and realtime nodes have the capability of serving segments; at startup, they announce this capability under `/druid/announcement/capability`.
(4) Uses DruidServerDiscovery for internal Druid server discovery. This interface allows us not to rely on ZooKeeper to discover/distribute internal Druid server information.
(5) Allows Druid to make announcements on an external ZooKeeper host, which is set by `druid.zk.service.externalHost`.
(6) Adds a ClusterInfoResource to the Coordinator so that you can query for Druid nodes' information through a Coordinator endpoint.
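To visualize the ZooKeeper layout described above, here is a small helper sketching the path construction, assuming the announcements path base is `/druid`. The `AnnouncementPaths` class is a hypothetical illustration, not code from this PR.

```java
// Hypothetical helper showing the ZooKeeper announcement path layout
// described above, assuming an announcements path base of /druid.
class AnnouncementPaths {
    static final String BASE = "/druid/announcement";

    // per-type announcements, e.g. /druid/announcement/historical
    static String typePath(String nodeType) {
        return BASE + "/" + nodeType;
    }

    // where historical/realtime nodes announce segment-serving capability
    static String capabilityPath() {
        return BASE + "/capability";
    }
}
```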
Example:

```json
{
  "router": [],
  "realtime": [],
  "coordinator": [
    { "name": "localhost:8082", "host": "localhost:8082", "maxSize": 0, "type": "coordinator", "tier": "_default_tier", "priority": 0 },
    { "name": "localhost:9082", "host": "localhost:9082", "maxSize": 0, "type": "coordinator", "tier": "_default_tier", "priority": 0 }
  ],
  "historical": [
    { "name": "localhost:8081", "host": "localhost:8081", "maxSize": 50000000000, "type": "historical", "tier": "_default_tier", "priority": 0 },
    { "name": "localhost:9081", "host": "localhost:9081", "maxSize": 50000000000, "type": "historical", "tier": "second_tier", "priority": 0 }
  ],
  "broker": [],
  "overlord": [
    { "name": "localhost:8089", "host": "localhost:8089", "maxSize": 0, "type": "overlord", "tier": "_default_tier", "priority": 0 }
  ]
}
```

Todo: Add UTs
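The core of the response above is a grouping of known servers by node type. A minimal sketch of that grouping step follows; the `ServerInfo` and `ClusterInfo` names are illustrative stand-ins, not the actual classes in this PR.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch of how a ClusterInfoResource-style endpoint might build
// the response shown above: group discovered servers by their node type.
class ServerInfo {
    final String name;
    final String type;

    ServerInfo(String name, String type) {
        this.name = name;
        this.type = type;
    }
}

class ClusterInfo {
    // groups servers by type, e.g. {"coordinator": [...], "historical": [...]}
    static Map<String, List<ServerInfo>> groupByType(List<ServerInfo> servers) {
        Map<String, List<ServerInfo>> grouped = new TreeMap<>();
        for (ServerInfo server : servers) {
            grouped.computeIfAbsent(server.type, t -> new ArrayList<>()).add(server);
        }
        return grouped;
    }
}
```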