Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions docs/configuration/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -353,6 +353,7 @@ The following monitors are available:
|`org.apache.druid.segment.realtime.RealtimeMetricsMonitor`|Reports statistics on Realtime processes.|
|`org.apache.druid.server.metrics.EventReceiverFirehoseMonitor`|Reports how many events have been queued in the EventReceiverFirehose.|
|`org.apache.druid.server.metrics.QueryCountStatsMonitor`|Reports how many queries have been successful/failed/interrupted.|
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
|`org.apache.druid.server.metrics.QueryCountStatsMonitor`|Reports how many queries have been successful/failed/interrupted.|
|`org.apache.druid.server.metrics.QueryCountStatsMonitor`|Reports how many queries have succeeded, failed, or were interrupted.|

|`org.apache.druid.server.metrics.TaskCountStatsMonitor`|Reports how many tasks are success/failed/running/pending/waiting.|
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As someone new to the monitors, I think explaining why you would need this is helpful. Are there specific use cases where I should be monitoring the task counts? Maybe I want to set up alerting on failed jobs?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@suneet-s Thank you very much for your reply!

Some implementation classes are registered by default and we don't need to register them manually, such as CacheMonitor, JettyMonitor and ExecutorServiceMonitor. But some are not registered by default and need to be configured ourselves, such as QueryCountStatsMonitor and TaskCountStatsMonitor.
Docs: https://druid.apache.org/docs/latest/operations/metrics.html
PR: https://github.com/apache/druid/pull/6657/files

I found a description for QueryCountStatsMonitor, but not for TaskCountStatsMonitor. The description of TaskCountStatsMonitor might have been missed, so I added.

In our scenario, kafka has a large number of indexing tasks and a large amount of data that needs to be consumed with low latency. Therefore, task monitoring is very important.

We deployed the druid cluster with ambari, so monitoring and alerting were implemented based on ambari-metric-collector.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggest ... tasks are success ... -> ... tasks have been succeeded ...

Copy link
Copy Markdown

@sthetland sthetland Jun 10, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

... or "have been successful", to be consistency with the prior row in the table.

Even better though might be to expand out, for both rows:

Suggested change
|`org.apache.druid.server.metrics.TaskCountStatsMonitor`|Reports how many tasks are success/failed/running/pending/waiting.|
|`org.apache.druid.server.metrics.TaskCountStatsMonitor`|Reports how many tasks have succeeded, failed, or are currently in the running, pending, or waiting state.|

|`org.apache.druid.server.emitter.HttpEmittingMonitor`|Reports internal metrics of `http` or `parametrized` emitter (see below). Must not be used with another emitter type. See the description of the metrics here: https://github.com/apache/druid/pull/4973.|


Expand Down