DataSource segment size total reported does not match coordinator and historical

The size the coordinator reports for a historical at `/druid/coordinator/v1/servers` does not match what the historical itself reports, taken as the sum of its `segment/used` metric. The coordinator's number is much lower: historical servers that are >99% full are reported by the coordinator as only 91% full. I have confirmed that the sum of `segment/used` reported by the historicals matches the actual on-disk size of the segment data on those nodes.

This really throws off capacity planning. One side effect is that the historicals throw `Exception loading segment` ... `too large for storage` and fail to load the segment during that coordinator balancing round. This is particularly harmful when it happens during handoff, because the resources used by realtime indexing tasks cannot be freed!

The view kept by the coordinator regarding sizes on a historical node should be eventually consistent with the data emitted by the historical node itself.
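
For anyone who wants to check their own cluster, here is a rough Python sketch of the comparison described above. It is not taken from Druid itself. It assumes the `?simple` form of `/druid/coordinator/v1/servers` returns objects with `host`, `type`, `currSize`, and `maxSize`, and that you can collect the per-host sum of `segment/used` from your own metrics pipeline. The `historical_used` mapping and both function names are made up for illustration.

```python
import json
from urllib.request import urlopen


def fetch_coordinator_servers(coordinator_url):
    """Fetch the coordinator's view of each server.

    Assumption: the `?simple` response is a JSON list of objects that
    include `host`, `type`, `currSize`, and `maxSize`.
    """
    url = f"{coordinator_url}/druid/coordinator/v1/servers?simple"
    with urlopen(url) as resp:
        return json.load(resp)


def compare_sizes(coordinator_servers, historical_used):
    """Compare the coordinator's currSize with what each historical reports.

    historical_used: hypothetical dict mapping host -> summed `segment/used`
    bytes, gathered from wherever your historicals emit metrics.
    """
    rows = []
    for server in coordinator_servers:
        host = server["host"]
        max_size = server.get("maxSize", 0)
        # Skip non-historicals, hosts without metrics, and zero-capacity nodes.
        if server.get("type") != "historical" or host not in historical_used:
            continue
        if not max_size:
            continue
        coord = server["currSize"]
        actual = historical_used[host]
        rows.append({
            "host": host,
            "coordinator_pct": 100.0 * coord / max_size,
            "historical_pct": 100.0 * actual / max_size,
            "missing_bytes": actual - coord,
        })
    return rows
```

A positive `missing_bytes` means the historical holds more data than the coordinator thinks it does, which is the situation described in this issue.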


DataSource segment size total reported does not match coordinator and historical #3283
