Change default segment loading to http#11760
Change default segment loading to http#11760Caroline1000 wants to merge 3 commits intoapache:masterfrom
Conversation
kfaraz
left a comment
There was a problem hiding this comment.
Thank you for the PR, @Caroline1000 !
Please add a description that explains the change and the reasons involved and update DruidCoordinatorConfigTest to verify the new default value.
|
Is HTTP based loading ready for prime time? I am curious about any at-scale testing that has been done to verify HTTP based loading is performing as expected. Also, whether all major functional issues with it are fixed before we make it the default. I see at least one open bug right now. |
|
@samarthjain +1 on fixing #11717. If I'm not mistaken, that issue was first observed when multiple load rules were changed across different tiers, so hopefully that makes the bug less likely to run into(?) fwiw, I have seen http segment loading work without issue in many production environments (and actually have seen many problems related to curator loading) |
|
This pull request has been marked as stale due to 60 days of inactivity. It will be closed in 4 weeks if no further activity occurs. If you think that's incorrect or this pull request should instead be reviewed, please simply write any comment. Even if closed, you can still revive the PR at any time or discuss it on the dev@druid.apache.org list. Thank you for your contributions. |
|
I am not sure if http is ready for prime time, the problem with http arises when jetty http server runs low on threads. |
|
This issue is no longer marked as stale. |
|
ZK segment loading is broken right now. As of ~2 years ago, a PR was merged that breaks the order of segment loading and dropping via ZK, such that the assignment can enter into deadlocks when a cluster is mostly full. This wasn't widely an issue (personally, I only learned about it ~6 months ago) because the largest clusters (at least that I'm aware of) have all been using http segment assignment. #11717 has been merged. While it is and was a bug, it was a corner case that we've only seen in development environments and never actually saw it in a production environment. Every cluster I touch, I move from ZK assignment to HTTP assignment because my experience is that HTTP assignment is more stable. I'm +1 on this directionally, but the PR does need the tests fixed as Kashif suggested before it can be approved. |
|
Also, anyone else have this problem with We saw this often in our Kubernetes deployments. |
|
@didip , please create an issue for the |
Isn't this addressed with a combination of:
Or are you saying there is a risk of exhaustion on the outgoing side from the coordinator? |
|
closing now that #13092 has been merged |
Description
We have observed more stability with http segment loading than curator segment loading in production clusters. For example, we have observed that problems with zookeeper can lead to the inability to query realtime data.
This PR has: