reconnect to the graphite after transient disconnect #2952
|
I'm 100% sure this won't help because it is a known problem of the Java Socket API; people work around it by checking : Another solution would be reconnecting in the |
|
@l15k4 thanks for your feedback. What do you think about this one? |
|
@b-slim This LGTM 👍 |
|
@himanshug and @fjy, can you please check this in? |
|
👍 |
|
👍 |
|
I just upgraded to So I'm afraid it still happens; I have to restart the middleManager for the emitter to connect again. |
|
Hey, As you can see, We suggest calling the And also calling it after calling That way, we'll actually be able to reconnect (albeit losing the current batch of metrics). |
|
@b-slim - you're quick, thank you! |
|
@itaiy I don't think that's needed. I will close it instead of flushing; I guess that should work.
|
@itaiy I have updated the fix, please check it out.
|
Guys, I'm afraid that the Graphite emitter only works as long as the Graphite server stays available. Once it disconnects from the server due to a network partition or a server restart, it never reconnects, so it stops emitting metrics. Druid would have to be restarted for it to connect again ... This happens with the plaintext Graphite protocol.
|
Ok, this finally happened |
This is a fix to issue #1978 (comment).
Try to reconnect when the connection is lost due to transient problems.
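
To illustrate the reconnect-on-failure idea discussed in this thread, here is a minimal sketch and not the actual emitter code. It assumes a hypothetical `Transport` interface standing in for the socket connection. `ReconnectDemo`, `ReconnectingSender` and `FakeTransport` are also invented for this example, and the metric lines are made-up samples in Graphite's plaintext `path value timestamp` format. On a write failure the sender closes the connection rather than flushing it, drops the current batch, and reconnects on the next send.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// All names in this file are hypothetical and exist only for this sketch.
class ReconnectDemo {

    /** Minimal connection abstraction; a real one would wrap java.net.Socket. */
    interface Transport {
        void connect() throws IOException;
        void write(String line) throws IOException;
        void close() throws IOException;
    }

    /** Sends batches of lines and reconnects lazily after a failure. */
    static final class ReconnectingSender {
        private final Transport transport;
        private boolean connected = false;

        ReconnectingSender(Transport transport) {
            this.transport = transport;
        }

        /** Returns true if the whole batch was written, false if it was dropped. */
        boolean sendBatch(List<String> lines) {
            try {
                if (!connected) {
                    transport.connect(); // first use, or reconnect after a failure
                    connected = true;
                }
                for (String line : lines) {
                    transport.write(line);
                }
                return true;
            } catch (IOException e) {
                // Close instead of flushing: the stream is broken, so the current
                // batch is dropped and the next call starts from a fresh connection.
                connected = false;
                try {
                    transport.close();
                } catch (IOException ignored) {
                    // Nothing useful can be done if closing a dead connection fails.
                }
                return false;
            }
        }
    }

    /** In-memory stand-in for a Graphite server that can be switched off. */
    static final class FakeTransport implements Transport {
        boolean serverUp = true;
        int connectCount = 0;
        final List<String> received = new ArrayList<>();

        @Override
        public void connect() throws IOException {
            if (!serverUp) {
                throw new IOException("connection refused");
            }
            connectCount++;
        }

        @Override
        public void write(String line) throws IOException {
            if (!serverUp) {
                throw new IOException("broken pipe");
            }
            received.add(line);
        }

        @Override
        public void close() {
            // No resources to release in the fake.
        }
    }

    public static void main(String[] args) {
        FakeTransport graphite = new FakeTransport();
        ReconnectingSender sender = new ReconnectingSender(graphite);

        sender.sendBatch(Arrays.asList("druid.query.time 12 1460000000"));
        graphite.serverUp = false; // simulate a transient outage
        sender.sendBatch(Arrays.asList("druid.query.time 15 1460000060")); // dropped
        graphite.serverUp = true;
        sender.sendBatch(Arrays.asList("druid.query.time 9 1460000120")); // reconnects

        System.out.println("connects=" + graphite.connectCount
                + " received=" + graphite.received.size());
    }
}
```

The trade-off is the one mentioned in the conversation: the batch in flight during the outage is lost, but the process no longer needs a restart to resume emitting. Note that with a real `java.net.Socket` the first write after the peer disappears may still succeed, so a failure can surface one batch later than in this in-memory fake.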