-
Notifications
You must be signed in to change notification settings - Fork 3.7k
Description
Search before asking
- I had searched in the issues and found no similar issues.
Version
2.0.2
What's Wrong?
When adding new follower node to cluster, the follower sync image from FE master timeout.
The cluster meta image is about 1G. By testing, it takes about 20s to synchronizing image files
The error log:
2023-10-23 09:35:57,278 WARN (main|1) [Env.getClusterIdAndRole():1041] current node is not added to the group. please add it first. sleep 5 seconds and retry, current helper nodes: [HostInfo{host='10.153.123.74', port=9010}] 2023-10-23 09:36:02,283 INFO (main|1) [Env.getFeNodeTypeAndNameFromHelpers():1198] get fe node type FOLLOWER, name fe_6aa20a8a_8380_4a27_9c65_f7f343cb8968 from 10.153.123.74:10.153.123.74:8030 2023-10-23 09:36:07,787 WARN (main|1) [DorisFE.start():209] java.io.IOException: java.net.SocketTimeoutException: Read timed out at org.apache.doris.catalog.Env.getNewImage(Env.java:1664) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.Env.getClusterIdAndRole(Env.java:1126) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.Env.initialize(Env.java:884) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.DorisFE.start(DorisFE.java:163) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.DorisFE.main(DorisFE.java:83) ~[doris-fe.jar:1.2-SNAPSHOT] Caused by: java.net.SocketTimeoutException: Read timed out at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) ~[?:1.8.0_121] at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62) ~[?:1.8.0_121] at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) ~[?:1.8.0_121] at java.lang.reflect.Constructor.newInstance(Constructor.java:423) ~[?:1.8.0_121] at sun.net.www.protocol.http.HttpURLConnection$10.run(HttpURLConnection.java:1926) ~[?:1.8.0_121] at sun.net.www.protocol.http.HttpURLConnection$10.run(HttpURLConnection.java:1921) ~[?:1.8.0_121] at java.security.AccessController.doPrivileged(Native Method) ~[?:1.8.0_121] at sun.net.www.protocol.http.HttpURLConnection.getChainedException(HttpURLConnection.java:1920) ~[?:1.8.0_121] at sun.net.www.protocol.http.HttpURLConnection.getInputStream0(HttpURLConnection.java:1490) ~[?:1.8.0_121] at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1474) ~[?:1.8.0_121] at org.apache.doris.master.MetaHelper.getResponse(MetaHelper.java:130) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.master.MetaHelper.getRemoteFile(MetaHelper.java:98) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.Env.getNewImage(Env.java:1657) ~[doris-fe.jar:1.2-SNAPSHOT] ... 4 more Caused by: java.net.SocketTimeoutException: Read timed out at java.net.SocketInputStream.socketRead0(Native Method) ~[?:1.8.0_121] at java.net.SocketInputStream.socketRead(SocketInputStream.java:116) ~[?:1.8.0_121] at java.net.SocketInputStream.read(SocketInputStream.java:171) ~[?:1.8.0_121] at java.net.SocketInputStream.read(SocketInputStream.java:141) ~[?:1.8.0_121] at java.io.BufferedInputStream.fill(BufferedInputStream.java:246) ~[?:1.8.0_121] at java.io.BufferedInputStream.read1(BufferedInputStream.java:286) ~[?:1.8.0_121] at java.io.BufferedInputStream.read(BufferedInputStream.java:345) ~[?:1.8.0_121] at sun.net.www.http.HttpClient.parseHTTPHeader(HttpClient.java:704) ~[?:1.8.0_121] at sun.net.www.http.HttpClient.parseHTTP(HttpClient.java:647) ~[?:1.8.0_121] at sun.net.www.protocol.http.HttpURLConnection.getInputStream0(HttpURLConnection.java:1569) ~[?:1.8.0_121] at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1474) ~[?:1.8.0_121] at sun.net.www.protocol.http.HttpURLConnection.getHeaderField(HttpURLConnection.java:2982) ~[?:1.8.0_121] at org.apache.doris.master.MetaHelper.getRemoteFile(MetaHelper.java:93) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.Env.getNewImage(Env.java:1657) ~[doris-fe.jar:1.2-SNAPSHOT] ... 4 more
What You Expected?
The new follower can synchronize image file from master
How to Reproduce?
No response
Anything Else?
No response
Are you willing to submit PR?
- Yes I am willing to submit a PR!
Code of Conduct
- I agree to follow this project's Code of Conduct